BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017168
(376 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/367 (88%), Positives = 342/367 (93%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 143 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 202
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
AYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 203 AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 262
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVV
Sbjct: 263 IQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVV 322
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV
Sbjct: 323 WCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVS 382
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLA
Sbjct: 383 VGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLA 442
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPA
Sbjct: 443 GYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPA 502
Query: 370 PYAPLLN 376
PY PLL
Sbjct: 503 PYTPLLK 509
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/367 (88%), Positives = 342/367 (93%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 149 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 208
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
AYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 209 AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 268
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVV
Sbjct: 269 IQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVV 328
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV
Sbjct: 329 WCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVS 388
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLA
Sbjct: 389 VGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLA 448
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPA
Sbjct: 449 GYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPA 508
Query: 370 PYAPLLN 376
PY PLL
Sbjct: 509 PYTPLLK 515
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 679 bits (1753), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/366 (88%), Positives = 342/366 (93%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 62 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 121
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
AYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 122 AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 181
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVV
Sbjct: 182 IQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVV 241
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV
Sbjct: 242 WCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVS 301
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLA
Sbjct: 302 VGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLA 361
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPA
Sbjct: 362 GYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPA 421
Query: 370 PYAPLL 375
PY PLL
Sbjct: 422 PYTPLL 427
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/367 (86%), Positives = 339/367 (92%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE EL
Sbjct: 136 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAEL 195
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
AYLTGSPTKAE+L+RA+GIKREY ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 196 AYLTGSPTKAEVLDRADGIKREYEELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 255
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQKVSLARRFALVPLGPPLLAYSS CKAML AVD AV+LVVDRPYKAGE IVV
Sbjct: 256 IQSCVVHLQKVSLARRFALVPLGPPLLAYSSNCKAMLTAVDGAVELVVDRPYKAGEPIVV 315
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLL+NYGFVDEDNPYDR+ VEAALNTEDPQYQDKRMVAQRNGKLSVQVF V+
Sbjct: 316 WCGPQPNSKLLLNYGFVDEDNPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVY 375
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
AG+EKEA+SD+LPYLRLGYVSD SEMQSVISS GP+CPVSPCME+AVLDQL YF+ RLA
Sbjct: 376 AGKEKEAVSDILPYLRLGYVSDPSEMQSVISSQGPVCPVSPCMEQAVLDQLTVYFRTRLA 435
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GY ++SEDE ML D NL+PKKRVATQLVR+EKKML ACLQ T D+I LPD T+ PCPA
Sbjct: 436 GYCTSISEDELMLADPNLNPKKRVATQLVRLEKKMLKACLQATVDLINQLPDHTMPPCPA 495
Query: 370 PYAPLLN 376
PYAPLL
Sbjct: 496 PYAPLLK 502
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/366 (86%), Positives = 341/366 (93%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE EL
Sbjct: 155 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDEL 214
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAFVA
Sbjct: 215 DYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFVA 274
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAGESI V
Sbjct: 275 VQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELVVDRPYKAGESIAV 334
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTEDPQYQDKRMVAQRNG+LS+Q F+V+
Sbjct: 335 WCGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVY 394
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
AG+EKEA+ DMLPYLRLGYV+ SEMQSVISS GP+CPVSPCMERA+L+Q+ADYFK RLA
Sbjct: 395 AGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPVCPVSPCMERAMLEQVADYFKRRLA 454
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L++CL+VT D I LPD TVSPCPA
Sbjct: 455 GYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEVTIDFINQLPDHTVSPCPA 514
Query: 370 PYAPLL 375
PYAPLL
Sbjct: 515 PYAPLL 520
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/366 (83%), Positives = 336/366 (91%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLW ++EL
Sbjct: 136 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWLKSEL 195
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A
Sbjct: 196 DYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAA 255
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVV
Sbjct: 256 IQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVV 315
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+
Sbjct: 316 WCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVY 375
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCMERA LDQLADYFKARLA
Sbjct: 376 AGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCMERAALDQLADYFKARLA 435
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD T+SPCPA
Sbjct: 436 GYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQATTDFINQLPDHTISPCPA 495
Query: 370 PYAPLL 375
PYAPLL
Sbjct: 496 PYAPLL 501
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/367 (83%), Positives = 335/367 (91%), Gaps = 1/367 (0%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLWS++EL
Sbjct: 136 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWSKSEL 195
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A
Sbjct: 196 DYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAA 255
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVV
Sbjct: 256 IQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVV 315
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+
Sbjct: 316 WCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVY 375
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCMERA LDQLADYFKARLA
Sbjct: 376 AGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAALDQLADYFKARLA 435
Query: 310 GYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 368
GYP L+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD T+SPCP
Sbjct: 436 GYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQATIDFINQLPDHTISPCP 495
Query: 369 APYAPLL 375
APYAPLL
Sbjct: 496 APYAPLL 502
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/366 (82%), Positives = 332/366 (90%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNK SELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 135 AELLTTNKFSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 194
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
AYL GSP K EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF A
Sbjct: 195 AYLEGSPLKDEIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAA 254
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVV
Sbjct: 255 VQSCVVHLQNVSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVV 314
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPN+KLL NYGFVDEDN DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+
Sbjct: 315 WCGPQPNTKLLTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVY 374
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+SDM+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF RLA
Sbjct: 375 TGKEREAVSDMIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLA 434
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
YP TL+EDE+MLTD +L+PK+RVATQLVR+EKKML+ACLQ D+I LPD +VSPCPA
Sbjct: 435 AYPTTLAEDESMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPA 494
Query: 370 PYAPLL 375
PYAP L
Sbjct: 495 PYAPSL 500
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 300/367 (81%), Positives = 329/367 (89%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL
Sbjct: 148 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 207
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 208 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 267
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVV
Sbjct: 268 IQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVV 327
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V
Sbjct: 328 WCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVR 387
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+
Sbjct: 388 VGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLS 447
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T ED+A+L D +L P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPA
Sbjct: 448 GYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPA 507
Query: 370 PYAPLLN 376
PYAP L
Sbjct: 508 PYAPSLK 514
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 299/367 (81%), Positives = 328/367 (89%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL
Sbjct: 150 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 209
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 210 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 269
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVV
Sbjct: 270 IQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVV 329
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V
Sbjct: 330 WCGPQPNAKLLLNYGFVDEDNPYDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVR 389
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+C +SPCMERAVLDQLA+YF RL+
Sbjct: 390 VGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCTMSPCMERAVLDQLANYFMRRLS 449
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T ED+A+L D +L P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPA
Sbjct: 450 GYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILAACLTTTVDLLNQLPDTAISPCPA 509
Query: 370 PYAPLLN 376
PYAP L
Sbjct: 510 PYAPSLK 516
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 633 bits (1633), Expect = e-179, Method: Compositional matrix adjust.
Identities = 299/367 (81%), Positives = 328/367 (89%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL
Sbjct: 148 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 207
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 208 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 267
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVV
Sbjct: 268 IQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVV 327
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTE PQYQDKRMVAQRNGKLS QVF V
Sbjct: 328 WCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVR 387
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+
Sbjct: 388 VGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLS 447
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T ED+A+L D +L P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPA
Sbjct: 448 GYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPA 507
Query: 370 PYAPLLN 376
PYAP L
Sbjct: 508 PYAPSLK 514
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 606 bits (1562), Expect = e-171, Method: Compositional matrix adjust.
Identities = 284/367 (77%), Positives = 321/367 (87%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 139 AELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESEL 198
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 199 NYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 258
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+V
Sbjct: 259 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIV 318
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV
Sbjct: 319 WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVC 378
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA
Sbjct: 379 VGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 438
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPA
Sbjct: 439 DYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPA 498
Query: 370 PYAPLLN 376
P+AP L
Sbjct: 499 PFAPELK 505
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 605 bits (1561), Expect = e-171, Method: Compositional matrix adjust.
Identities = 284/367 (77%), Positives = 321/367 (87%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 170 AELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESEL 229
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 230 NYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 289
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+V
Sbjct: 290 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIV 349
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV
Sbjct: 350 WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVC 409
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA
Sbjct: 410 VGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 469
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPA
Sbjct: 470 DYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPA 529
Query: 370 PYAPLLN 376
P+AP L
Sbjct: 530 PFAPELK 536
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 605 bits (1560), Expect = e-170, Method: Compositional matrix adjust.
Identities = 284/367 (77%), Positives = 321/367 (87%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 143 AELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESEL 202
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 203 NYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 262
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+V
Sbjct: 263 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIV 322
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV
Sbjct: 323 WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVC 382
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA
Sbjct: 383 VGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 442
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPA
Sbjct: 443 DYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPA 502
Query: 370 PYAPLLN 376
P+AP L
Sbjct: 503 PFAPELK 509
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 285/367 (77%), Positives = 318/367 (86%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 150 AELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLWTESEL 209
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVA
Sbjct: 210 DYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVA 269
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAGE I+V
Sbjct: 270 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAGEPIIV 329
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q F V
Sbjct: 330 WCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQKFQVC 389
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EKE IS+MLPYLRLGY+SD EMQ ++SS G CPVSPC ERAVLDQL Y K+RLA
Sbjct: 390 VGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLA 449
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPA
Sbjct: 450 GYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTVSPCPA 509
Query: 370 PYAPLLN 376
YAP L
Sbjct: 510 LYAPNLK 516
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 603 bits (1555), Expect = e-170, Method: Compositional matrix adjust.
Identities = 282/366 (77%), Positives = 321/366 (87%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 146 AELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESEL 205
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 206 DYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 265
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++
Sbjct: 266 VQSCVVHLQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIII 325
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+
Sbjct: 326 WCGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVY 385
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EK+ +++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLA
Sbjct: 386 VGKEKQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 445
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPA
Sbjct: 446 GYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPA 505
Query: 370 PYAPLL 375
PYAP L
Sbjct: 506 PYAPEL 511
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 599 bits (1544), Expect = e-169, Method: Compositional matrix adjust.
Identities = 279/366 (76%), Positives = 321/366 (87%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 146 AELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESEL 205
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 206 DYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 265
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++
Sbjct: 266 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAGEPIII 325
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q F+V+
Sbjct: 326 WCGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQNFNVY 385
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EK+ +++MLPYLRLGY+SD EMQS++SS G CP+SPC ERAVLDQL Y ++RLA
Sbjct: 386 VGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYLESRLA 445
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T + I LPD TVSPCPA
Sbjct: 446 GYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTVSPCPA 505
Query: 370 PYAPLL 375
PYAP L
Sbjct: 506 PYAPEL 511
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 280/367 (76%), Positives = 317/367 (86%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 157 AELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESEL 216
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL GSP + E++ R EGIK+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVA
Sbjct: 217 DYLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVA 276
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+V
Sbjct: 277 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIV 336
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V
Sbjct: 337 WCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVC 396
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G+EK+ IS+MLPYLRLGY+SD EMQ ++SS G CPVSPC ERAVLDQL Y K+RLA
Sbjct: 397 VGKEKQTISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLA 456
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP L EDEAML D +L PKK VAT+LVR+EKKML+ CL+ + I LPD TVSPCPA
Sbjct: 457 GYPTNLDEDEAMLADGSLEPKKEVATRLVRLEKKMLHGCLEAANEFISGLPDHTVSPCPA 516
Query: 370 PYAPLLN 376
YAP L
Sbjct: 517 LYAPELK 523
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 596 bits (1536), Expect = e-168, Method: Compositional matrix adjust.
Identities = 281/367 (76%), Positives = 319/367 (86%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL
Sbjct: 141 AELLTTNKLSELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEEL 200
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
Y TGSP K +LER GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVA
Sbjct: 201 KYFTGSPMKEIMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVA 260
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQ V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVV
Sbjct: 261 VQSCVVHLQNVNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVV 320
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPN++LL+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++
Sbjct: 321 WCGPQPNARLLLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIY 380
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
GREKEA+ DMLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLA
Sbjct: 381 IGREKEAVLDMLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLA 440
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
GYP T SEDE +L D +PK++VATQLV +EKKMLN+CL ++I LPD+ V+PCP+
Sbjct: 441 GYPTTASEDEIVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPS 500
Query: 370 PYAPLLN 376
PY+P+L
Sbjct: 501 PYSPILK 507
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 556 bits (1433), Expect = e-156, Method: Compositional matrix adjust.
Identities = 268/335 (80%), Positives = 288/335 (85%), Gaps = 18/335 (5%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL
Sbjct: 148 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 207
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 208 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 267
Query: 130 VQSCVVHLQ------------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 171
+QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD
Sbjct: 268 IQSCVVHLQVVLVASSNLDCYASSCTQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDG 327
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDK
Sbjct: 328 AVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDK 387
Query: 232 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 291
RMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPC
Sbjct: 388 RMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPC 447
Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 326
MERAVLDQLA+YF RL+GYP T ED+A+ N
Sbjct: 448 MERAVLDQLANYFMRRLSGYPTTPKEDDALEASCN 482
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 252/370 (68%), Positives = 298/370 (80%), Gaps = 4/370 (1%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+ EL
Sbjct: 62 AELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEEL 121
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFV
Sbjct: 122 DEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFV 181
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
AVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE I+
Sbjct: 182 AVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQIL 241
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F +
Sbjct: 242 VWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQI 301
Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+ R+
Sbjct: 302 FKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERI 361
Query: 309 AGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS- 365
A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V LPD +V+
Sbjct: 362 AKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAP 421
Query: 366 PCPAPYAPLL 375
PC + P L
Sbjct: 422 PCCGDFVPKL 431
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 252/370 (68%), Positives = 298/370 (80%), Gaps = 4/370 (1%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+ EL
Sbjct: 62 AELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEEL 121
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFV
Sbjct: 122 DEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFV 181
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
AVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE I+
Sbjct: 182 AVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQIL 241
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F +
Sbjct: 242 VWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQI 301
Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+ R+
Sbjct: 302 FKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERI 361
Query: 309 AGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS- 365
A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V LPD +V+
Sbjct: 362 AKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAP 421
Query: 366 PCPAPYAPLL 375
PC + P L
Sbjct: 422 PCCGDFVPKL 431
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 248/367 (67%), Positives = 291/367 (79%), Gaps = 4/367 (1%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL
Sbjct: 159 AELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREEL 218
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
Y TGS K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFV
Sbjct: 219 NEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFV 278
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
AVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I
Sbjct: 279 AVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIA 338
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
VWCGPQPNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F +
Sbjct: 339 VWCGPQPNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQI 398
Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
+ G+E EA+ DMLPY+RL +++D EM++V + GP+CPVS C ERAVL+QL YF+ RL
Sbjct: 399 YKGKEMEAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRL 458
Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 368
AGY ++ + + D + KKRVA +L+ +EK +L L ++I LPD +SPC
Sbjct: 459 AGYKSSHATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCI 515
Query: 369 APYAPLL 375
PY P L
Sbjct: 516 GPYLPNL 522
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 243/367 (66%), Positives = 284/367 (77%), Gaps = 4/367 (1%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL
Sbjct: 63 AELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPEEL 122
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
Y TGS K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFV
Sbjct: 123 NEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFV 182
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
AVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD YK G+ I
Sbjct: 183 AVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDPIA 242
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
VWCGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F +
Sbjct: 243 VWCGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTFQI 302
Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
+ G+E EA+ DMLPY+RLG+++D E+++V + P+C VS C ERAVL+Q+ +F+ RL
Sbjct: 303 YKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFERRL 362
Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 368
AGY S D D K+ VA +L+ +EK +L L ++I LPD +SPC
Sbjct: 363 AGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISPCI 419
Query: 369 APYAPLL 375
PY P L
Sbjct: 420 GPYLPNL 426
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 218/274 (79%), Positives = 248/274 (90%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTTNKLSELACLALYLMYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 152 AELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESEL 211
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL GSP + E++ R EGIK+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVA
Sbjct: 212 DYLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVA 271
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+V
Sbjct: 272 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIV 331
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
WCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V
Sbjct: 332 WCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVC 391
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 283
G+EK+ IS+MLPYLRLGY+SD EMQ ++SS G
Sbjct: 392 VGKEKQTISEMLPYLRLGYISDPDEMQCILSSEG 425
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/242 (71%), Positives = 206/242 (85%)
Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
++ QKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
Q NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKE 129
Query: 254 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
K+ +++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPT 189
Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 373
TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP
Sbjct: 190 TLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAP 249
Query: 374 LL 375
L
Sbjct: 250 EL 251
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 166/240 (69%), Positives = 203/240 (84%)
Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129
Query: 254 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
K+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189
Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 373
TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 190 TLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 182/341 (53%), Positives = 240/341 (70%), Gaps = 9/341 (2%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AELLTT+KLSELACL LYLMYEKK G++S W +I+ELDR +GRGQ+ +SPLLW E ++
Sbjct: 80 AELLTTDKLSELACLTLYLMYEKKNGRQSVWYEFIKELDRIQGRGQMGAKSPLLWDEGQV 139
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
YL GSP AEI ER +GI++EY ELDTVWFMAGSLF+ YPYD+PTEAF+ ++F+Q F
Sbjct: 140 DEYLAGSPLVAEIKERLKGIEKEYAELDTVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFA 199
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGES 186
AVQ+ VVHLQ V L++RFALVPLGPPLL+YSS KAML +A VQL VDR Y GE
Sbjct: 200 AVQASVVHLQGVPLSKRFALVPLGPPLLSYSSTAKAMLTYNREAKEVQLAVDRSYTKGEP 259
Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
I WCGPQPN +LL+NYG V ++NP+D++ + L DP +Q KR V Q+N + Q F
Sbjct: 260 IEAWCGPQPNRRLLLNYGIVTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTF 319
Query: 247 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYF 304
+ R+K +LPYLRL + +D + ++ +++L C P+SP ER VL QLA +
Sbjct: 320 QLQ--RDKGLPELLLPYLRLAHCTDAASLK--LATLDTCCAAPISPENERTVLHQLASHL 375
Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ RL Y T EDE ++ P+++VA +L+R+EK +L
Sbjct: 376 QDRLDRYKTTCEEDEVIIRSTTAGPRQKVAARLLRIEKAIL 416
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 166/269 (61%), Positives = 203/269 (75%), Gaps = 29/269 (10%)
Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129
Query: 254 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
K+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189
Query: 314 TLSEDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKM 344
TL+EDEAM L D +L PK+ VAT+LVR+EKKM
Sbjct: 190 TLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKM 249
Query: 345 LNACLQVTADMIMLLPDVTVSPCPAPYAP 373
L+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 250 LHACLQATNEFITDLPDHTVSPCPAQYAP 278
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 327 bits (839), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 169/341 (49%), Positives = 228/341 (66%), Gaps = 10/341 (2%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AEL+TT KLSELACL LYL YEKK+GK+ W +I+ELDR +GRG +SPLLW E +
Sbjct: 71 AELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLWDEGQA 130
Query: 70 A-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
A L GSP EI R +GI++EY ELDTVW++AGSLF + P+ PTE F+F +F+QAF
Sbjct: 131 AELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVFRQAFT 190
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGES 186
AVQS VVHLQ V+L +RFALVP+GPPLL YSS KAML ++ V+L VDR Y+ GE+
Sbjct: 191 AVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAYQPGEA 250
Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
++ WCGPQPNS+LLINYG VDE NPYD+L + + ++DP Y+ KR G + Q F
Sbjct: 251 VLAWCGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLSTQQTF 310
Query: 247 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI--SSLGPICPVSPCMERAVLDQLADYF 304
+ A A +LPYLRL + + ++++ V GP+ P + E VL+QL +
Sbjct: 311 QLQAAASLPA--QLLPYLRLVHSTREADVEGVKWEEEAGPVAPEN---ELTVLNQLITHL 365
Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ R + Y T+ EDEA++ D P+ VA +L+++EK +L
Sbjct: 366 RLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 169/342 (49%), Positives = 220/342 (64%), Gaps = 9/342 (2%)
Query: 8 YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
+ AELLTT+KLSELACLALYLMYEKK KKSFW PYI+ELD+Q+ RG A ESPLLW +
Sbjct: 106 FVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESPLLWGDQ 165
Query: 68 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
EL + L GSP + +R GI++EY LDTVWFMAGSLF +YP+D+PTE F+FE+F+QA
Sbjct: 166 ELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSFELFQQA 225
Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAG 184
F VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS K M+ +D +V+LVV P +AG
Sbjct: 226 FAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVSGPVEAG 285
Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQRNGKLS 242
+ WCGPQPNS+LL+NYG VDE NP+D+L L T DP + KR V G +
Sbjct: 286 RPVAAWCGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLSEAGLAT 345
Query: 243 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 302
Q F V R +LPY+ L + ++ SV S +E A L L
Sbjct: 346 QQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDTAGHDRELEAAALAALMA 401
Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
Y + R A Y L D ++ D + P+++VA +L ++EK +
Sbjct: 402 YVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 130/160 (81%), Positives = 148/160 (92%)
Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248
Query: 277 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308
Query: 337 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 376
LVR+EKK+LNACLQ TAD+I LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPILK 348
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 121/128 (94%), Positives = 125/128 (97%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE ELA
Sbjct: 63 ELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAELA 122
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
YLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+
Sbjct: 123 YLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAI 182
Query: 131 QSCVVHLQ 138
QSCVVHLQ
Sbjct: 183 QSCVVHLQ 190
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 108/128 (84%), Positives = 117/128 (91%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 1 ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61 YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120
Query: 131 QSCVVHLQ 138
QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 167/359 (46%), Gaps = 27/359 (7%)
Query: 8 YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
+ A+++T + LSE +AL ++EK +GK+SFW YI L V LW+E
Sbjct: 165 FGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-----EVYPTYLWAEE 218
Query: 68 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
+LA L GSP A ++ EY ++ L ++P +P E T+E F+ AF
Sbjct: 219 DLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEILPREVHTYEEFQWAF 272
Query: 128 VAVQSCVVHLQKVSLARRFALVPL------GPPLLAY-SSKCKAMLAAVDDAVQLVVDRP 180
+ S + L +S ALVP P +Y ++ + + + D V + DR
Sbjct: 273 AMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLFFSKTDEVVVYADRS 332
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
YK E + + GP+ NS LL+ YGF + NPY+ + V +L+ D Y+ K+ G
Sbjct: 333 YKKMEQVYISYGPKGNSDLLLLYGFSLDRNPYNSVDVTVSLDENDELYERKKAFLSEAGL 392
Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 300
+ F ++ R + ++L YLRL ++ + L + E VLD L
Sbjct: 393 PPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLRGRTLEDLSFEKKQTDVNELMVLDSL 449
Query: 301 ADYFKARLAGYPATLSEDEAMLTDYN----LHPKKRVATQLVRMEKKMLNACL-QVTAD 354
+ KA +AGYP T +D ++ D L +R+A + R EK +L + VT D
Sbjct: 450 VEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQRMAVKHRRQEKVILRRTIAAVTKD 508
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 158/335 (47%), Gaps = 27/335 (8%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPT 77
++ L L+LMYE+ G+KS W PY++ SP+LW+ E + L GSP
Sbjct: 151 GDVIGLTLWLMYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPA 203
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
E+ +R+ ++ EY +L S F + P P E F+ E FK AF + S V+L
Sbjct: 204 LEEVQQRSAALEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYL 256
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQ 194
L FALVP L + + +A L + D AV VDR YK GE + G +
Sbjct: 257 PSADL---FALVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRER 312
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
N+ LLI YGFVDE+N D L +E L D K+ + Q+ S Q F ++ R
Sbjct: 313 SNADLLITYGFVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDR-- 370
Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
+ +L Y+RL + D + ++ + + E L L + +L Y
Sbjct: 371 -FPTQLLTYMRLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGG 427
Query: 315 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
+ ++ +L + + ++RVA QL EKK+L + +
Sbjct: 428 VDDEIRLLKNKKISQRERVAAQLRLCEKKILTSTM 462
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 166/341 (48%), Gaps = 39/341 (11%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTK 78
E + L L+YEK +G SF+ PY+ L L +PL LWS +L L GSPT
Sbjct: 165 EYVSIGLLLLYEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTL 218
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
+ + + + REY ++ ++ Q P + ++ F+ F+ AF + S +
Sbjct: 219 SACEQLRDKLLREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFP 271
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVW 190
++R ALVP LL +S C A + V +AV + VDR Y+ E + V
Sbjct: 272 S---SKRIALVPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVS 326
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
GP+ N +LL+ YGF E NP+D + + L+ DP Y +K + + GK +Q F ++
Sbjct: 327 YGPRSNQELLLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYM 386
Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
R +M +LR + +++Q+ ++ VS E + LD+L +Y +L
Sbjct: 387 DRYP---VEMAEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRK 438
Query: 311 YPATLSEDEAMLTDY----NLHPKKRVATQLVRMEKKMLNA 347
YP +L +DE ++ D L +R+A + EK++L+A
Sbjct: 439 YPTSLEDDEKIIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 168/369 (45%), Gaps = 52/369 (14%)
Query: 16 NKLSELACLALYLMYEKK-QGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAY 71
++++E +A +L+YE+ +G++S W PY+ ++D V W + +LA+
Sbjct: 166 SEINEYLAMACHLIYERNVRGEESPWKPYLDVLPDIDE--------VNPTFTWPDEDLAF 217
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVA 129
L GSP A ++REY+ L + G L +YP P EAF F+ ++ AF
Sbjct: 218 LNGSPVIAATKSLQMKLRREYDAL-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTM 272
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA----------MLAAVDDAVQLVVDR 179
+ S + L+ + ALVP L+ +S +A + + D+ V L DR
Sbjct: 273 LFSRAIRLRSLKQGETLALVPYAD-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADR 331
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE--------------- 224
Y+ E I + GP+ N++LL+ YGF E NP++ + V ++
Sbjct: 332 GYRRMEQIYISYGPKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPV 391
Query: 225 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 284
DP ++K ++ G+ + F +A R +ML YLRL ++ ++
Sbjct: 392 DPLAEEKAAFLEQVGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDY 448
Query: 285 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRM 340
+S E AVL + +L+ YP + +D A++ D +L +R+A + R
Sbjct: 449 SRTISLGNEAAVLTSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRN 508
Query: 341 EKKMLNACL 349
EK++L +
Sbjct: 509 EKRLLKRTI 517
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 152/333 (45%), Gaps = 27/333 (8%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-GSPTK 78
EL LAL+LMYE+++ + S W PY++ + SPLLW + E L GS
Sbjct: 117 ELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLLWEQEEQEELLRGSSAL 169
Query: 79 AEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
A++ ++ +++ ++ L DT+ + D P E FTF FK AF V S V+L
Sbjct: 170 AKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSAFKAAFSVVLSRAVYL 220
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPN 196
L FALVP G + SS+ + V+L VD+ YK G+ + + +
Sbjct: 221 PSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTS 277
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
+ LI YGF+DE + D + +E L + D KR + Q G Q F V+ R
Sbjct: 278 ADFLIRYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNR---F 334
Query: 257 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 316
+ +L Y RL + D+ + I V E L L + +L + T+
Sbjct: 335 PTQLLTYTRLARIQDSGLFAKITFEKDLI--VCQTNEYETLMLLMADCRTKLLSFSDTME 392
Query: 317 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
+D L NL K+RVA QL EK++L +
Sbjct: 393 DDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 117/246 (47%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L N + + LA YL+ E K S W YI L RQ S L WS+ E
Sbjct: 128 AGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PYSLLYWSQAE 180
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+ M +F +YP P E F E FK +F
Sbjct: 181 LDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVFNIESFKWSF 234
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGE 185
+ S +V L S+ + ALVP ++ +S + + L + +V DRPY+ GE
Sbjct: 235 GILFSRMVRLP--SMDGKNALVPWAD-MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGE 291
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV ++ NP D + + +L D Y++K + ++ G
Sbjct: 292 QVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGS 351
Query: 244 QVFHVH 249
Q F +
Sbjct: 352 QCFPIR 357
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 117/246 (47%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L N + + LA YL+ E K S W YI L RQ S L WS+ E
Sbjct: 125 AGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PYSLLYWSQAE 177
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+ M +F +YP P E F E FK +F
Sbjct: 178 LDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVFNIESFKWSF 231
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGE 185
+ S +V L S+ + ALVP ++ +S + + L + +V DRPY+ GE
Sbjct: 232 GILFSRMVRLP--SMDGKNALVPWAD-MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGE 288
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV ++ NP D + + +L D Y++K + ++ G
Sbjct: 289 QVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGS 348
Query: 244 QVFHVH 249
Q F +
Sbjct: 349 QCFPIR 354
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 113/243 (46%), Gaps = 21/243 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L + + LA YL+ E K S W YI L RQ S L W+ E
Sbjct: 134 AGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAE 186
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+L +F +YP+ P E F E FK +F
Sbjct: 187 LDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVFNMETFKWSF 240
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ R ALVP +L +SS+ + L V DRPY+ GE
Sbjct: 241 GILFSRLVRLP--SMDGRVALVPWAD-MLNHSSEVETFLDYDKSSKGVVFTTDRPYQPGE 297
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV + NP D + + +L D Y++K +++G
Sbjct: 298 QVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLSVS 357
Query: 244 QVF 246
Q F
Sbjct: 358 QCF 360
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 151/343 (44%), Gaps = 34/343 (9%)
Query: 15 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
+ +E +AL L+ E+ +G +SFW YI L G + W ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216
Query: 75 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
SP + ++ E+ A L D E FTFE ++ AF + S
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265
Query: 135 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 184
+ L+ A+VP P + + + A +D V L DR YK
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325
Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSV 243
E + + GP+ N+ LL+ YGF + NP++ + + + +D Y K A+ G+ +S
Sbjct: 326 EQVFISYGPKSNADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSS 385
Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLAD 302
F ++A R + +++ +LR+ + D + + + +S E AVLD + D
Sbjct: 386 AAFPLYADRFPD---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRD 442
Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
A +A YPA +D + D L +R+A +LV EK++L
Sbjct: 443 ACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 163/372 (43%), Gaps = 50/372 (13%)
Query: 18 LSELACLALYLMYEK-KQGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLT 73
++E +A L++EK +G+ SF+ Y+ E+D V W + +LA+L
Sbjct: 247 INEYLAVACQLIHEKFVKGEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLE 298
Query: 74 GSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
GSP A ++REY++L + G L ++P P E +TFE ++ AF +
Sbjct: 299 GSPVVAATRSLQMKLRREYDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLF 353
Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA----------VQLVVDRPY 181
S + L+ + + R A+VP L+ +S+ +A + A + V L DR Y
Sbjct: 354 SRAIRLRNLQVGERLAMVPYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGY 412
Query: 182 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-------------TEDPQY 228
+ E + + G + N++LL+ YGF E NPY+ + V ++ EDP
Sbjct: 413 RQMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLA 472
Query: 229 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 288
+K G+ F +A R +ML YLRL ++ +S +
Sbjct: 473 DEKLEFLLSVGRDQTVDFPCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTI 529
Query: 289 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKM 344
S E +VL + K +L +P T +D A++ D + +R+A + R EK++
Sbjct: 530 SSANEASVLRSVVAAVKYQLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRL 589
Query: 345 LNACLQVTADMI 356
L L I
Sbjct: 590 LKRTLAALEKQI 601
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/328 (28%), Positives = 147/328 (44%), Gaps = 25/328 (7%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
EL LAL+LMYE+++ + S W PY++ + L E + L GS A
Sbjct: 117 ELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLWEQEEQEELLRGSSALA 170
Query: 80 EILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
++ ++ +++ ++ L DT+ + D P E FTF FK AF V S V+L
Sbjct: 171 KVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSAFKTAFSVVLSRAVYLP 221
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
L FALVP G + SS+ + V+L VD+ YK G+ + + ++
Sbjct: 222 SAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSA 278
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
LI YGF+DE + D + +E L + D KR + Q G Q F ++ R
Sbjct: 279 DFLIRYGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR---FP 335
Query: 258 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 317
+ +L Y RL + D+ + I VS E L L + +L T+ +
Sbjct: 336 TQLLTYTRLARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLMADCRTKLLSSSDTMED 393
Query: 318 DEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ L NL K+RVA QL EK++L
Sbjct: 394 EMQTLRRKNLSYKQRVAAQLRLKEKRIL 421
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 116/246 (47%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E++ + + LA YL+ E K S W YI L RQ S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PYSLLYWTRTE 187
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + Y +L + +F ++P+ P E F E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVFNDETFKWSF 241
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ RFALVP +L ++ + + L V DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGE 298
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDALKKHGLSTP 358
Query: 244 QVFHVH 249
Q F V
Sbjct: 359 QCFPVR 364
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 158/364 (43%), Gaps = 44/364 (12%)
Query: 16 NKLSELACLALYLMYEKKQ-GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
+ ++E +A L++EK G +S W Y+ L V W + +LA+L G
Sbjct: 85 DGINEYLAIACQLIHEKYVLGDESEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDG 139
Query: 75 SPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
SP A ++REY+ L + G L ++P P E FT+E + AF + S
Sbjct: 140 SPVVAATRSLQMKLRREYDAL-----LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFS 194
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD----------DAVQLVVDRPYK 182
+ L+ + + R A+VP L+ +S+ A + A + + V L DR Y+
Sbjct: 195 RAIRLRNLQVGERLAMVPYAD-LINHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYR 253
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE-------------DPQYQ 229
E + + G + N++LL+ YGF E NPY+ + V ++ DP Q
Sbjct: 254 QMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQ 313
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 289
+K G+ F +A R +ML +LRL ++ ++ +S
Sbjct: 314 EKVEFLASVGRDQTVDFPCYADRYP---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTIS 370
Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKML 345
P E AVL + + K +L YP + +D ++ D L +R+A + R EK++L
Sbjct: 371 PANEAAVLSSVVEAVKYQLNLYPQSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLL 430
Query: 346 NACL 349
L
Sbjct: 431 KRTL 434
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 113/246 (45%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L N + + LA YL+ E + S W YI L RQ S L W++ E
Sbjct: 130 AGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PYSLLYWTQAE 182
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+L +F +YP P E F E FK +F
Sbjct: 183 LDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVFNIESFKWSF 236
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ ALVP +L +S + L + DRPY+ GE
Sbjct: 237 GILFSRLVRLP--SMGGNVALVPWAD-MLNHSCDVETFLDYDKTSKGIVFTTDRPYQPGE 293
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV ++ NP D + + +L D Y++K + + G +
Sbjct: 294 QVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLELLKNYGLSAS 353
Query: 244 QVFHVH 249
Q F +
Sbjct: 354 QCFPIQ 359
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 115/246 (46%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E++ + + LA YL+ E K S W YI L RQ S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PYSLLYWTRTE 187
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + Y +L + +F ++P P E F E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVFNDETFKWSF 241
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ RFALVP +L ++ + + L V DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVIFTTDRPYQPGE 298
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTP 358
Query: 244 QVFHVH 249
Q F V
Sbjct: 359 QCFPVR 364
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 115/246 (46%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E++ + + LA YL+ E K S W YI L RQ S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PYSLLYWTRTE 187
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + Y +L + +F ++P P E F E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVFNDETFKWSF 241
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ RFALVP +L ++ + + L V DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGE 298
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTP 358
Query: 244 QVFHVH 249
Q F V
Sbjct: 359 QCFPVR 364
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 115/246 (46%), Gaps = 21/246 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E++ + + LA YL+ E K S W YI L RQ S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PYSLLYWTRTE 187
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + Y +L + +F ++P P E F E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVFNDETFKWSF 241
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ RFALVP +L ++ + + L V DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGE 298
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTP 358
Query: 244 QVFHVH 249
Q F V
Sbjct: 359 QCFPVR 364
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 117/247 (47%), Gaps = 23/247 (9%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L + + LA+YL+ E K S W YI L RQ S L W+ E
Sbjct: 139 AGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYWTRAE 191
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+L +F +YP P E F E FK +F
Sbjct: 192 LDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETFKWSF 245
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
+ S +V L S+ + ALVP +L +S + + L D + Q VV DR Y+ G
Sbjct: 246 GILFSRLVRLP--SMDGKVALVPWAD-MLNHSCEVETFLD-YDKSSQGVVFTTDRQYEPG 301
Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
E + + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 302 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKHGFSA 361
Query: 243 VQVFHVH 249
Q F V
Sbjct: 362 SQCFPVR 368
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/247 (30%), Positives = 118/247 (47%), Gaps = 23/247 (9%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L N + + LA YL+ E + S W YI L RQ S L W+ E
Sbjct: 128 AGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PYSLLYWTRAE 180
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+L +F ++P+ P E F E FK +F
Sbjct: 181 LDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVFNMETFKWSF 234
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
+ S +V L S+ + ALVP +L +S + + L D + Q VV DR Y+
Sbjct: 235 GILFSRLVRLP--SMDEKIALVPWAD-MLNHSCEVETFLD-YDKSSQGVVFTTDRTYQPS 290
Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
E + + G + N +LL++YGFV + NP D++ + +L D Y++K +++G +
Sbjct: 291 EQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKHGLST 350
Query: 243 VQVFHVH 249
Q F +
Sbjct: 351 SQCFPIQ 357
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 157/375 (41%), Gaps = 63/375 (16%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
CLAL+L+ EK SFW PYI+ L + G + L ++ EL L GSPT +
Sbjct: 148 CLALHLLVEK-HNHSSFWTPYIKTLPKSYG-------TCLYFTLEELEGLRGSPTFTSAI 199
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQK 139
+ + +Y + LFQ + AFT++ F A AV Q+ V
Sbjct: 200 KVIATVAIQYTYIH-------DLFQIRKDILHINAFTWDEFIWAMSAVGSRQNQVPQWGH 252
Query: 140 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
+L+ +AL+P + D+ + R YK GE + ++ GP+PNS L
Sbjct: 253 NALSE-YALIPAWDMCNHDHGDLQTFWDVNSDSTESHAMRAYKKGEQVYIFYGPRPNSDL 311
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAIS 258
L++ GFV E+N +D L + L + +DK R++ N K+ Q + G
Sbjct: 312 LLHAGFVYENNRFDALAIRVRLAPDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----LAV 367
Query: 259 DMLPYLRLGYVSDTSEMQSVI----------------SSLGPICPVSPCMERAVLDQLAD 302
D++ +LR+ + + E+Q V+ +S G + R L+ +
Sbjct: 368 DLMAFLRI-HAMNEQELQQVLGAYDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNE 426
Query: 303 YFKARLAG---------YPATLS---------EDEAMLTDYNLHPKKRVATQLVRMEKKM 344
+LA YP TL ED+A L +L P R T L EK++
Sbjct: 427 LAALQLAEAKCLSLLSLYPTTLQVANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEI 486
Query: 345 LNACLQVTADMIMLL 359
LN L D I LL
Sbjct: 487 LNRTL----DAIRLL 497
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 159/340 (46%), Gaps = 24/340 (7%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N L +AL+LM EKK G S W PYI L + S + WSE EL+ L GS
Sbjct: 122 NGLKPWVSVALFLMREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGS 174
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS 132
L E + E+ +L+ + Q +P+D+ + F F + + +AF ++
Sbjct: 175 QLLNTTLGVKELVANEFAKLEEEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEG 232
Query: 133 CVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-W 190
+ L + LA P + P A+ + A L + + L P KAG+ +++ +
Sbjct: 233 QSLVL--IPLADLANHSPDITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQY 289
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
+ N++L ++YG + + + + + D Y DK +A+ NG F +
Sbjct: 290 DLNKSNAELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL 349
Query: 251 GREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFK 305
E+ ++MLPYLRL + D ++S+ S G + P+SP E + + D
Sbjct: 350 --EQPLPANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACT 407
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ L+GY T++EDE +L + ++ P+ +A + EKK+L
Sbjct: 408 SALSGYSTTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 447
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 129/268 (48%), Gaps = 27/268 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSP 76
L E LA++L+ E + + S W PY L + S L W+E E+ +LT SP
Sbjct: 83 LPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQWTEEEVNTWLTASP 135
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
+ + LE + Y +L ++F ++P P++ +T FK AF + S +V
Sbjct: 136 VREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAFKWAFGILFSRLVR 189
Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGP 193
L V + ALVP +L +S + + L + +V V DR Y++GE + + G
Sbjct: 190 LPSVG---KLALVPWAD-MLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQVFISYGK 245
Query: 194 QPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
+ + +L + YGF+ E N +D + +E ++++DP ++ K A G S Q F V R
Sbjct: 246 RSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQRFPV---R 302
Query: 253 EKEAISDMLPYLRL--GYVSDTSEMQSV 278
+ + +L Y RL SD +++ +
Sbjct: 303 KDGFPAQLLAYARLIASRTSDPAQLSRI 330
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 118/247 (47%), Gaps = 23/247 (9%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A E+L N + + +A YL+ E K S W YI L RQ S L W+ E
Sbjct: 128 AGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PYSLLYWTREE 180
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L YL S + +ER + YN+L +F ++P P E F E FK +F
Sbjct: 181 LDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVFNIETFKWSF 234
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
+ S +V L S+ + ALVP +L ++ + + L D A Q VV DR Y+ G
Sbjct: 235 GILFSRLVRLP--SMDGKVALVPWAD-MLNHNCEVETFLD-YDKASQGVVFTTDRAYQPG 290
Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
E + + G + N +LL++YGFV ++ NP D + + +L D Y++K +++G +
Sbjct: 291 EQVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRA 350
Query: 243 VQVFHVH 249
Q F +
Sbjct: 351 SQCFPIQ 357
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 159/342 (46%), Gaps = 24/342 (7%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N L +AL+LM EKK G S W PYI L + S + WSE EL+ L GS
Sbjct: 122 NGLKPWVSVALFLMREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGS 174
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS 132
L E + E+ +L+ + Q +P+D+ + F F + + +AF ++
Sbjct: 175 QLLNTTLGVKELVANEFAKLEEEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEG 232
Query: 133 ---CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
++ L + + + + P A+ + A L + + L P KAG+ +++
Sbjct: 233 QSLVLIPLADLWVQQANHSPDITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLI 291
Query: 190 -WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
+ + N++L ++YG + + + + + D Y DK +A+ NG F +
Sbjct: 292 QYDLNKSNAELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDI 351
Query: 249 HAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADY 303
E+ ++MLPYLRL + D ++S+ S G + P+SP E + + D
Sbjct: 352 VL--EQPLPANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDA 409
Query: 304 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ L+GY T++EDE +L + ++ P+ +A + EKK+L
Sbjct: 410 CTSALSGYSTTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 451
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 155/342 (45%), Gaps = 48/342 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L+ +L+ E+K K+S W PY+ L + P+ +SE +L++L GSP + ++ E
Sbjct: 230 LSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQVRE 282
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ IKR+Y+++ +V F +Y TFE F A + S V LQ ++
Sbjct: 283 KKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-INEQ 328
Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSK 198
+ A VPL L + K DD +Q + D P GE + G + NS+
Sbjct: 329 KTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCNSR 384
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKEAI 257
+NYGF++ DN + + + + EDP + K+ + G + +V+ + +++ +
Sbjct: 385 FFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQNV 442
Query: 258 SDMLPYLRLGYVSDTS-------------EMQSVISSLGP--ICPVSPCMERAVLDQLAD 302
S+ + YLR + D S E +S P P+S E + ++++
Sbjct: 443 SEFMSYLRFILIRDNSKLLMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRISN 502
Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
+ ++ Y TL ED+ +L NL +R L EK++
Sbjct: 503 MCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 165/365 (45%), Gaps = 37/365 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL ++E E+ YL + ++
Sbjct: 81 TLAFHLLCERAD-PNSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVF 132
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FTF+ ++ A +V + +
Sbjct: 133 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 185
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL +S DD + V + +K+GE I ++ G + N++
Sbjct: 186 EDGSRVTLALIPLWDMCNHTNSLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAE 245
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF E+N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 246 FVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISA 304
Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ G++ + + + PVS E + +L + +AR
Sbjct: 305 QLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARAS 360
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
L Y T+ +D +L ++ +A +L R+EK++L L+ +D L +
Sbjct: 361 LLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEE 420
Query: 366 PCPAP 370
P P
Sbjct: 421 GTPLP 425
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/258 (25%), Positives = 109/258 (42%), Gaps = 47/258 (18%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L ++ +SE A LAL L++EK+ G+ S W PYI L R + S W + ELA +
Sbjct: 98 LLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELAMI 152
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E + R I+ E+NE+ + FQ+Y + + ++ FK A+V
Sbjct: 153 QESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASFKHAYVTATV 204
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--------------------- 171
C ++ + A+VP + + AML D
Sbjct: 205 CS-RAWRIDGLEKLAMVPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALF 262
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
QL D+ Y AGE + + GP N+ L +++GF NP+D++ + ++ D ++K
Sbjct: 263 CAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 322
Query: 232 RMVAQRNGKLSVQVFHVH 249
+Q H H
Sbjct: 323 -----------LQYLHAH 329
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 168/365 (46%), Gaps = 34/365 (9%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 74 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 125
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 126 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 178
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 179 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 238
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 239 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 297
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 298 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 357
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
Y T+ ED+++L +++L + ++A +L EK++L ++ A + P + +SP
Sbjct: 358 TYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDG 412
Query: 370 PYAPL 374
APL
Sbjct: 413 GKAPL 417
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 113/242 (46%), Gaps = 21/242 (8%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+++ N + + +A YL+ E S W+ YI L RQ S L W+ EL
Sbjct: 135 GDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PYSLLYWTRAEL 187
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
AYL SP + ++R + YN+L +F ++P P E + E F +F
Sbjct: 188 DAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVYNIETFLWSFG 241
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES 186
+ S +V L S+ R ALVP +L +S + + L + DR Y+ GE
Sbjct: 242 ILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQ 298
Query: 187 IVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
+ + G + + +LL++YGFV ++ NP D + + +L+ D Y++K +RNG +
Sbjct: 299 VFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESE 358
Query: 245 VF 246
F
Sbjct: 359 SF 360
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 112/235 (47%), Gaps = 20/235 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A +++ L E LA YL+ E GK S W PYI L R+ G S LLW+ +
Sbjct: 99 AGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLWTALD 151
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
+ A+L+ + K L+ ++ +N+L+ FM + P E F E FK AF
Sbjct: 152 VEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESFKWAF 205
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGE 185
+ S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y++ +
Sbjct: 206 GILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNK 262
Query: 186 SIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
+ + G + N +LL+ YGFV N D + + L+ D Y+ K + +G
Sbjct: 263 EVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 152/340 (44%), Gaps = 38/340 (11%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL L+ E +G S W PY+ L RQ +S + WSE EL + G+ ++L
Sbjct: 130 AVALLLLSEVARGADSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLL 178
Query: 83 ERAEGIKREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHL 137
G+K EY +E D+V S + D+ + TF+ F AF ++S V +
Sbjct: 179 STTVGVK-EYVQSEFDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRG 233
Query: 138 QKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
K++L LV P + + S K K + + L K+G+ I + +
Sbjct: 234 DKLALIPFADLVNHSPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDL 292
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
+ N++L ++YGFV+ + D V ++ DP Y DK +A+ NG F V
Sbjct: 293 DKSNAELALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL-- 350
Query: 253 EKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 305
+ MLPYLRL + T SV L P+SP E ++ + D K
Sbjct: 351 NEPLPPQMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACK 408
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ LA Y T+ EDE + NL P+ +A + EKK+L
Sbjct: 409 SALADYHTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVL 448
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 154/345 (44%), Gaps = 36/345 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+ALYL++EK + S W YIR L R ++SPL WSE ELA L G+ + +
Sbjct: 66 AVALYLLHEKAK-PHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSMN 117
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------H 136
E +KREY+++ T + + P +T E F AF ++S +
Sbjct: 118 GFKEFLKREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDN 171
Query: 137 LQKVSLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
L V LA F L P + K + A + + E ++ +
Sbjct: 172 LALVPLADFVNHGFGLTNEDP---GWKVKSAGVFARQETLTLQAAANCAEKQEVLIQYGK 228
Query: 193 PQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
+ N++L +YGFVD D N D + ++ + DK +AQ G S F+++
Sbjct: 229 KKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY- 287
Query: 251 GREKEAISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFK 305
R + DM+ YLRL + SD+ ++++ + P+S E A+ + + + +
Sbjct: 288 -RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCR 346
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
A L Y +T+ ED +L L +K++A + EK++L LQ
Sbjct: 347 ATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 156/360 (43%), Gaps = 53/360 (14%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA++L+ E G+ SFW PYI L + + L ++ EL L+GS E L
Sbjct: 121 ALAIFLILELSAGESSFWHPYISVLPD-------SFNTVLYFNIEELELLSGSAVLDEAL 173
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQK 139
+ I R+Y + F L + P+ + FT+++++ A AV Q+ V +
Sbjct: 174 KLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLYRWAVSAVMTRQNAVPWTES 229
Query: 140 VSL----------ARRFALVPL--------GPPLLAYSS-----KCKAMLAAVDDAVQLV 176
L A ALVPL G L Y S +C AM
Sbjct: 230 DGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSASMVRCYAM----------- 278
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
R + GE + ++ G + N++ I+ GFV EDN YD + ++ ++ +DP + K + +
Sbjct: 279 --RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCE 336
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
+ LS+ R++ D+ +LR+ + D S+ ++ S I S R
Sbjct: 337 -DHDLSLSGTFALVARDRPVSEDLSTFLRILVLKDASQPEAF--SAEHILTSSDSNARDA 393
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
L L + L +P + E E ++ D + + ++A +L +E K+L + L+ + +
Sbjct: 394 LTFLVVRIELLLKAFPKSDEEYEDIIKDGASNARVKMAARLRLLESKVLASVLETLGNHV 453
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 164/365 (44%), Gaps = 37/365 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL ++E E+ YL + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FTF+ ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + +K+GE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF E+N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISA 379
Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ G++ + + + PVS E + +L + +AR
Sbjct: 380 QLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARAS 435
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
L Y T+ +D +L ++ +A +L R+EK++L L+ +D L +
Sbjct: 436 LLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEE 495
Query: 366 PCPAP 370
P P
Sbjct: 496 GTPLP 500
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 164/364 (45%), Gaps = 37/364 (10%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ E+ SFWLPYI+ L + ++PL ++E E+ YL + ++
Sbjct: 156 LAFHLLCERAD-PNSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFS 207
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + R+Y +F + Q +P +P ++FTF+ ++ A +V + +
Sbjct: 208 QYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTE 260
Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
+R AL+PL + DD + V + +K+GE I ++ G + N++
Sbjct: 261 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEF 320
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
+I+ GF E+N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 VIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQ 379
Query: 260 MLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR--- 307
+L +LR+ G++ + + + PVS E + +L + +AR
Sbjct: 380 LLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASL 435
Query: 308 -LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSP 366
L Y T+ +D +L ++ +A +L R+EK++L L+ +D L +
Sbjct: 436 LLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEG 495
Query: 367 CPAP 370
P P
Sbjct: 496 TPLP 499
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 150/349 (42%), Gaps = 56/349 (16%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL L+ E +G S W PY+ L RQ +S + WSE EL + G+ ++L
Sbjct: 135 AVALLLLREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLL 183
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVV 135
G+K EY + S F+ +I +E TF F AF ++S V
Sbjct: 184 STTMGVK-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVF 233
Query: 136 HLQKVSLARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAG 184
+ + AL+P L+ +S K K + D L K+G
Sbjct: 234 AELR---GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSG 288
Query: 185 ESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
E I + + + N++L ++YGF + ++ D + ++ DP Y DK +A+ NG
Sbjct: 289 EQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGET 348
Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAV 296
F + G + MLPYLRL + T +V L PVS E A+
Sbjct: 349 AYFDIVLG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAI 404
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ + K+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 405 CQVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 453
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 155 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 206
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 207 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVXTRQNQIPT 259
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 260 EDGSRVTLALIPLWDXCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 319
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 320 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVFALHFT-EPPISA 378
Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ ++ S + + + PVS E + L D L
Sbjct: 379 QLLAFLRVFCXTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 438
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L +++L + + A +L EK++L ++ A
Sbjct: 439 TYKTTIEEDKSVLKNHDLSVRAKXAIKLRLGEKEILEKAVKSAA 482
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 157/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P + +P E+FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPQAHKLPLKESFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEGELKEHLLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ L ++ L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKTFLKNHGLSVRATMAVKLRLGEKEILEKAVESAA 483
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 110/235 (46%), Gaps = 21/235 (8%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+++ N + + +A YL+ E S W+ YI L RQ S L W+ EL
Sbjct: 135 GDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PYSLLYWTRAEL 187
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
AYL SP + ++R + YN+L +F ++P P E + E F +F
Sbjct: 188 DAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVYNIETFLWSFG 241
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES 186
+ S +V L S+ R LVP +L +S + + L + DR Y+ GE
Sbjct: 242 ILFSRLVRLP--SMDGRVVLVPWAD-MLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQ 298
Query: 187 IVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
+ + G + + +LL++YGFV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 299 VFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNG 353
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 69/236 (29%), Positives = 114/236 (48%), Gaps = 23/236 (9%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+++ + + + LA YL+ E S W YI L RQ S L W+ TE+
Sbjct: 46 GDVMKSYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PYSLLYWTRTEI 98
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
AYL SP + + R + YN+L +F ++P P E + E F+ +F
Sbjct: 99 DAYLVASPIRERAISRIGDVIGTYNDL------RDRIFSKHPELFPEEVYNMENFRWSFG 152
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGE 185
+ S +V L S+ + ALVP +L ++ + A L D + Q +V DR Y+ GE
Sbjct: 153 ILFSRLVRLP--SMDGKVALVPWAD-MLNHNPEVDAFLD-FDKSSQGIVFTTDRSYQPGE 208
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
+ + G + + +LL++YGFV ++ NP D + +LN D Y++K +R+G
Sbjct: 209 QVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFSVSLNKSDDCYREKLQALKRHG 264
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 154/345 (44%), Gaps = 36/345 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+ALYL++EK + S W YIR L R ++SPL WSE ELA L G+ + I
Sbjct: 66 AVALYLLHEKAK-PHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSIN 117
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------H 136
E +KREY+++ T + + P +T E F AF ++S +
Sbjct: 118 GFKEFLKREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDN 171
Query: 137 LQKVSLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
L V LA F L P + K + A + + E ++ +
Sbjct: 172 LALVPLADFVNHGFGLTNEDP---YWHVKSAGVFARQETLTLQAAANCAEKQEVLMQYGK 228
Query: 193 PQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
+ N++L +YGFVD D N D + ++ + DK +AQ G S F+++
Sbjct: 229 KKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY- 287
Query: 251 GREKEAISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFK 305
R + DM+ YLRL + SD+ ++++ + P+S E A+ + + + +
Sbjct: 288 -RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCR 346
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
A L Y +T+ ED +L L +K++A + EK++L LQ
Sbjct: 347 ATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 150/349 (42%), Gaps = 56/349 (16%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL L+ E +G S W PY+ L RQ +S + WSE EL + G+ ++L
Sbjct: 289 AVALLLLREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLL 337
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVV 135
G+K EY + S F+ +I +E TF F AF ++S V
Sbjct: 338 STTMGVK-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVF 387
Query: 136 HLQKVSLARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAG 184
+ + AL+P L+ +S K K + D L K+G
Sbjct: 388 AELR---GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSG 442
Query: 185 ESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
E I + + + N++L ++YGF + ++ D + ++ DP Y DK +A+ NG
Sbjct: 443 EQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGET 502
Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAV 296
F + G + MLPYLRL + T +V L PVS E A+
Sbjct: 503 AYFDIVLG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAI 558
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ + K+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 559 CQVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 607
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 114/237 (48%), Gaps = 25/237 (10%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
E++ N + + +A YL+ E S W YI L RQ S L W+ EL
Sbjct: 131 GEVMKRNSVPDWPLIATYLISEASLEGSSRWSSYIAALPRQ-------PYSLLYWTRAEL 183
Query: 70 -AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
AYL SP + ++R + YN+L D ++ LF P E + E F +F
Sbjct: 184 DAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHSDLF-------PEEVYNIETFLWSF 236
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
+ S +V L S+ + ALVP +L +S + + L D + Q +V DR Y+ G
Sbjct: 237 GILFSRLVRLP--SMDEKVALVPWAD-MLNHSPEVETFLD-FDKSSQGIVFTTDRSYQPG 292
Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
E + + G + + +LL++YGFV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 293 EQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNG 349
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L +++L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L +++L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 138/316 (43%), Gaps = 20/316 (6%)
Query: 38 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 97
SFWLPY++ L + SPL ++ +L L SPT +E++ + I R+Y
Sbjct: 168 SFWLPYLKILPH-------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFN 220
Query: 98 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLL 156
+ F L + P I + ++ ++ A +V + + + R AL+PL
Sbjct: 221 L-FQGHELASKLP--IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCN 277
Query: 157 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 216
+ + + +D + AG + ++ G + N++LLI+ GFV N DRL
Sbjct: 278 HTNGQITTDFSLKNDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLT 337
Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
+ ++ DP + K V R + ++F +H G SD L +LR+ +++ +++
Sbjct: 338 IRLGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLR 395
Query: 277 SVISSLGPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 329
+ ++ I VS ER LA L YP + ED +L +L
Sbjct: 396 TALACRQQISKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLST 455
Query: 330 KKRVATQLVRMEKKML 345
R+A QL EK +L
Sbjct: 456 HARLAVQLRHCEKNIL 471
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P +AFT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPVSA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L D++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLKDHDLSVRAAMAIKLRLGEKEILEKAVKSAA 483
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/339 (23%), Positives = 153/339 (45%), Gaps = 25/339 (7%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 163 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 214
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVS 141
+ + R+Y +F + + +P ++FT+E ++ A +V + +
Sbjct: 215 SQYKNTARQY-----AYFYKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTED 269
Query: 142 LAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
+R AL+PL S DD + V + ++AGE I ++ G + N++ +
Sbjct: 270 GSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFV 329
Query: 201 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 260
I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E + +
Sbjct: 330 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQL 388
Query: 261 LPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
L +LR+ +++ S + + + PVS E + L D L Y
Sbjct: 389 LAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTY 448
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
T+ ED++ L +++L + +A +L EK++L ++
Sbjct: 449 KTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVK 487
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL S DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVKSAA 483
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 163 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 214
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 215 SQYKNTARQY-----AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 267
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL S DD + V + ++AGE I ++ G + N++
Sbjct: 268 EDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 327
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 328 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 386
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 387 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 446
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 447 TYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVKSAA 490
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 110/235 (46%), Gaps = 20/235 (8%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
A +++ L E LA YL+ E GK S W PYI L R+ G S LLW+ +
Sbjct: 99 AGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLWTALD 151
Query: 69 L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
+ +L+ + K L+ ++ +N+L+ FM + P E F + FK AF
Sbjct: 152 VETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSFKWAF 205
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGE 185
+ S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y++
Sbjct: 206 GILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNR 262
Query: 186 SIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
+ + G + N +LL+ YGFV N D + + L+ D Y+ K + +G
Sbjct: 263 EVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 163 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 214
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 215 SQYKNTARQY-----AYFY--RVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 267
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL S DD + V + ++AGE I ++ G + N++
Sbjct: 268 EDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 327
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 328 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 386
Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 387 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLK 446
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 447 TYKTTIEEDKSFLKNHDLSARATMAVKLRLGEKEILERAVKSAA 490
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 145/336 (43%), Gaps = 30/336 (8%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ EK +G SFW P++ L + PL +SE E A + GS LE
Sbjct: 89 LASWLLEEKHRGADSFWKPFVDSLP------EAYPHVPLFYSEQERARMKGSQ-----LE 137
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSL 142
R ++R+ E + + Q +P E F FE + A +++ S + L+
Sbjct: 138 RLVEVQRQSFEQE---------YAQLREKLPEYERFGFEEYVWARISLYSRLFSLK--GG 186
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLL 200
+ +LVPL + + + + +D +++ R AG I G + + L
Sbjct: 187 LQGPSLVPLSD-MFNHRQPPDVLWSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFL 245
Query: 201 INYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
++ GFV D + D + + L DP K+ + + F V + A
Sbjct: 246 LHSGFVPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWS 305
Query: 260 MLPYLRLGYVSDTSEM---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 316
+ +LR+ + S + ++S I PVS E VL LA + RL +P TL
Sbjct: 306 VFSFLRMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLE 365
Query: 317 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 352
EDE +L + L P +R L R EK++L L++T
Sbjct: 366 EDERLLREGPLSPNERSCVLLRRQEKRLLGDYLELT 401
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKTVKSAA 483
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 158/343 (46%), Gaps = 29/343 (8%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 336 LAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFS 387
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 388 QYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTE 440
Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 441 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEF 500
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 501 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQ 559
Query: 260 MLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAG 310
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 560 LLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKT 619
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED + L +++L + +A +L EK++L ++ A
Sbjct: 620 YKTTVEEDRSFLRNHDLSVRAAMAVKLRLGEKEILERAVKSAA 662
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAN-PNSFWQPYIQSLPGE-------YDTPLYFEEDEVRYLHSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 158 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVF 209
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 210 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 262
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 263 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 322
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 323 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 381
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 382 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 441
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 442 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 171 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVF 222
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 223 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 275
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 276 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 335
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 336 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 394
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 395 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 454
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 455 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 498
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 158 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 209
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 210 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 262
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 263 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 322
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 323 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 381
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 382 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 441
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 442 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 147/337 (43%), Gaps = 23/337 (6%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA++L+ E G+ SFW PYI L + + L +S EL L GS E L+
Sbjct: 152 LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLYFSVEELELLHGSTVLDEALK 204
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVVHLQKVS 141
I R+Y+ + F L + PY + FT+++++ A AV + V L +
Sbjct: 205 LHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMTRQNAVPLTDTA 260
Query: 142 LARR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
A+VPL K + ++ R ++ G+ + ++ G +
Sbjct: 261 GGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLRCYAMRDFEKGQEVTIFYGRR 320
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
N++ I+ GFV +N +D + ++ ++ +DP Y K + + +F + RE+
Sbjct: 321 TNAEFFIHNGFVFPENRHDSVDIKLGISKQDPLYAVKAKLCDDHELTPSGIFAL-VPRER 379
Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
D+ +LR+ + D S+ S I + R L+ L + L +P +
Sbjct: 380 PVCEDLSTFLRILVLKDASQAASFTDE--HIMVATDDNAREALNFLIVRIQLLLRAFPKS 437
Query: 315 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 351
E E ++ D + + ++A QL +E+K+L A L+
Sbjct: 438 DQEYENIIADEGSNGRLKMAAQLRLLERKILTAVLET 474
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 108/222 (48%), Gaps = 23/222 (10%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEIL 82
LA YL+ E S W YI L RQ S L W+ TE+ AYL SP + +
Sbjct: 152 LATYLISEASLEGSSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAI 204
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
R + YN+L +F ++P P + + E F+ +F + S +V L+ S+
Sbjct: 205 SRISDVIGTYNDLRD------RIFSKHPDLFPEKVYNMENFRWSFGILFSRLVRLE--SM 256
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKL 199
+ ALVP +L +S + A L D + Q +V DR Y+ GE + + G + + +L
Sbjct: 257 GGKVALVPWAD-MLNHSPEVDAFLD-YDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGEL 314
Query: 200 LINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
L++YGFV ++ NP D + +L D Y++K +++G
Sbjct: 315 LLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQALKKHG 356
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 153/333 (45%), Gaps = 25/333 (7%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ EK + +S W Y+ L S + WSE EL + G+ L
Sbjct: 124 SVALFLIREKLR-DESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTL 175
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVS 141
E ++ E+ +++ + S +P + + F I + +AF ++ Q +
Sbjct: 176 GVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG-----QNLV 230
Query: 142 LARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
L L+ P + Y+ + K A L + D L KAGE +++ + + N+
Sbjct: 231 LIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNA 290
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
+L ++YGF++ + + ++ DP + DK +A+ NG + F + G+ A
Sbjct: 291 ELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA 350
Query: 258 SDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKARLAGYP 312
MLPYLRL + D ++S+ + G + PVS E + + D K+ L+GY
Sbjct: 351 --MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYL 408
Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 409 TTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 154/345 (44%), Gaps = 31/345 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R + AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVISSLGPI----------CPVSPCMERAVLDQLADYFKARL 308
+L +LR+ + E++ + G I PVS E + L D L
Sbjct: 380 QLLAFLRV-FCMTEEELREHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLL 438
Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+A+L L + +A +L EK++L ++ A
Sbjct: 439 KTYKTTIEEDKAVLRSPALSARAAMAVKLRLGEKEILEKAVRSAA 483
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCER-ASPNSFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 160/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL + E E+ +L + ++
Sbjct: 156 TLAFHLLCERAN-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT++ ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LA 309
+L +LR+ +S+ + +I ++G I + D +L + +AR L
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ D++ L ++L P +A +L EK++L ++ A
Sbjct: 440 TYKTTVEVDKSFLETHDLTPHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ +L + ++
Sbjct: 156 TLAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRHLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+A L ++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKAFLKGHDLSIRATMAVKLRLGEKEILERAVKSAA 483
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 154/338 (45%), Gaps = 25/338 (7%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L +AL+L+ EK + +S W Y+ L S + WSE EL + G+
Sbjct: 119 LKPWVSVALFLIREKLR-DESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQL 170
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVH 136
L E ++ E+ +++ + S +P + + F I + +AF ++
Sbjct: 171 SNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG---- 226
Query: 137 LQKVSLARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
Q + L L+ P + Y+ + K A L + D L KAGE +++ +
Sbjct: 227 -QNLVLIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDL 285
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
+ N++L ++YGF++ + + ++ DP + DK +A+ NG + F + G+
Sbjct: 286 DKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQ 345
Query: 253 EKEAISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 307
A MLPYLRL + D ++S+ + G + PVS E + + D K+
Sbjct: 346 SLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSA 403
Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
L+GY T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 404 LSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 112/243 (46%), Gaps = 23/243 (9%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+L N + + +A YL+ E S W YI L RQ S L W+ EL
Sbjct: 134 GNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYWTRPEL 186
Query: 70 -AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
AYL SP + ++R + YN+L D ++ LF P E + E F+ +F
Sbjct: 187 DAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLETFRWSF 239
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ R ALVP +L +S + + L + DR Y+ GE
Sbjct: 240 GILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGE 296
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + + +LL++YGFV ++ NP D + + +LN D Y++K +RNG
Sbjct: 297 QVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEF 356
Query: 244 QVF 246
+ F
Sbjct: 357 ESF 359
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 112/243 (46%), Gaps = 23/243 (9%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+L N + + +A YL+ E S W YI L RQ S L W+ EL
Sbjct: 134 GNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYWTRPEL 186
Query: 70 -AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
AYL SP + ++R + YN+L D ++ LF P E + E F+ +F
Sbjct: 187 DAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLETFRWSF 239
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
+ S +V L S+ R ALVP +L +S + + L + DR Y+ GE
Sbjct: 240 GILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGE 296
Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+ + G + + +LL++YGFV ++ NP D + + +LN D Y++K +RNG
Sbjct: 297 QVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEF 356
Query: 244 QVF 246
+ F
Sbjct: 357 ESF 359
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIYDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 74 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVF 125
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 126 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 178
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 179 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAE 238
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 239 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 297
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 298 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 357
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + ++A +L EK++L ++ A
Sbjct: 358 TYKTTIEEDKFVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 154/344 (44%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 145 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRSLQSTQAVHDVF 196
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 197 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 249
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 250 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 309
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 310 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-MEPPISA 368
Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ ++ + + + + PVS E + L D L
Sbjct: 369 QLLAFLRVFCMTEEELKDHLLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLK 428
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T ED++ L +++L + R+A +L EK++L + A
Sbjct: 429 TYKTTSEEDKSFLKNHDLSVRARMAIKLRLGEKEILEKAVTSAA 472
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 158/343 (46%), Gaps = 29/343 (8%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ E+ SFWLPYI+ L + ++PL + E E+ L + ++
Sbjct: 157 LAFHLLCERAN-PNSFWLPYIQTLPSE-------YDTPLYFEEEEVQCLQSTQAIHDVFS 208
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 209 QYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTE 261
Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 262 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEF 321
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 322 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQ 380
Query: 260 MLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAG 310
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 381 LLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKT 440
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+A+L L + ++A +L EK++L +Q A
Sbjct: 441 YKTTIEEDKAVLKGPELPTRMKMAVKLRLGEKEILERTVQSAA 483
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 152/351 (43%), Gaps = 39/351 (11%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTK 78
EL LAL+L E+ +G S W PY++ L + ++PL W++ + A L GSP
Sbjct: 121 ELIGLALWLCAERIKGGASEWAPYVKTL-------RANPDAPLFWTDAKDFALLKGSPVA 173
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
A+ +ER++ + EY + V + P P EA+ F + A+ +
Sbjct: 174 ADAIERSKSARTEYASITEV-------IKSDPSSYPPEAYEFLTEARFVDALATVCAKAT 226
Query: 139 KVSLARRFALVPL----------GPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYKA 183
+ A+ +ALVPL P +L S+ +C VD A ++ A
Sbjct: 227 WLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDVDTASVVLRCATKAA 286
Query: 184 GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 241
S V+ + N +L +N G+VD+ +P D + ++ + T D + K+ V + G
Sbjct: 287 ANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTDIQTSDRLFTAKKQVLEGMGFT 346
Query: 242 SV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 300
+ Q F V+ R + + YLR V D EM +V I VS E +L L
Sbjct: 347 AADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVSFEEDKI--VSVMNEYEILQIL 401
Query: 301 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 350
+ +A Y ++ +L + P + + +RM EKK++ A +
Sbjct: 402 MGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLRMSEKKLIGATMN 452
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 114/242 (47%), Gaps = 26/242 (10%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
ELL T ++E A LAL+++ E+ G+ S W PYI L A+ S + W + EL
Sbjct: 35 ELLPTG-VTEWARLALFILVEQHLGQASQWAPYINCLPTCG-----ALHSTVFWKKEELE 88
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
+ + E ++R I E+ + V Q+ P+ I E FKQA+
Sbjct: 89 LVRFTSLHRETMQRRAVIGSEFASVLPV-------LQKCPH-IFGERVLHSKFKQAYATG 140
Query: 131 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIV 188
+S + S R VP + S C+A+L+ ++ +++ D+ Y GE +V
Sbjct: 141 KSL-----RRSSNTRILTVPF-VDFFNHDSNCRALLSYDEERACAEVIADKNYARGEQVV 194
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
+ G PN+ L +++GF NPYD++ V AL+ DP + K + +G +V V
Sbjct: 195 ISYGRLPNTTLALDFGFTISCNPYDQVEVWMALSHRDPLRKMKLALLHAHGMPTV----V 250
Query: 249 HA 250
HA
Sbjct: 251 HA 252
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFGEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y + ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTNIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 150/341 (43%), Gaps = 27/341 (7%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ EK + + SFW PYI L + PL + E A L G +
Sbjct: 95 LASFLLQEKHR-EGSFWKPYIDSLPESYS------QMPLFYGSEEHALLKGCFALTLLTH 147
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+A+ ++ +Y SL Q P E FT F A ++V S + L+K
Sbjct: 148 QAQSLREDYL----------SLCQNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFL 194
Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLI 201
+ LVP+ +L + + +D V+ + AG+ + G + N +L+
Sbjct: 195 GQ-TLVPMAD-MLNHRRPPDVLWETTEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLL 252
Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 261
++GFV +DN +D + + DP K+M+ + + F +
Sbjct: 253 HFGFVTDDNEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAF 312
Query: 262 PYLRLGYV--SDTSEMQS-VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 318
+LR+ +D ++ S V+S + P+S E VL+ LA +ARL+ +P +L++D
Sbjct: 313 SFLRIAAAVPNDIEDISSRVMSGERALGPLSVENEENVLELLAATCQARLSIFPTSLAQD 372
Query: 319 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 359
E +L +L P R + R EK+++ L++T + LL
Sbjct: 373 EELLRGESLSPNARNCVLVRRAEKQLIEDYLEMTRVCLKLL 413
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P +AFT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H + +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P +AFT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H + +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/342 (23%), Positives = 155/342 (45%), Gaps = 37/342 (10%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 157 LAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFS 208
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + R+Y +F + Q +P+ +P +AFT+E ++ V S + +
Sbjct: 209 QYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRLGLV---SLALGRWAL 258
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
L + G P + + DD + V + ++AGE I ++ G + N++ +
Sbjct: 259 GLECGVGIARCGKPQITTGYNLE------DDRCECVALQDFRAGEQIYIFYGTRSNAEFV 312
Query: 201 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 260
I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E + +
Sbjct: 313 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQL 371
Query: 261 LPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAGY 311
L +LR+ +++ + ++ +LG PVS E + L D L Y
Sbjct: 372 LAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTY 431
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
T+ ED+A L ++NL + +A +L EK++L ++ A
Sbjct: 432 KTTIEEDKAFLKNHNLSVRATMAIKLRLGEKEILEKAVKSAA 473
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 155/345 (44%), Gaps = 39/345 (11%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LAL+L+ E+ SFWLPYIR L ++ ++PL + + E+ L G+ ++L
Sbjct: 156 TLALHLLCERAN-PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVL 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-- 137
+ R+Y +F L Q +P +P ++FTF+ ++ A +V + +
Sbjct: 208 SQYRNTARQY-----AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 260
Query: 138 ---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
++V+LA L+PL + DD + V + YK E I ++ G +
Sbjct: 261 EDGRQVTLA----LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTR 316
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
N++ +I+ GF ++N +D++ ++ ++ + Y K V R G +F ++ E+
Sbjct: 317 SNAEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQ 375
Query: 255 EAISDMLPYLRLGYVSDTSEMQ---------SVISSLGPI-CPVSPCMERAVLDQLADYF 304
+ +L +LR+ + E++ + I +LG + PVS E + L
Sbjct: 376 PISAQLLAFLRV-FCMKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRV 434
Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
L Y T ED + L L P R+A QL EK +L L
Sbjct: 435 ALLLKAYKTTSEEDSSTLEKSELSPHSRMAIQLRLAEKWILEKAL 479
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 150/328 (45%), Gaps = 29/328 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQL 337
Y T+ ED+++L + +L + ++A +L
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKL 467
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P +AFT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H + +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSA 379
Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 154/344 (44%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 170 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAVHDVF 221
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 222 SQYKNTARQY-----AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 274
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 275 EDGSRVTLALIPLWDMCNHTTGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 334
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 335 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 393
Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ ++ + + + + PVS E + L D L
Sbjct: 394 QLLAFLRVFCMTEEELKDHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLK 453
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED A L + +L + +A +L EK++L ++ A
Sbjct: 454 TYKTTVEEDRAFLKNSDLSVRATMAIKLRLGEKEILEKAVKSAA 497
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P +AFT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H + +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ + ++ +LG PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 107/222 (48%), Gaps = 20/222 (9%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L + +S A LAL+L+ +K+ + S W PYI L ++ S + W++ EL YL
Sbjct: 36 LLSKDVSAWAKLALFLLAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYL 90
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
SP E ++R + ++ E+ + + P+ + E FK A+ V S
Sbjct: 91 KVSPVYRETVQRKDVVRMEFAAAENALLLC-------PHIFGSRVSALE-FKHAYATVCS 142
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
++ + + ALVP + + C+AML+ +D ++V DR Y G+ +V+
Sbjct: 143 RAWGIETI---KSLALVPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVIS 198
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDPQYQDK 231
G N+ L +++GF NP+D++ + +L+ +DP K
Sbjct: 199 YGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDPLRDSK 240
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 151/343 (44%), Gaps = 37/343 (10%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+L+ E+ SFWLPYIR L ++ ++PL + + ++ L G+ ++L
Sbjct: 75 LALHLLCERAD-PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLS 126
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL--- 137
+ R+Y +F L Q +P +P ++FTF+ ++ A +V + +
Sbjct: 127 QYRNTARQY-----AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTE 179
Query: 138 --QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
++V+LA L+PL + DD + V + YK E I ++ G +
Sbjct: 180 DGRQVTLA----LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRS 235
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
N++ +I+ GF ++N +D++ ++ ++ + Y K V R G VF ++ E
Sbjct: 236 NAEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPP 294
Query: 256 AISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
+ +L +LR+ Y+ + +++ PVS E + L
Sbjct: 295 ISAQLLAFLRVFCMMEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAAL 354
Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
L Y T ED + L +L P R+A QL EK +L L
Sbjct: 355 LLKAYKTTAEEDSSTLDKTDLSPHSRMAVQLRLAEKAILEKAL 397
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 160/348 (45%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + +PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAN-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT++ ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ +S+ + +I +LG PVS E +L + +AR
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
L Y T+ +D++ L ++L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 143/333 (42%), Gaps = 25/333 (7%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+L+ E+ S WLPYI+ L + ++PL + E E+ +L + ++L
Sbjct: 157 LALHLLCERAN-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLS 208
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+ + R+Y +F +P +AFTF+ ++ A +V + +
Sbjct: 209 QYKNTARQY-----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADG 263
Query: 143 AR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
+R AL+PL + DD + V + YK GE I ++ G + N++ +I
Sbjct: 264 SRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVI 323
Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 261
+ GF EDN +DR+ ++ ++ + Y K V R G + +F +H E + +L
Sbjct: 324 HNGFFFEDNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLL 382
Query: 262 PYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
+LR+ Y+ + + + PVS E + L L Y
Sbjct: 383 AFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYK 442
Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
ED +ML +L R+A +L EK++L
Sbjct: 443 TASEEDRSMLEKPDLSLHSRIAIKLRLAEKEIL 475
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 160/348 (45%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAN-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT++ ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ +++ + +I +LG P+S E +L + +AR
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
L Y T+ +D+ L ++L +A +L EKK+L ++ A
Sbjct: 436 LLLKTYKTTVEDDKLFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 147/358 (41%), Gaps = 48/358 (13%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPT 77
EL LAL+L E+ +G S W PY+ L + PLLW+ EL L GSP
Sbjct: 118 GELVGLALWLCLERHKGPLSEWAPYVATLP------SAGSDHPLLWTAGELQTLLQGSPV 171
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVH 136
+ + + R E EY + + P D P +A+ F + + AFV A+ + +
Sbjct: 172 REQAVSRLESADDEYASI-------ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLAR 223
Query: 137 LQKVSLARRFALVPL-------------------------GPPLLAYSSKCKAMLAAVDD 171
++ A +A+VPL G P LA ++ AA +
Sbjct: 224 AVWLNAANCYAMVPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATEC 283
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
+ + + + V + L + G VDE + D L A+ D Y+ K
Sbjct: 284 VAVVSANDAQQTARVVCVDPLARNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAK 343
Query: 232 RMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 290
R + + G Q F V A R +L Y+R V D E+ SV I VSP
Sbjct: 344 RQILEGMGMSADGQTFPVFADRMP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSP 398
Query: 291 CMERAVLDQLADYFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 347
E VL L + LA Y ++ E E + L + L ++RVA +L EK+++NA
Sbjct: 399 MNEYEVLQLLMQDAREMLAEYESSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 251
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 311
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 312 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 38/341 (11%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
++L ++ E +G S W PY+ L RQ +S + WSE EL + G+ + +
Sbjct: 132 SVSLLILREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTM 184
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E ++ E++ ++ + G +P + TF+ F AF ++S V +
Sbjct: 185 GVKEYVQSEFDNVEAK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR--- 234
Query: 143 ARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-W 190
+ AL+P L+ +S+ + K L D L K+GE + V +
Sbjct: 235 GDKLALIPFAD-LINHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQY 292
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F V
Sbjct: 293 DLDKSNAELALDYGFTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVL 352
Query: 251 GREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFK 305
G + M+ YLRL + T ++++ + G + PVS E ++ + K
Sbjct: 353 G--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACK 410
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
+ L Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 411 SALTAYHTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVLQ 451
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 91/337 (27%), Positives = 141/337 (41%), Gaps = 59/337 (17%)
Query: 17 KLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYL 72
+L E + LA +L + + G + YIR L R+ G S L W E E+ L
Sbjct: 131 ELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------SVLDWPEDEVETLL 183
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
GSP++ ER E + E+ + +P DI A + AF + S
Sbjct: 184 KGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALRW-----AFDILFS 227
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
++ L ++ ALVP +L + C A + AV L DR Y AGE + G
Sbjct: 228 RLIRLD--AMGGELALVPWAD-MLNHKPGCAAFIDLNGSAVNLTTDRAYAAGEQVWASYG 284
Query: 193 PQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH- 249
+P+S+LLI+YGF E +NP D + ++ DP Q K V +R G V+ F +
Sbjct: 285 QRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRRMGLSPVETFPLRL 344
Query: 250 AGREKEAI-------------SDMLPYLRLGYVSDTSEMQSVISSL------------GP 284
G ++ + S++ R + + QS+ S+ G
Sbjct: 345 NGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVRGLAQGQARGKQGV 404
Query: 285 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 321
I P E AV + LAD L+ YP +L +D+ +
Sbjct: 405 ILGGVPG-EIAVREMLADMCAEALSAYPNSLEKDKGI 440
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 161/348 (46%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 TLAFHLLCERAN-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT++ ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ +++ + +I +LG P+S E +L + +AR
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
L Y T+ +D++ L ++L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 142/333 (42%), Gaps = 25/333 (7%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+L+ E+ S WLPYI+ L + ++PL + E E+ +L + ++L
Sbjct: 157 LALHLLCERAN-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLS 208
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+ + R+Y +F +P +AFTF+ ++ A +V + +
Sbjct: 209 QYKNTARQY-----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADG 263
Query: 143 AR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
+R AL+PL + DD + V + YK GE I ++ G + N++ +I
Sbjct: 264 SRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVI 323
Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 261
+ GF EDN +DR+ ++ ++ + Y K V R G + +F +H E + +L
Sbjct: 324 HNGFFFEDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLL 382
Query: 262 PYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
+LR+ Y+ + + + PVS E + L L Y
Sbjct: 383 AFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYK 442
Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
ED +ML +L R+ +L EK++L
Sbjct: 443 TASEEDRSMLEKPDLSLHSRITIKLRLAEKEIL 475
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379
Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 148/335 (44%), Gaps = 29/335 (8%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+L+ E+ SFW PYIR L ++ ++PL + + ++ L G+ ++L
Sbjct: 157 LALHLLCERGD-PASFWSPYIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLN 208
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + R+Y +F L Q +P +P + F+F+ ++ A +V + + V
Sbjct: 209 QYKNTARQY-----AYFY--KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTV 261
Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
+R AL+PL + DD + V + YK E I ++ G + N++
Sbjct: 262 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEF 321
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
+I+ GF +DN +DR+ ++ ++ + Y K V R G + VF +H + +
Sbjct: 322 VIHNGFFFQDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQ 380
Query: 260 MLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
+L +LR+ Y+ + + + PVS E + L L
Sbjct: 381 LLAFLRVFCMTEEELKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKT 440
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
Y T ED ++L +L R+A QL EK++L
Sbjct: 441 YKTTSEEDRSILEKPDLSLHTRLAVQLRLAEKQIL 475
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 159 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 210
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 211 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 263
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 264 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 323
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 324 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 382
Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 383 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 442
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 443 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 486
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 105/227 (46%), Gaps = 19/227 (8%)
Query: 15 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
+ ++E + LAL+ + K GK S W PYIR L RG +++ + W + EL L
Sbjct: 99 SEDVNEWSRLALFQLLHKHAGKASPWEPYIRCLPPLRG-----LQNTVFWRDEELELLRQ 153
Query: 75 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
S + R I +++ + V +YP ++ E T E FK A+ S
Sbjct: 154 SNVYDQTEHRKTLISNQFDLVQAV-------VNKYP-ELFGETVTLESFKHAYCVASSRS 205
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCG 192
++ + +VP + + S +A+LA ++ ++V D+ Y G +V+ G
Sbjct: 206 WGVEALG---SITMVPF-VDMFNHDSSARALLAYYEEEGYAEVVADKDYNQGSQVVITYG 261
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
PNS L +++GF DNP+D + + + DP +K + + +G
Sbjct: 262 TLPNSSLALDFGFTLPDNPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 96/346 (27%), Positives = 153/346 (44%), Gaps = 34/346 (9%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L++E+ G S W PYI L G SP+ W E +LA L GS +
Sbjct: 44 AIALFLLHERAMGNASRWAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQ 96
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VH 136
++ +++L F S +D P F F+ F A V++ +
Sbjct: 97 GYRAYFQQRFDQLQAEVFGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGN 150
Query: 137 LQKVSLARRFALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGP 193
+ V LA P PP A + K L LV++ AG++I + GP
Sbjct: 151 IALVPLADMVRSQPSWPPDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGP 210
Query: 194 Q-PNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-H 249
Q + +LL+++G +D + P L +E L+ ED Y DK + + N +L+ H+
Sbjct: 211 QKSDGQLLVDHGVIDPLVNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILR 267
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKA 306
A R +A + L +D ++S+ + PVS ER QL D A
Sbjct: 268 ADRAPDAGLLPVLRLLNLSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTA 327
Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRM-EKKMLNACLQ 350
LA YP ++ ED A++ +L P R + + VR+ EK+ L+A L+
Sbjct: 328 ALAAYPTSIDEDLALMASGSLQPGSRRQSAVRVRLGEKEALDATLR 373
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 144/350 (41%), Gaps = 43/350 (12%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTK 78
EL LAL+L E+ +G S W PY++ L ++PL W+E E A L GSP
Sbjct: 120 ELVGLALWLCAERIKGGASDWAPYVKTL-------AANPDAPLFWTEAEDFALLKGSPIV 172
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVV 135
+ +ER+ + EY + V + P P EA FT E F A V +
Sbjct: 173 NDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEAYEFFTEERFVDALATVCAKAT 225
Query: 136 HLQKVSLARRFALVPL-------GPPLLAY---SSKCKAMLAAVD---DAVQLVVDRPYK 182
L S +ALVPL G P+ S+K A D D+ +V+ K
Sbjct: 226 WLPTASC---YALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAVVK 282
Query: 183 A-GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
A S VV P + N +L +N G VD+ +P D L + + D + K+ V + G
Sbjct: 283 APANSRVVQLDPLQRNNGELFLNTGRVDQKHPGDYLYMRTEIQPSDRLFSAKKQVLEGMG 342
Query: 240 -KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 298
Q F V+ E + + YLR V D EM +V I VS E +L
Sbjct: 343 FTAENQYFPVY---EDRMPTQLYSYLRFARVQDPGEMMAVSFEEDKI--VSVMNEYEILQ 397
Query: 299 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 348
L + ++ Y ++ +L + + + +RM +K L C
Sbjct: 398 LLMGDCRELMSEYDTNEEDELNLLKLSDTMRVREIEAAKLRMSEKKLIGC 447
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 155/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 SVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379
Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 94/206 (45%), Gaps = 35/206 (16%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEIL 82
L+L+L+ EK + ++S W PY+R L QL +ES + W + ELA+L SPT E +
Sbjct: 51 LSLFLLSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETM 104
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E + IK E++ L+ F + D+ E + F A+ Q
Sbjct: 105 ECLKIIKSEFHVLEANVF-------PWCRDVLGEV-SLTDFMHAYSTDQ----------- 145
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
+P + C+ L+ D V D+ YKAG+ I + G PNS L
Sbjct: 146 ------IPFA-DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILA 198
Query: 201 INYGFVDEDNPYDRLVVEAALNTEDP 226
++YGF NP++++ V ++ DP
Sbjct: 199 VDYGFAVASNPHEQVEVPMGVSLTDP 224
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 10 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 62 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 174
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 175 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 233
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 234 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 293
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 294 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 251
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 311
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 312 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 156 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 440 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 145/354 (40%), Gaps = 40/354 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LALYL+ E S PY++ L + + W + L GSP +
Sbjct: 153 LALYLIKESTNPDSSI-APYLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVN 204
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
G R+Y +F LF P + T FT+E F A VQS V
Sbjct: 205 YTRGSMRQY-----CYFY--QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGG 254
Query: 144 RRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
+ AL+P ++ K + V + + YK GE + ++ GP+PNS+ +
Sbjct: 255 QEMALIPFWDFCNHSSHGGKITTFIDPVKHVLTCSAAKSYKKGEQVYMYYGPRPNSQFYL 314
Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQ--YQDK-RMVAQRNGKLSVQVFHVHAGREKEAI- 257
GF + N D + L+ ED + DK ++ +R G Q + E +
Sbjct: 315 FQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLRVGQTVSLSQNPSSEKLP 374
Query: 258 SDMLPYLRLGYVS--DTSEM-----------QSVISSLGP--ICPVSPCMERAVLDQLAD 302
++++P+ R+ +S +T ++ + P +S E+ L D
Sbjct: 375 AEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAFNIISEENEKKAFKLLLD 434
Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
KARL+GYP TL++DE + + N ++R ++ EKK+L ++ +I
Sbjct: 435 SLKARLSGYPTTLAQDEQEMKN-NPTTQRRYVLYILINEKKILERNIKYVEQLI 487
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 106/217 (48%), Gaps = 22/217 (10%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L + +S A LAL+L+ +K+ + S W PYI L ++ S + W++ EL YL
Sbjct: 36 LLSKDVSAWAKLALFLLAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYL 90
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
SP E ++R + ++ E+ + V L QQ + + T ++ V S
Sbjct: 91 KVSPVYRETVQRKDVVRMEFAAAENVCM----LMQQVKLFVCSRILT------DYITVCS 140
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
++ + + ALVP + + C+AML+ +D ++V DR Y G+ +V+
Sbjct: 141 RAWGIETI---KSLALVPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVIS 196
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDP 226
G N+ L +++GF NP+D++ + +L+ +DP
Sbjct: 197 YGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDP 233
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 113/240 (47%), Gaps = 22/240 (9%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
+L +N++ + LA L+ EKK G+KS W+PYI L + + S + W E EL+
Sbjct: 105 VLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSM 159
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
+ S E +++ I+++++ F+A + Q P I TE E F A+ V
Sbjct: 160 IRCSAVHQETVKQKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVG 211
Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVW 190
S + ++R +L+P + +L D+ + ++ DR Y G+ + +
Sbjct: 212 S-----RAWENSKRISLIPFADFMNHDGLSASIVLRDEDNQLSEVTADRNYSPGDEVFIK 266
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 247
G N+ L++++GF N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 267 YGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 159/347 (45%), Gaps = 35/347 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ EK + ++S W Y+ L + + +S + WSE ELA L G+ + L
Sbjct: 122 SVALFLIREKYE-EESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTL 173
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E ++ E+ +L+ + D+ + T + F AF ++S +
Sbjct: 174 GVKEYVENEFLKLEQEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR--- 223
Query: 143 ARRFALVPL------GPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
+ L+PL P + Y+ + K A L + D L KAGE + + +
Sbjct: 224 GQNLVLIPLADLINHNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDL 283
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
+ N++L ++YGFV+ + + + + DP + DK +A+ N F V G+
Sbjct: 284 NKSNAELALDYGFVESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQ 343
Query: 253 EKEAISDMLPYLRLGYV--SDTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKAR 307
A ML YLRL + SD ++S+ ++ G + PVS E + + D K+
Sbjct: 344 TLPA--GMLQYLRLVALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSA 401
Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
L+G+ T+ EDE +L + L P+ +A ++ EK++L Q+ D
Sbjct: 402 LSGFSTTIEEDEKLLEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 158/348 (45%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL + E E+ +L + ++
Sbjct: 156 TLAFHLLCERA-NPSSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + + GE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ +++ + +I +LG PVS E +L + +AR
Sbjct: 380 QLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
L Y T+ ED++ L ++L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 483
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 158/348 (45%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + ++PL + E E+ +L + ++
Sbjct: 163 TLAFHLLCERA-NPSSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVF 214
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT+E ++ A +V + +
Sbjct: 215 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 267
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + + GE I ++ G + N++
Sbjct: 268 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAE 327
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 328 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 386
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ +++ + +I +LG PVS E +L + +AR
Sbjct: 387 QLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 442
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
L Y T+ ED++ L ++L +A +L EK++L ++ A
Sbjct: 443 LLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 490
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 156/358 (43%), Gaps = 50/358 (13%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L++E S W PY+ L + +++SPL WS+ ELA L G+ +
Sbjct: 114 AVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEELAELVGTQLLGSVT 165
Query: 83 ERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
E ++ EYN L + V +F Y TF+ FK AF ++S
Sbjct: 166 GYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAFGILRSRTFSPLT-- 216
Query: 142 LARRFALVPL------------GPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
ALVP+ G P + +S+ + D + + + AGE +
Sbjct: 217 -GEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSAGEQV 275
Query: 188 VVWCGP-QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRN 238
++ G + N+ L ++YGFV+ D D L + ++ +D DK + + N
Sbjct: 276 LMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADILEIN 335
Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCME 293
G F + G+ +M+ +LRL +S D+ ++++ + G + PVS E
Sbjct: 336 GFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVSRDNE 393
Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 351
A+ + + KA L GY T+ +D +L +L + +A +VR+ +K + LQ
Sbjct: 394 EALCTSMLEGLKAALDGYSTTVEQDMELLARGDLSTRMEIAV-VVRLGEKRVMQELQT 450
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 151/335 (45%), Gaps = 28/335 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ EK + S W Y+ L + + +S + WSE EL+ + G+ + +
Sbjct: 130 SVALFLLREKWR-DDSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTM 181
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQ 138
+ ++ E+ +++ + Q +P+ I + F F I + +AF +++ ++ +
Sbjct: 182 SVKDYVQNEFQKVEEEVILRNK--QLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVP 239
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
L A V A+ + A L + D L KAG+ + + + + N+
Sbjct: 240 FADLTNHNARVTTEDH--AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSNA 297
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
+ ++YGF++ + D + ++ D Y DK +A+ NG F + G+
Sbjct: 298 DMALDYGFIEPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT 357
Query: 258 SDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
M+PYLRL + T SV LG PVS E + + D K+ L+G
Sbjct: 358 --MIPYLRLVALGGTDAFLLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSG 413
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
Y T+ EDE ++ + NL + ++A + EK++L
Sbjct: 414 YHTTIEEDEKLMEEGNLSTRLQIAVGIRLGEKRVL 448
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 148/337 (43%), Gaps = 31/337 (9%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+++ EK SFW PYI +PL + E E+ +L GS +++L
Sbjct: 170 LALHVLCEK-YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLN 221
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + I R+Y +F LFQ P +P E FTF+ ++ A V + +
Sbjct: 222 QYKSIARQY-----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWAVSTVMTRQNQV-PT 273
Query: 141 SLARRF--ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
S R AL+P+ + + D+ + + R + + ++ G + N++
Sbjct: 274 SDGRHLITALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAE 333
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
LI+ GFV +N +DR+ V+ ++ D + K V R G + F VH G++
Sbjct: 334 FLIHNGFVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPP 392
Query: 259 DMLPYLRL-----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLA 309
++L +LR+ G + D TSE QS +S LG C V+ E L +
Sbjct: 393 ELLVFLRVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIR 452
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
Y ++ + E L ++ R A QL E ++L+
Sbjct: 453 SYRTSIKDVETELQAPDMTYHSRAALQLKLAEMQILS 489
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 158/348 (45%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFWLPYI+ L + + L + E E+ YL + +I
Sbjct: 156 TLAFHLLCER-YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P +P ++FT++ ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPA 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379
Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
+L +LR+ +++ + +I +LG PVS E +L + +AR
Sbjct: 380 QLLAFLRVFCMTEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435
Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
L Y T+ +D+ +L +L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTIHDDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 142/319 (44%), Gaps = 27/319 (8%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
LL L + LAL+L ++ G W PY+R L + G L WSE E+ +
Sbjct: 122 LLEEQPLPAWSVLALWLAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEW 174
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
L GS ++ LE + + E+ V A + + + AF + AF +
Sbjct: 175 LCGSQLHSDALEIRAAAEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLL 230
Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVV 189
S +V L L + AL+P LL + + L +A + AV L +R Y+AGE +++
Sbjct: 231 SRLVRL--AGLGDQEALLPWAD-LLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLI 287
Query: 190 WCGPQPNSKLLINYGFVDE--DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 245
G + + +LL++YGF + NP+D RL++E L D K +++G + Q+
Sbjct: 288 SYGQKTSGELLLSYGFCPDLGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQL 345
Query: 246 FHVHAGREKEAISDMLPY--LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 303
F + R A +++ Y V E + + L + P ++ A L+ +
Sbjct: 346 FPL---RMAAAPFELVHYTAFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAA 402
Query: 304 FKARLAGYPATLSEDEAML 322
KA LA YP + D A L
Sbjct: 403 CKAALAAYPRSFDGDRAEL 421
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 62/211 (29%), Positives = 95/211 (45%), Gaps = 42/211 (19%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEIL 82
L+L+L+ EK + ++S W PY+R L QL +ES + W ELA+L SPT E +
Sbjct: 51 LSLFLLSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETM 104
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E + IK E++ L +A Q +P+ C L +VSL
Sbjct: 105 ECLKIIKSEFHLLT----LANK--QVFPW---------------------CRDALGEVSL 137
Query: 143 -----ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQP 195
A +P + C+ L+ D V D+ YKAG+ I + G P
Sbjct: 138 TDFMHAYSTDQIPFA-DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTP 196
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
NS L ++YGF NP++++ V ++ DP
Sbjct: 197 NSILAVDYGFAVASNPHEQVEVPMGVSLTDP 227
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 156/348 (44%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ EK + ++S W Y+ L + + +S + WSE ELA L G+ + L
Sbjct: 122 SVALFLIREKYE-EESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTL 173
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E ++ E+ +L+ + D+ + T + F AF ++S +
Sbjct: 174 GVKEYVENEFLKLEQEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR--- 223
Query: 143 ARRFALVPLGPPLLAYSSKCK----------AMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
+ L+PL L+ ++ K A L + D L KAGE + + +
Sbjct: 224 GQNLVLIPLAD-LINHNPAIKTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYD 282
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
+ N++L ++YGFV+ + + + + DP + DK +A+ N F + G
Sbjct: 283 LNKSNAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDG 342
Query: 252 REKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKA 306
+ A ML YLRL + D ++S+ ++ G + PVS E + + D K+
Sbjct: 343 QTLPA--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKS 400
Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
L+G+ T+ EDE +L L P+ +A ++ EK++L Q+ D
Sbjct: 401 ALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 104/221 (47%), Gaps = 16/221 (7%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L ++ +A LA ++ EKK G+ S W PYI L +Q G+L + + W+E+EL +
Sbjct: 96 LIGEEVGNIAKLATVILIEKKLGQGSEWYPYISCLPQQ---GEL--HNTVFWTESELEMI 150
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E +++ I++++ + ++ + F Y A T +F V +
Sbjct: 151 RPSSVYQETIDQKSQIEKDFLAIKHIFECSHQSFGDSTYKDFMHACTLVLFDHFNVELP- 209
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-------AVQLVVDRPYKAGE 185
V + AL+P L + +A++ + DD ++Q++ DR Y GE
Sbjct: 210 --VGSRAWGSTNGLALIPFAD-FLNHDGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGE 266
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
+++ G N+ L++++GF N YD++ ++ + DP
Sbjct: 267 QVLIRYGKFSNATLMLDFGFTIPYNIYDQVQIQFDIPKHDP 307
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 33/342 (9%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ EK + ++S W Y+ L + + +S + WSE ELA L G+ + L
Sbjct: 122 SVALFLIREKYE-EESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTL 173
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E ++ E+ +L+ + D+ + T + F AF +++ + ++
Sbjct: 174 GVKEYVENEFLKLEQEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTS 221
Query: 143 ARRFALVPLGPPL----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
F + P + AY K A L + D L KAGE + + + + N+
Sbjct: 222 MFEFEQINHNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNA 280
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
+L ++YGFV+ + + + + DP + DK +A+ N F + G+ A
Sbjct: 281 ELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA- 339
Query: 258 SDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKARLAGYP 312
ML YLRL + D ++S+ ++ G + PVS E + + D K+ L+G+
Sbjct: 340 -GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFD 398
Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
T+ EDE +L L P+ +A ++ EK++L Q+ D
Sbjct: 399 TTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 440
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 116/248 (46%), Gaps = 20/248 (8%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA +L+ E+ SFW PYI+ L + ++PL + E E+ YL + ++
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 208 SQYKNTARQY-----AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260
Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
+R AL+PL + DD + V R + AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAE 320
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379
Query: 259 DMLPYLRL 266
+L +LR+
Sbjct: 380 QLLAFLRV 387
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 149/346 (43%), Gaps = 61/346 (17%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAE 80
L++Y++ EKK KSFW P++ L VE P+L+++ EL +L GSP +
Sbjct: 426 LSIYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQ 475
Query: 81 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQ 138
+ ER E I ++Y Q IP A T + F A + S + L
Sbjct: 476 VKERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL- 520
Query: 139 KVSLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIV 188
++ R A VPL AY++ K ML A +D + G+ I
Sbjct: 521 FINKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIY 571
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFH 247
CG + NS+ L+NYGFV ++N + + + + +D K +M+ +R K +F
Sbjct: 572 YSCGRKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFR 629
Query: 248 VHAGREKEAISDMLPYLRLGYVSDT------SEMQSVISSLGPIC--PVSPCMERAVLDQ 299
+H E++++ + +LR + D EM S P+ P S E+ + +
Sbjct: 630 IHINYEEKSVLEFFGFLRFVLIRDYIVLEKFHEM-SEGKEFDPLRTPPFSIENEKQMWTE 688
Query: 300 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ + YP TL ED+ +L L ++ L EK++L
Sbjct: 689 IHKICAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEIL 734
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 101/213 (47%), Gaps = 19/213 (8%)
Query: 15 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
T+++ + LA L+ EKK+G+KS W+PYI L + + S + W E E + +
Sbjct: 108 TDEVGNIGKLAAVLIREKKKGQKSRWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRC 162
Query: 75 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
S E +++ I++E++ F+A + Q YP I E E F A+ V S
Sbjct: 163 SAVHKETVKQKAQIEKEFS------FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-- 212
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGP 193
+ ++ +L+P + +L+ D+ + ++ DR Y G+ + + G
Sbjct: 213 ---RAWETSKGISLIPFADFMNHDGLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGE 269
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
N+ L++++GF N +D + ++ + +DP
Sbjct: 270 FSNATLMLDFGFTVPYNIHDEVQIQMDVPNDDP 302
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 147/339 (43%), Gaps = 27/339 (7%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA++++ EK + SFW PY+ L + PL ++ E+ GS E L+
Sbjct: 158 LAIFILSEKLK-SDSFWKPYLDVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLK 209
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSL 142
+ + I R+Y L F +L + I E FT++ ++ A V + + K
Sbjct: 210 QHKNIARQYAYL----FKLLNLPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGK 264
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
+L+PL + + K D+ + R + GE I + G + ++ LL+
Sbjct: 265 GMSLSLIPLWDMCNHANGEMKTDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLY 324
Query: 203 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDM 260
GFV N YD + ++ L++ D Y K + KL V Q +H+ AG+E + ++
Sbjct: 325 SGFVYPGNVYDGMAIQLGLSSSDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-EL 382
Query: 261 LPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
L +LR+ + D + Q++ S + +S E LA Y
Sbjct: 383 LTFLRIFCMQDLELRDRLLGDNRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQY 442
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
++ EDE L D NL ++R A QL+ +EK L L+
Sbjct: 443 KTSIQEDEEKLKDGNLSAQERSALQLLLIEKCTLENVLE 481
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 155/370 (41%), Gaps = 65/370 (17%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L+ +L+ EK + SFW PY+ L Q P+ ++ +L +L GSP +I +
Sbjct: 215 LSTFLLQEKSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQIND 267
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ +K++YN++ V F QY +F F A + S + + +
Sbjct: 268 KLSDLKKDYNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGV 313
Query: 144 RRFALVPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+ A VPL P L + YS + + + D+ + DR G+ I G +
Sbjct: 314 KTDAFVPLADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRK 365
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQRNGK 240
NS+ L+NYGFV +DN + + V A DP Q K R++ +G
Sbjct: 366 CNSRFLLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGI 425
Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCME 293
+ F + + + + Y+R + D +++Q +++ + P I P+ E
Sbjct: 426 NEITHFL------EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNE 479
Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
+ D + L+ YP TL +D+ +L +L +R L EK++L Q +
Sbjct: 480 LDMWDLIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSE 539
Query: 354 DMIMLLPDVT 363
M LL +
Sbjct: 540 KMKQLLSNFN 549
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 147/341 (43%), Gaps = 40/341 (11%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
++L ++ E +G S W PY+ L RQ +S + WSE EL + G+ + +
Sbjct: 89 SVSLLILREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTM 141
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E ++ E++ ++ AG + D+ TF+ F AF ++S V +
Sbjct: 142 GVKEYVQSEFDNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR--- 191
Query: 143 ARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
+ AL+P + + K K L D L K+GE I V +
Sbjct: 192 GDKLALIPFADLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYD 250
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
+ N++L ++YGF + ++ D + ++ DP Y+DK +A+ NG F V G
Sbjct: 251 LDKSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG 310
Query: 252 REKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYF 304
+ M+ YLRL + T V L PVS E ++ + +
Sbjct: 311 --ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNAC 366
Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
K+ LA Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 367 KSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 151/352 (42%), Gaps = 44/352 (12%)
Query: 14 TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 73
+ L ++L ++ E +G S W PY+ L RQ +S + WSE EL +
Sbjct: 84 SGGDLRPWVSVSLLILREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQ 136
Query: 74 GSPTKAEILERAEGIKREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
G+ ++L G+K EY +E D V AG + D+ TF+ F AF ++
Sbjct: 137 GT----QLLSTTMGVK-EYVQSEFDNV--EAGII--NVNKDLFPGTITFDDFLWAFGVLR 187
Query: 132 SCVVHLQKVSLARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPY 181
S V + + AL+P + + K K L D L
Sbjct: 188 SRVFPELR---GDKLALIPFADLINHDGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDV 243
Query: 182 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
K+GE I V + + N++L ++YGF + ++ D + ++ DP Y+DK +A+ NG
Sbjct: 244 KSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGM 303
Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCME 293
F V G + M+ YLRL + T V L PVS E
Sbjct: 304 GETAYFDVVLG--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNE 359
Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
++ + + K+ LA Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 360 ESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 133/336 (39%), Gaps = 57/336 (16%)
Query: 17 KLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYL 72
+L E + LA +L + + G + YIR L R+ G S L W E E+ L
Sbjct: 148 ELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------SVLDWPEDEVDKLL 200
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
GSP++ ER + + +E+ + +P T + AF + S
Sbjct: 201 KGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------ITVGALRWAFDILFS 244
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
++ L ++ ALVP +L + C A + DAV L DR Y GE + G
Sbjct: 245 RLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDRSYVKGEQVWASYG 301
Query: 193 PQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
+P+S+LLI+YGF E +NP D + ++ DP K V + G V+ F +
Sbjct: 302 QRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLRL 361
Query: 251 GR--------------EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP------ 290
E S++ + + + QS+ S+ +
Sbjct: 362 NGYPRQLLQYASFILCNPEKPSELKGLAQSAFTGSANIGQSIFDSVRGLTNGKARGKQGV 421
Query: 291 -----CMERAVLDQLADYFKARLAGYPATLSEDEAM 321
E AV + LAD L+ YP TL +D+ +
Sbjct: 422 ILGGVAGEIAVREMLADLCAEALSAYPNTLEKDKGL 457
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 106/233 (45%), Gaps = 27/233 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA L+ EKK G+KS W+PYI L + + S + W E EL+ + S E ++
Sbjct: 2 LAAVLIREKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVK 56
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ I+++++ F+A + Q P I TE E F A+ V S + +
Sbjct: 57 QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103
Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 197
+R +L+P + +L D+ +Q+ DR Y G+ + + G N+
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNA 163
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 247
L++++GF N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 164 TLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 149/345 (43%), Gaps = 32/345 (9%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA+ L+ EK + + S W PY+ L + L + ++ L GSPT L+
Sbjct: 158 LAIALLIEKHK-EYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALK 209
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQ 138
+ I R+Y +F LFQ+ + + FT+E + A V + ++ +
Sbjct: 210 QCRNIARQY-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSK 262
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
SL AL+P+ SK A + + R +K E I + GP+ NS
Sbjct: 263 DGSLMIH-ALIPMWDMCNHEDSKITTDFNATLNCCECYALRDFKKAEQIFISYGPRTNSD 321
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
++ GFV DN D + ++ DP ++++ + + +V F + G E IS
Sbjct: 322 FFVHSGFVYMDNEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG--TEPIS 379
Query: 259 D-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGY 311
D +L +LR+ + E+ I S + + + +E V + + + RL A Y
Sbjct: 380 DTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANY 438
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
P TL ED +L + L K++A QL EK++L L+ I
Sbjct: 439 PTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 91/219 (41%), Gaps = 36/219 (16%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L ++ +SE A LAL L++EK+ G+ S W PYI L R + S W + EL +
Sbjct: 98 LLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELTMI 152
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E + R I+ E+NE+ +V F +D + A V+
Sbjct: 153 QESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDTDNGSTEVEE 207
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
V+ + +A A QL D+ Y AGE + + G
Sbjct: 208 VKVYSDCLYIALFCA--------------------------QLFADKNYAAGEQVTISFG 241
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
P N+ L +++GF NP+D++ + ++ D ++K
Sbjct: 242 PLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 103/230 (44%), Gaps = 24/230 (10%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
++ +E ++L +M EK++G+ S W PYI L G+ + S + W ELA L S
Sbjct: 104 DQANEFTRVSLLVMAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECS 158
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
P ER ++ EY E+ V S Y D+ + E FK + V S
Sbjct: 159 PIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAW 210
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGE-----SIVV 189
S ++PL + + + + + DD +VV R Y+ G+ + +
Sbjct: 211 GQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHI 266
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 238
G N+ L ++YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 267 CYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 103/230 (44%), Gaps = 24/230 (10%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
++ +E ++L +M EK++G+ S W PYI L G+ + S + W ELA L S
Sbjct: 104 DQANEFTRVSLLVMAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECS 158
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
P ER ++ EY E+ V S Y D+ + E FK + V S
Sbjct: 159 PIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAW 210
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGE-----SIVV 189
S ++PL + + + + + DD +VV R Y+ G+ + +
Sbjct: 211 GQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHI 266
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 238
G N+ L ++YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 267 CYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 15/209 (7%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 83 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 137
ER + +++Y+ +LD + F +QY +++ A T I +AF A V S V
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
+ AL+PL L + K A D+ + L+V + AG+ I GP+ N
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNE 274
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDP 226
+LLINYGF NP D +V + + P
Sbjct: 275 QLLINYGFCIAGNPTDYRIVLLGVKPDSP 303
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 92/196 (46%), Gaps = 23/196 (11%)
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 232
+ R + AGE +++ GP+ N LL YGFV++DNP D + ++ D +D R
Sbjct: 40 VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99
Query: 233 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 277
++ + GKL V V GR EKE ++P RL V D E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158
Query: 278 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVA 334
SL +SP E A D L G+ TL+EDEA L+ +L +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218
Query: 335 TQLVRMEKK-MLNACL 349
RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 154/365 (42%), Gaps = 54/365 (14%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTK 78
EL LAL+L EK + K S W PY++ L+ S L ++E E L G+
Sbjct: 160 ELVQLALWLCCEKAKAKGSEWWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSID 212
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHL 137
E +R + K EY L + P P + + F + + AF+ A+
Sbjct: 213 KEARQRRDSAKEEYEALRAA-------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARA 264
Query: 138 QKVSLARRFALVPL--------GPPLLA-------------------YSSKCKAMLAAVD 170
Q ++ A +A+VPL PP ++ + +C VD
Sbjct: 265 QWLNSANCYAMVPLMDAIPICGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVD 324
Query: 171 DA-VQLVVDRPYKAGESIV-VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
A V L + AG I+ + + NS+L +++G VD+ +P D L+ DP Y
Sbjct: 325 SASVVLCANTRASAGSKILQIDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLY 384
Query: 229 QDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 287
K+ V + G Q F V+ R + L YLR V+++ E+ +V + +
Sbjct: 385 AAKKSVLEAQGFADKGQTFPVYKDR---MPREFLSYLRFARVTNSEELFAVSFTEDKV-- 439
Query: 288 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLN 346
VSP E L L + R++ Y T EDE +L ++ K R A++L R EK+++
Sbjct: 440 VSPMNEYETLQLLMADCRDRMSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVG 498
Query: 347 ACLQV 351
+
Sbjct: 499 EMMNA 503
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 146/354 (41%), Gaps = 52/354 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+L+ E+ SFWLPYIR L ++ + PL + + ++ L G+ ++L
Sbjct: 157 LALHLLCERAN-PASFWLPYIRSLPQE-------YDIPLYYQQEDVQLLLGTQAVQDVLS 208
Query: 84 RAEGIKREY----------NELDTVWF---------MAGSLFQQYPYDIPTEAFTFEIFK 124
+ + R+Y L +V M G LF Q+ A + + +
Sbjct: 209 QYKNTARQYAYFYKLVQDKGMLGSVELRLFASLTPVMGGKLFDQW-------AVSSVMTR 261
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
Q + + +V+LA L+PL + DD + V + YK
Sbjct: 262 QNQIPTEDG----SRVTLA----LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKEN 313
Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
E I ++ G + N++ +I+ GF +D+ +DR+ ++ ++ + Y K V R G +
Sbjct: 314 EQIYIFYGTRSNAEFVIHNGFFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASY 373
Query: 245 VFHVHAGREKEAISDMLPYLRLG---------YVSDTSEMQSVISSLGPICPVSPCMERA 295
VF +H E + +L +LR+ Y+ + + + PVS E
Sbjct: 374 VFALHCN-EPPISAQLLAFLRVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIK 432
Query: 296 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
+ L L Y T ED +ML +L R+A QL EK++L L
Sbjct: 433 LWTFLETRAALLLKTYKTTSEEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 151/348 (43%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ E +G S W Y+ L ++ +S + WSE ELA + G+ + L
Sbjct: 143 SVALFLIRENLKGD-SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTL 194
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E +K E+ +++ + D+ T + F AF ++S +
Sbjct: 195 NVKEYVKSEFLKVEEEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR--- 244
Query: 143 ARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
+ L+P + A+ K A L + D L KAG+ + + +
Sbjct: 245 GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYD 304
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
+ N+ L ++YGF+++ + + + + D + DK +A+ NG F +
Sbjct: 305 LKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL- 363
Query: 252 REKEAISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKA 306
E+ MLP+LRL + T ++S+ S G + PVS E + + + +A
Sbjct: 364 -ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEA 422
Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
L+GY T+ EDE L + NL + R+A + EK++L +Q+ D
Sbjct: 423 ALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 142/352 (40%), Gaps = 58/352 (16%)
Query: 24 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L +YL++++K G SF+ PY L P+ WS EL L GS ++I
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+R + I+ +Y + V G+L T + FK A + V S LQ +
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216
Query: 143 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
R ALVP L Y + K V + + +AG + G + N + L+
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLL 276
Query: 202 NYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
NYGF EDN + + +E ++ D +QDK R E
Sbjct: 277 NYGFAVEDNRELDGFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETN 322
Query: 256 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 315
IS G V+ Q+V S+G P S E + + RLA YP T+
Sbjct: 323 QIS--------GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTI 371
Query: 316 SEDEAMLTDYNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 361
S+D A L D +P+ +R A VR EK++L+ + DM+ + D
Sbjct: 372 SQDMADLQDEASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 149/363 (41%), Gaps = 51/363 (14%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L +M E +Q S W PY+ L ++ ++S + WSE ELA L S +I
Sbjct: 89 ALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWSEEELAELQASSVAKKIG 140
Query: 83 ERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
+ E + + L F S+ Y +DIP E E KQ +
Sbjct: 141 RSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE----EPAKQENGGAEGET 196
Query: 135 VHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
L +K L+ ++PL L A + + A + ++ +++ +P AGE I
Sbjct: 197 DDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYENEDLEMRTIKPIMAGEEI 252
Query: 188 VVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAA----LNTEDPQY-------------- 228
G P S LL YG+V E+ YD + + +A L TE PQ
Sbjct: 253 FNDYGQLPRSDLLRRYGYVTENYAQYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAE 312
Query: 229 -QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 287
+++ +A R G L A E+ AI D L L ++ D ++++++S +
Sbjct: 313 AEERVALADREGILEDSYDVNIANAEERAIPDELLALLYLFLLDNENLEAIVTSQSALPS 372
Query: 288 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
S V L + R A Y TL EDE +L NL + +A Q+ EK++L
Sbjct: 373 RSKLATELVGKVLVKVLRHREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRL 432
Query: 348 CLQ 350
++
Sbjct: 433 AVE 435
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 151/348 (43%), Gaps = 37/348 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ E +G S W Y+ L ++ +S + WSE ELA + G+ + L
Sbjct: 143 SVALFLIRENLKGD-SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTL 194
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E +K E+ +++ + D+ T + F AF ++S +
Sbjct: 195 NVKEYVKSEFLKVEEEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR--- 244
Query: 143 ARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
+ L+P + A+ K A L + D L KAG+ + + +
Sbjct: 245 GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYD 304
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
+ N+ L ++YGF+++ + + + + D + DK +A+ NG F +
Sbjct: 305 LKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL- 363
Query: 252 REKEAISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKA 306
E+ MLP+LRL + T ++S+ S G + PVS E + + + +A
Sbjct: 364 -ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEA 422
Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
L+GY T+ EDE L + NL + R+A + EK++L +Q+ D
Sbjct: 423 ALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 101 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 160
M +F +YP P E F E FK +F + S +V L S+ + ALVP ++ +S
Sbjct: 73 MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129
Query: 161 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLV 216
+ + L + +V DRPY+ GE + + G + N +LL++YGFV ++ NP D +
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVE 189
Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
+ +L D Y++K + ++ G Q F
Sbjct: 190 LSLSLKKSDESYKEKLELLKKYGLSGSQCF 219
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/375 (23%), Positives = 169/375 (45%), Gaps = 50/375 (13%)
Query: 8 YAAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
Y A L+ N++ + CLALYL E+ S + PYI + ++ + L +
Sbjct: 140 YLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTILYF 191
Query: 65 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFE 121
E+ YL G+ + + + + I R++ L V+ GS ++ +P +A FTF+
Sbjct: 192 KPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAFTFD 249
Query: 122 IFKQAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKCKAM 165
++ AV + H+ V AL+P+ GP AY+
Sbjct: 250 TYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN------ 303
Query: 166 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
A+ ++ + + +K GE + + G + NS LLI+ GFV +++P+D++ + ++ +D
Sbjct: 304 --ALTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVSQKD 361
Query: 226 PQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEMQSV 278
P Y K +++ + N ++S Q +L +LR+ ++++ + ++
Sbjct: 362 PLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQKNE 421
Query: 279 ISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 335
+SSL I V + V + L + K L G+ E M+ D +L + ++A
Sbjct: 422 LSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSKLAL 481
Query: 336 QLVRMEKKMLNACLQ 350
Q E ++L+AC+
Sbjct: 482 QFRIEEHRILSACVN 496
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 160/397 (40%), Gaps = 83/397 (20%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L + L L LMYE QG+KS W PY L + ++P+ WSE+EL L S
Sbjct: 660 RLDSWSSLILVLMYEYLQGEKSQWKPYFDVL-------PSSFDTPMFWSESELDQLQASH 712
Query: 77 TKAEILE-RAE-------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
+ +I + AE G R ++L + GS Y +D+ +
Sbjct: 713 MRHKIGKADAESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLEND 772
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 176
E +V + + +VP+ +L ++ A + ++++ +
Sbjct: 773 EDEEEEETDGWVEDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVT 822
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVA 235
RP KAGE I + GP PNS+LL YG+V E + YD VVE + + V
Sbjct: 823 SLRPIKAGEEIFNYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VM 873
Query: 236 QRNGKLSVQVFHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP--------- 284
+ N +S QV H E+E D R +T E+ S + GP
Sbjct: 874 RLNFGISGQVLEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPED 928
Query: 285 --------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYN 326
+ V A+ D+ LA +A + YP + SED +L +
Sbjct: 929 LQEQLKTFLKGVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQD 988
Query: 327 LHPKKRVATQLVRMEKKMLNACLQVTA--DMIMLLPD 361
L + R+A ++ EKK+L + T+ D+ M + D
Sbjct: 989 LSQRTRMAIEVRLGEKKLLQEAIASTSSVDVEMTVDD 1025
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 89/195 (45%), Gaps = 26/195 (13%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL ++ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQP-----AELDNTFLWEDTELSYLKASPLYGKTRE 168
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
R E I E+ ++ + LF + + E FK + V S + +++
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214
Query: 144 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
LV + P+L + +S K + + + DR Y + I + G N++
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAE 272
Query: 199 LLINYGFVDEDNPYD 213
L ++YGF +NPYD
Sbjct: 273 LALDYGFTVPENPYD 287
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/356 (22%), Positives = 149/356 (41%), Gaps = 44/356 (12%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L+ +L+ EKK + SFW PY+ L + P+ +++++L +L GSP ++ +
Sbjct: 240 LSTFLLQEKK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKD 292
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ +K++Y ++ V A Q +F+ F A + S + + +
Sbjct: 293 KITDLKKDYCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGV 338
Query: 144 RRFALVPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+ A VPL P L + YS + + + D+ ++ G+ I G +
Sbjct: 339 KTDAFVPLADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSK 390
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
NS+ L+NYGFV +DN + + V + Q K +++ + + F +
Sbjct: 391 CNSRFLLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPND 449
Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKARLA 309
+ SD + ++R + + E +++ I P +S E A + + + L
Sbjct: 450 VSFSDFMSFIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALN 509
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
YP TL +D +L L +R L EKK+LN Q + M L + S
Sbjct: 510 QYPTTLEQDLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 12/213 (5%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
+AL+PL + V D + + + +KAGE + ++ G + N+ L ++ GF
Sbjct: 14 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 73
Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--MLPY 263
V E+N YD + ++ DP Q + + GKLS+ + R+ + D +L +
Sbjct: 74 VFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPIDGQLLAF 130
Query: 264 LRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 317
LR+ + + ++ I+S LG I C + +E L K L+ Y TL E
Sbjct: 131 LRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKTTLDE 189
Query: 318 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
DE +L + P + +A ++ EK+++ L+
Sbjct: 190 DEKLLAEAQATPNRLLAIKMRATEKRIIRETLE 222
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 71/339 (20%), Positives = 141/339 (41%), Gaps = 31/339 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L+E LA+ + E+ G+KS W Y L +++ + W +++ +L GS
Sbjct: 134 LNEYDSLAIKAIQERAIGEKSKWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKV 188
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
E IK ++ ++ F L P + F + ++ A + S + L
Sbjct: 189 LNASFYLKEKIKIQFLRIEKTIFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFL 242
Query: 138 QKVSLARRFALVPLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
Q + ++ ALVP P K + + ++ + + D+ Y + I G
Sbjct: 243 QNM---KKIALVPYADFINHNPFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYG 299
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVH 249
+ N +LL+ YGF+ E NP+D + + AL+T+D Y K ++ +++ VF+
Sbjct: 300 QKTNLELLVLYGFIIERNPFDSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYK 359
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+E + ++RL S+L + +E+ + + K L
Sbjct: 360 YPKE------LYEFMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLK 413
Query: 310 GYPATLSEDEAMLTDYN---LHPKKRVATQLVRMEKKML 345
Y T++E++ + N L ++ + + + EKK+L
Sbjct: 414 AYNKTINEEKILNNLSNIIVLTKNQKTSIKQRKCEKKIL 452
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 114/247 (46%), Gaps = 20/247 (8%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 96 LAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFS 147
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ + R+Y V Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 148 QYKNTARQYAYFYKV-------IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTE 200
Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 201 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEF 260
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 261 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQ 319
Query: 260 MLPYLRL 266
+L +LR+
Sbjct: 320 LLAFLRV 326
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/340 (22%), Positives = 147/340 (43%), Gaps = 33/340 (9%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+L+ E + + SFW YI L + L + + + SPT +
Sbjct: 163 LALFLILEHVKNENSFWHHYISSLPSD-------YNTVLYFDLNDFLEMKNSPTFEMATK 215
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ I R+Y +F +LFQ + I FT+++++ A V + +
Sbjct: 216 HCKNIARQY-----AYF--NNLFQNSNDEASLILRNVFTYQLYRWAVSTVMTRQNFIPSS 268
Query: 141 SLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
S + L+PL + +++ + VD + L +P+K GE ++++ G +
Sbjct: 269 STSNDVENGINGLIPLWD-MCNHTNGYLSTQYKVDRSECLAC-KPFKKGEQVLIFYGERS 326
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
NS L++ GFV ++NP+D + ++ D + + + + G F++++G E
Sbjct: 327 NSDFLVHNGFVYDENPHDSFRLRLGISKSDKLHGLRCELLKDLGIPDSGDFYLYSGSEP- 385
Query: 256 AISDMLPYLRLGY--VSDTSEMQSVISSLGPI----CPVSPCMERAVLDQLADYFKARLA 309
++L +LR+ V + + +S S L + C + +E V + L D L
Sbjct: 386 VRENLLAFLRIFNMDVENLNHWKSHSSRLSDLMWKDCALDTKIESKVWNFLYDRINLLLK 445
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
Y E E + D N +R+ ++ EKK L++ L
Sbjct: 446 TYKG--DEVEVRVEDSNSTECRRLVRAQLKCEKKFLSSIL 483
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 123/262 (46%), Gaps = 14/262 (5%)
Query: 105 LFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSS 160
+ Q +P+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 9 VIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNG 68
Query: 161 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 69 LITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 128
Query: 221 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD--------- 271
++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 129 VSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 187
Query: 272 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 331
S + + + PVS E + L D L Y T+ ED+++L +++L +
Sbjct: 188 DSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRA 247
Query: 332 RVATQLVRMEKKMLNACLQVTA 353
++A +L EK++L ++ A
Sbjct: 248 KMAIKLRLGEKEILEKAVKSAA 269
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 100/220 (45%), Gaps = 26/220 (11%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 198
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 503
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
L ++ GFV E+N YD + ++ DP Q + + GKLS+ + R+ +
Sbjct: 504 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 560
Query: 259 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 310
D +L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 561 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 619
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
Y TL EDE +L + P + +A ++ EK+++ L+
Sbjct: 620 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLE 659
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 114/248 (45%), Gaps = 11/248 (4%)
Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
E+FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 12 ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
V + ++AGE I ++ G + N++ +++ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 72 CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 131
Query: 235 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLGPI- 285
R G + VF +H E + +L +LR+ +++ + ++ +LG
Sbjct: 132 LARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFTLGNSE 190
Query: 286 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
PVS E + L D L Y T+ ED+A L L P+ +A +L EK++L
Sbjct: 191 YPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLGEKEIL 250
Query: 346 NACLQVTA 353
++ A
Sbjct: 251 EKAVRSAA 258
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 91/392 (23%), Positives = 158/392 (40%), Gaps = 61/392 (15%)
Query: 5 LFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
LF + L + L L +MYE G +S W PYI L L ++P+ W
Sbjct: 78 LFESQGDEDEEQALDSWSALILIMMYEFFLGDESKWKPYIDVL-------PLTFDTPMFW 130
Query: 65 SETELAYLTGSPTKAEILE-RAEGIKREY--------------------NELDTVWFMAG 103
SE EL+YL S T +I + AE + R +L + G
Sbjct: 131 SEEELSYLQASATVNKIGKADAEEMFRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMG 190
Query: 104 SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK 163
S Y +D+ E + +V + + V++A +L ++
Sbjct: 191 STIMAYAFDLENEEAENDEESDGWVEDREGKSMMGMVAMA----------DILNADAEFN 240
Query: 164 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRL 215
A + D+ + + R KAGE I+ + GP PNS+LL YG++ E + P+D
Sbjct: 241 AHVNHGDEELTVTSIRDIKAGEEILNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD-- 298
Query: 216 VVEAALNTEDPQYQD------KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 269
V+ +L +E QD +RM ++ + V +G + P + G
Sbjct: 299 AVQHSLMSELGVPQDIMAETMERM--DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMP 356
Query: 270 SDTSE-MQSVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
D E +++ I L G + + + + + + + Y T++EDE +L
Sbjct: 357 PDLKEQLKATIKLLQKLDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQ 416
Query: 325 YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
+L ++R+A Q+ EKK+L ++M+
Sbjct: 417 DSLTRRQRMAVQVRLGEKKLLQEACDHFSEMV 448
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 98/213 (46%), Gaps = 23/213 (10%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
ER + + +Y D+ + G+ +P E +T+E++ A + S + +S
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211
Query: 143 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
A + AL+PL L + K A D + L+V + AG+ I GP
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGP 270
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
+ N +LLINYGF NP D +V + + P
Sbjct: 271 RNNEQLLINYGFCIAGNPTDYRIVHLGVKPDSP 303
>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
Length = 206
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 78/160 (48%), Gaps = 13/160 (8%)
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G
Sbjct: 10 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67
Query: 254 KEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 305
E++ MLPYLRL + T +V L PVS E A+ + + K
Sbjct: 68 -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACK 124
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 125 SALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 147/361 (40%), Gaps = 36/361 (9%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LAL+LM+E+ +G +SF+ P+I + + P+ W+E EL L G+
Sbjct: 83 VLALHLMHERSKGGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQ 135
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+ ++R++ + + F +PT T + + A + S VS
Sbjct: 136 LMKQHLERDFENIHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSK 187
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---------VDRPYKAGESIVVWCGP 193
++ V L P + ++ + +DD V V AG ++ + G
Sbjct: 188 GGKYLHV-LCPAMDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQ 245
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
N+KLL +YGFV +N + + DP ++ K+ V N Q + H
Sbjct: 246 YSNAKLLYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLL 305
Query: 254 KEAISD-MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
+ + +L LR+ +++ Q + I V E AV + L + +L+ Y
Sbjct: 306 SNDVDERLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYA 363
Query: 313 ATLSEDEAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPY 371
TL EDEA+L + K R+A + VRME K QVT +I L S P
Sbjct: 364 TTLEEDEAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPD 417
Query: 372 A 372
A
Sbjct: 418 A 418
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 146/341 (42%), Gaps = 39/341 (11%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL L+ E +G S W PY+ L RQ S A L S + ++L
Sbjct: 130 AVALLLLSEVARGADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLL 177
Query: 83 ERAEGIKREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHL 137
G+K EY +E D+V S + D+ + TF+ F AF ++S V +
Sbjct: 178 STTVGVK-EYVQSEFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRG 232
Query: 138 QKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
K++L LV P + + S K K + + L K+G+ I + +
Sbjct: 233 DKLALVPFADLVNHSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDL 291
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
+ N++L ++YGFV+ + D V ++ DP Y DK +A+ N F +
Sbjct: 292 DKSNAELALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL-- 349
Query: 253 EKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 305
++ MLPYLRL + T SV L P+SP E ++ + D K
Sbjct: 350 DEPLPPQMLPYLRLLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACK 407
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
+ LA Y T+ EDE + NL P+ +A + EKK+L
Sbjct: 408 SALAAYHTTIEEDEELSERENLQPRLTIAIGVRAGEKKVLQ 448
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 148/345 (42%), Gaps = 32/345 (9%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA+ L+ EK + + S W PY+ L + L + ++ L GSPT L+
Sbjct: 158 LAIALLIEKHK-EYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALK 209
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQ 138
+ I R+Y +F LFQ+ + + FT+E + A V + ++ +
Sbjct: 210 QCRNIARQY-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSK 262
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
SL AL+P+ +SK A + + R +K E I + G + NS
Sbjct: 263 DGSLMIH-ALIPMWDMCNHENSKITTDFNATLNCCECYALRDFKKAEQIFISYGARTNSD 321
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
++ GFV DN D + ++ DP +++ + + +V F + G E IS
Sbjct: 322 FFVHSGFVYMDNEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG--TEPIS 379
Query: 259 D-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGY 311
D +L +LR+ + E+ I S + + + +E V + + + RL A Y
Sbjct: 380 DTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANY 438
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
P TL ED +L + L K++A QL EK++L L+ I
Sbjct: 439 PTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 32/206 (15%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILER 84
+L+ E +GK SFW PYI L Q+AV + P W + ++AYL G+ I E
Sbjct: 117 FFLVKEYLKGKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEI 173
Query: 85 AEGIKREYNELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQK 139
E +KRE+ + A L ++ + D+P A+T ++K AF S L
Sbjct: 174 QENVKREFKQ-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSD 224
Query: 140 VSLARRFALVPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGE 185
+ R AL+P G PLL ++ +V D +L+ PY+ G
Sbjct: 225 ATKRRLSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGT 284
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNP 211
+ G + NS+LL+ YGF+ + P
Sbjct: 285 QVYNNYGLKTNSELLLAYGFILPETP 310
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 148/359 (41%), Gaps = 47/359 (13%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L LM E Q + S W PY+ L + ++S + WS++ELA L S +I
Sbjct: 86 VLTSILMTEA-QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIG 137
Query: 83 ERA-----------EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
++ +G+K E+ S+ Y +DIP + + A
Sbjct: 138 KKDAEDMFKSYIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGD 194
Query: 132 SCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
V +K L+ ++PL L A + + A L ++ +++ +P GE I
Sbjct: 195 DLVSDDGEDEKTILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIF 250
Query: 189 VWCGPQPNSKLLINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQDK 231
G P S LL YG+V D + YD L+V N T+D + +
Sbjct: 251 NDYGQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTR 309
Query: 232 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 291
+A+R G VH+ ++ +I D L + D S ++++++S I S
Sbjct: 310 LELAEREGVYEDSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKL 369
Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
LA +AR Y TL EDE +L + +L + +A Q+ EKK+L A +Q
Sbjct: 370 TTELAGQVLATLLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 161/389 (41%), Gaps = 61/389 (15%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L + L L ++YE QG+ S W PY L + ++P+ WS+ EL L S
Sbjct: 96 LDSWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHM 148
Query: 78 KAEILE-RAEG--------IKREYNELDTVWFMAGS--------LFQQYPYDIPTEAFTF 120
+ +I + AE I R E+ F AG+ + + I AF
Sbjct: 149 RHKIGKADAENMFQKTLLPIIRSNAEI----FNAGNKTDAELIEIAHRMGSTIMAYAFDL 204
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 180
E ++ V S+ +VP+ +L ++ A + ++++ + RP
Sbjct: 205 ENDEEEEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRP 260
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL--NTEDPQYQD 230
KAGE I+ + GP PNS+LL YG+V E + P+D +VE+AL N P
Sbjct: 261 IKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIP---- 314
Query: 231 KRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL 282
+++ Q G L V G + P D E +++ + +
Sbjct: 315 GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKGI 374
Query: 283 GPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 338
P + +R + Q LA +A +A YP ++SEDE +L +L+ + R+A +
Sbjct: 375 KKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLK-QDLNQRTRMAIAVR 433
Query: 339 RMEKKMLNACLQVTADMIMLLPDVTVSPC 367
EKK+L + ++ + + D P
Sbjct: 434 LGEKKLLQEAITASSGDVEMTMDDESGPA 462
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 147/356 (41%), Gaps = 46/356 (12%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L+ +L+ EK + SFW PYI L P+ ++ ++L +L GSP +I +
Sbjct: 215 LSTFLLQEKFRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKD 267
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ ++++YN++ V F QY F F A + S + + ++
Sbjct: 268 KLADLQKDYNDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGV 313
Query: 144 RRFALVPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+ A VPL P L + YS + + + D+ ++ G+ I G +
Sbjct: 314 KTDAFVPLADMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRK 365
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
NS+ +NYGFV E N + + + + DP Q K + + + + F + ++
Sbjct: 366 CNSRFFLNYGFVVEGNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDE 424
Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGP-------ICPVSPCMERAVLDQLADYFKAR 307
A+ D + ++R + D ++++ +++ P+ E + + K
Sbjct: 425 TAVIDFMSHIRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKT 484
Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 363
L YP T +D+ +L L +R L EK++L Q + M LL +
Sbjct: 485 LKQYPTTFEQDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 145/342 (42%), Gaps = 24/342 (7%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA+ L+ E+ + + S W PY+ L + L + ++ L GSPT L
Sbjct: 156 ALAIALLIERHK-ENSKWKPYLDILP-------TTYTTVLYMTAADMIELKGSPTLEAAL 207
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
++ I R+Y+ + V+ + D+ FT+E + A V + +
Sbjct: 208 KQCRNIARQYSYFNKVFQNNNNAVSAILRDV----FTYERYCWAVSTVMTRQNLIPSEDG 263
Query: 143 ARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
+R AL+P+ + + A + + R +K GE I + GP+ NS +
Sbjct: 264 SRMIHALIPMWDMCNHENGRITTDFNATSNYCECYALRDFKKGEQIFISYGPRTNSDFFV 323
Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-M 260
+ GFV +N D + ++ D +++ + + +V F + G E ISD +
Sbjct: 324 HSGFVYMENKQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKLG--TEPISDLL 381
Query: 261 LPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPAT 314
L +LR+ + +E+ I S + + + +E V + + + RL A YP T
Sbjct: 382 LAFLRV-FSMRKAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTT 440
Query: 315 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
L ED +L + L K++ QL EK++L L+ I
Sbjct: 441 LKEDLQLL-ETTLPQIKKLTIQLRVTEKRILQGALEYVEQWI 481
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 106/220 (48%), Gaps = 24/220 (10%)
Query: 8 YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
++A+ +T ++ A L ++L E +GK+SFW PY+ L R+ ++PL +S+
Sbjct: 73 FSADFITNT--TQHAALCMFLCLEWLKGKESFWWPYLCVLPRE-------FDTPLYFSDE 123
Query: 68 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
+L +L G +A +E + I RE E A S+ Q+ YD TE +T+E++ A
Sbjct: 124 DLQFLQGCNLEATEVEARKLIWREEFE------AAVSILQREGYD--TEYYTWELYLWAS 175
Query: 128 VAVQSCVVHLQKVSLARRFA----LVPLGPPLLAYSSKCKAMLAA---VDDAVQLVVDRP 180
S + + R +P+ PL+ + A + D +++++
Sbjct: 176 TIFTSRSFPGKLMDWDRIIVHEDDTMPILFPLIDSLNHYPATIITWQPSDTSLRIISGVG 235
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
AG + GP+ N +LL+ YGF NP+D +++++
Sbjct: 236 VSAGAEVYNNYGPKANEELLMGYGFTLLQNPFDSFLLKSS 275
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 114/267 (42%), Gaps = 40/267 (14%)
Query: 14 TTNKLSELACLALYLMYEK-KQGKKSFWLPYIREL---------DRQRGRGQLAVESPLL 63
T L E + L+L+YEK G +S W P + L + G L + +P
Sbjct: 234 TFAGLGEDTLMTLWLVYEKYALGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPAS 292
Query: 64 W-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
W +E A L G+P + ++ E R++ L F A L + +P PTE +T
Sbjct: 293 WPAEVTDALLRGAPLLDDAVKARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRR 346
Query: 123 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA-------AV------ 169
F+ A A + + +Q ++ PP A+L AV
Sbjct: 347 FRIASEAWNAYGMTVQAETVGGASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLR 406
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DDA+ L + R +AGE I V G + N++LL+ YGF DNPYD + L+ E PQ +
Sbjct: 407 DDALHLPIARGVRAGEEIFVSYGAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGE 462
Query: 230 DKRMVAQR-----NGKLSVQVFHVHAG 251
+ + A R KLS+ V G
Sbjct: 463 VRDVSALRERVLHRAKLSLSPHSVRCG 489
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 158/392 (40%), Gaps = 61/392 (15%)
Query: 5 LFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
LF + L + L L +MYE G +S W PYI L L ++P+ W
Sbjct: 729 LFESQGDEDEQQALDSWSALILIMMYEFFLGHQSKWKPYIDVL-------PLTFDTPMFW 781
Query: 65 SETELAYLTGSPTKAEILE-RAEGIKREY--------------------NELDTVWFMAG 103
SE EL+YL S T +I + AE + R +L + G
Sbjct: 782 SEEELSYLQASATVNKIGKADAEEMFRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMG 841
Query: 104 SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK 163
S Y +D+ E + +V + + V++A +L ++
Sbjct: 842 STIMAYAFDLENEEAENDDESDGWVEDREGKSMMGMVAMA----------DILNADAEFN 891
Query: 164 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRL 215
A + D+ + + R KAGE I+ + GP PNS+LL YG++ E + P+D
Sbjct: 892 AHVNHGDEELTVTSIRDIKAGEEILNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD-- 949
Query: 216 VVEAALNTEDPQYQDKRMVAQRNGKL------SVQVFHVHAGREKEAISDMLPYLRLGYV 269
V+ +L +E QD ++ + K+ + V +G + P + G
Sbjct: 950 AVQHSLMSELGVPQD--IMTETMDKMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMP 1007
Query: 270 SDTSE-MQSVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
D E +++ I L G + + + + + + + Y T++EDE +L
Sbjct: 1008 PDLKEQLKATIKLLQKVDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQ 1067
Query: 325 YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
+L ++R+A ++ EKK+L ++M+
Sbjct: 1068 DSLTRRQRMAVRVRLGEKKLLQEAFDHFSEMV 1099
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 80/191 (41%), Gaps = 26/191 (13%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
A AL+L+ E+K+ S + YI L P L+SE EL YL G+ +
Sbjct: 227 AYFALFLLQEQKKKDASHYKAYIDSLPTDLSSF------PALFSEEELQYLEGTAALKLV 280
Query: 82 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQK 139
E+ E IK +Y + V IP F+FE F+ AF+ S V + K
Sbjct: 281 QEQKEDIKTDYESISQV--------------IPEFKSEFSFEQFRWAFLCSHSRVFGI-K 325
Query: 140 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA---GESIVVWCGPQPN 196
V + +VPL L S + DDA + K + I G + N
Sbjct: 326 VKGVKTSVMVPLADMLNHKHSGQEDSEWVFDDATNCFTVKALKKIQRNQQIHFSYGSKCN 385
Query: 197 SKLLINYGFVD 207
SKL +NYGFVD
Sbjct: 386 SKLFLNYGFVD 396
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/340 (20%), Positives = 149/340 (43%), Gaps = 33/340 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L+E L ++L+ + G KS W Y L R+ + W+ ++ +L GS T
Sbjct: 139 LNEYDSLGIFLIQQMAMGDKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKT 193
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
L E IK ++ L+ F L +YP I F ++ A + S + L
Sbjct: 194 LNASLYLKEKIKIQFLRLEKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFL 247
Query: 138 QKVSLARRFALVPLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
Q + ++ +LVP P K + + + + + D+ Y + I G
Sbjct: 248 QNL---KKVSLVPYADFMNHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYG 304
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVH 249
+ N +LL+ YGF+ E NP+D + + +L+ +D ++ K+ ++ +++ +F+
Sbjct: 305 QKTNLELLLLYGFILERNPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYK 364
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARL 308
+E + +LR +S+ E+ S +S + +E+ + + + L
Sbjct: 365 YPKE------LYEFLRFC-ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLL 417
Query: 309 AGYPATLSEDE---AMLTDYNLHPKKRVATQLVRMEKKML 345
Y +SE++ ++ +++ + +++A + + EKK++
Sbjct: 418 KNYSKKVSEEKILNSLNSNFLISKNQKMALKQSKCEKKII 457
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/394 (21%), Positives = 158/394 (40%), Gaps = 71/394 (18%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L + L L ++YE +G S W Y L + ++P+ WSE EL L S
Sbjct: 92 LDSWSSLILIMIYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHM 144
Query: 78 KAEILE-RAEGI-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEA 117
+ +I + AE + R +EL + GS Y +D+ +
Sbjct: 145 RHKIGKADAENMFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDE 204
Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
E ++ + +VP+ +L ++ A + ++++ +
Sbjct: 205 EEEEETEEWVEDRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTS 253
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 223
RP KAGE I+ + GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 254 LRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 311
Query: 224 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 273
+ +++D ++ + G+++ + + D+ L+ S
Sbjct: 312 ILKQIRGEFEGEEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLK 367
Query: 274 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 333
++ V S P + +AVL + + AR YP ++SED+ +L NL + R+
Sbjct: 368 GIKKVQSDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARM 424
Query: 334 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 367
AT + EKK+L + ++ + + D P
Sbjct: 425 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 458
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 141/342 (41%), Gaps = 34/342 (9%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL L+ E+ KS Y L G +PLLWSE E L G+ +
Sbjct: 117 LALVLVAERFGSAKSELAGYASSLPEDLG-------TPLLWSEEETRALAGTQVAGTLNS 169
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKV 140
+ + +L LF P P FT F A AV+S + K+
Sbjct: 170 YLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLPNFVWAVAAVRSRSHPPLEGDKI 223
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQP-N 196
+LA LV L A ++K + + Q+ V R + GE++ + P +
Sbjct: 224 ALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVVEATRAIRKGEALGMDYAPGKLD 280
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
+L++YG +D +P + L+ D DK + + G + + +++
Sbjct: 281 GPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKADIVEGAGLRPSMTYSITP--DQQP 338
Query: 257 ISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
+M+ +LRL + D ++S+ + VS E AV LA+ +A LAGYP T
Sbjct: 339 GEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSEGNEEAVCAMLAEGARAALAGYPTT 393
Query: 315 LSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNACLQVTAD 354
L +D A L + R A LVR+ EK+ L+A + D
Sbjct: 394 LDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAVARFFED 435
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 9 DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 281
K V R G + VF +HA E + +L +LR+ +++ + +I +
Sbjct: 69 MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127
Query: 282 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
LG PVS E +L + +AR L Y T+SED+A L +L +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183
Query: 337 LVRMEKKMLNACLQVTA 353
L EK++L ++ A
Sbjct: 184 LRLGEKEILEKAIKSAA 200
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185
Query: 86 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 304
Query: 205 FVDEDNPYDRLVV 217
F +NP D +V
Sbjct: 305 FCLPNNPCDYRIV 317
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 155/389 (39%), Gaps = 72/389 (18%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
E + L L L ++YE QG S W Y L + ++ + W +L
Sbjct: 88 EDMENEPLDSWGSLILVMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLE 140
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
YL GS ++I + L + AG F Q P+E+ ++ + +
Sbjct: 141 YLKGSAVLSKIGKDEADEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSII 199
Query: 131 QSCVVHLQKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 176
+ L+ + +VP+ +L ++ A + +D + +
Sbjct: 200 MAYAFDLENEEEPEQEDEEWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVT 258
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQY 228
RP KAGE I+ + GP PNS+LL YG+V + P+D +V++ L TE +
Sbjct: 259 ALRPIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRL 315
Query: 229 QDK--RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---- 282
D+ + VA+ ++ V E S+ G++ +++Q V + L
Sbjct: 316 TDEVWKQVAEHVDPEDLEDVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQL 368
Query: 283 -----------GPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
G + P V C+ + L +L LA YP T EDEAML
Sbjct: 369 KDVLKAIKKVRGDLIPDKRKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLAS 421
Query: 325 YNLHPKKRVATQLVRMEKKMLNACLQVTA 353
N+ ++++A ++ EK+++ LQV
Sbjct: 422 GNVTSRQKLAVEVRLGEKRLIKEALQVAG 450
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175
Query: 86 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 294
Query: 205 FVDEDNPYDRLVV 217
F +NP D +V
Sbjct: 295 FCLPNNPCDYRIV 307
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 76/162 (46%), Gaps = 15/162 (9%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
+ L +S+ EL+YL GS +R I+ +Y+ + T LF YP E F+
Sbjct: 138 TTLFFSDDELSYLEGSSLHHFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 191
Query: 120 FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
+ +K A + +S VV K L + + P + + K + AVD +
Sbjct: 192 LDAWKWALSVIWSRSFVVDEGKRGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 247
Query: 178 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYDRLV 216
P K GE I V G N++LL++YGFV E+NP+D +V
Sbjct: 248 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHDAVV 289
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 109/401 (27%), Positives = 159/401 (39%), Gaps = 85/401 (21%)
Query: 20 ELACLALYLMYE-------KKQGKK--SFWLPYIRELDR--QRGRGQLAVES----PL-- 62
E LA L+YE KK G+ SFW PY+ L +G ES PL
Sbjct: 161 EFFALATLLLYERGLDESWKKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDA 220
Query: 63 --LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFT 119
LW+E E+ +L GSPT ++REY E L++++P+ E AF
Sbjct: 221 LWLWAEDEMQWLQGSPTLLSARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFR 274
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG--------------------PPLLAYS 159
E F AF + S V L + ALVP P S
Sbjct: 275 LERFLWAFGVLFSRAVSLPAENGM--LALVPYADLANHSAFCVSFIDARTAAFPYAFRAS 332
Query: 160 SKCK----------------AMLAAVD------DAVQLVV---DRPYKAGESIVVWCGPQ 194
SK K +A D DA + VV DR Y E + V G +
Sbjct: 333 SKQKRGQWWQRFLAPNSDDAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQK 392
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ----YQDKR--MVAQRNGKLSVQVFHV 248
N++LL+ YGFV + NPY+ + V +L+ + KR ++A + F +
Sbjct: 393 SNAELLLLYGFVSDRNPYNSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPL 452
Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
+A R + +L + L + S + + PV+ E A L K L
Sbjct: 453 YADRYPLELMQLLRFASL--TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIAL 510
Query: 309 AGYPATLSEDEAMLTDYN----LHPKKRVATQLVRMEKKML 345
YP + ED+A L D + L K+R++ +L R EK++L
Sbjct: 511 QAYPTSADEDDAALKDKSMAQLLSRKQRLSVRLRRSEKRIL 551
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 92/206 (44%), Gaps = 16/206 (7%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
KL+ + L L+++ +K G+ S W P++ L R +P+ T L
Sbjct: 99 KLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTTPVFLRRTVFESLP-KD 150
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
+ E+ R I+R + +L + + G + ++ + FT+ F A+ AV + +
Sbjct: 151 LREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTWNNFVWAWTAVNTRCIF 208
Query: 137 LQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q + + + AL P L + K A V + +++ + + A E + +
Sbjct: 209 AQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISY 267
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +L ++YGFV DNP D +VV
Sbjct: 268 GPHSNRRLFLDYGFVLPDNPNDVVVV 293
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 85/394 (21%), Positives = 154/394 (39%), Gaps = 74/394 (18%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
+ L + L+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I
Sbjct: 586 STLIIVLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKI 638
Query: 82 LE-RAEGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
+ AE + + EL + GS Y +D
Sbjct: 639 GKTNAEEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD-------- 690
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
F+ + + A+ +VP+ +L ++ A + DDA+ + R
Sbjct: 691 --FQNEDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLR 747
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED-- 225
KAGE I+ + GP PNS+LL YG+V + P+ D L L++E
Sbjct: 748 TIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLD 807
Query: 226 --------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 277
++++ ++ + + + + + + E D+ L+ M
Sbjct: 808 SAREHLDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLK 859
Query: 278 VISSLGPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
I + P C V V L A + YP T+ EDE +L+ NL +++ A
Sbjct: 860 AIRKVDPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVT 919
Query: 337 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 370
+ EK++L + ++ + D + PAP
Sbjct: 920 VRLGEKRLLQEARVLLSE---IASDAILDDAPAP 950
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 87/394 (22%), Positives = 160/394 (40%), Gaps = 71/394 (18%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L + L L ++YE QG S W Y L + ++P+ WSE EL L S
Sbjct: 627 LDSWSSLILIMIYEYLQGDSSQWKSYFDVL-------PSSFDTPMFWSENELDQLQASHM 679
Query: 78 KAEILE-RAEGI-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEA 117
+ +I + AE + R EL + GS Y +D+ +
Sbjct: 680 RHKIGKADAEDMFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDE 739
Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
E ++ + + +VP+ +L ++ A + ++++ +
Sbjct: 740 EEEEETEEWVEDREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTS 788
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 223
RP KAGE I+ + GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 789 LRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 846
Query: 224 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 273
E+ +++D ++ + G+++ + + D+ L+ ++
Sbjct: 847 ILEQIRGEFEEEEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIK 905
Query: 274 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 333
++Q S P + +AVL + + AR YP ++SED+ +L NL + R+
Sbjct: 906 KLQ---SDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARM 959
Query: 334 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 367
AT + EKK+L + ++ + + D P
Sbjct: 960 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 993
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 82/365 (22%), Positives = 147/365 (40%), Gaps = 63/365 (17%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
+ L + +MYE +G +S W PYI L + E+P+ WS+ EL L S T++++
Sbjct: 590 STLIIVMMYEYFKGSESKWKPYIDVL-------PSSFETPMFWSDAELDELQASATRSKV 642
Query: 82 LE---------------RAEG----IKREYNELDTVWFMA--GSLFQQYPYDIPTEAFTF 120
+ RA + Y++ D + GS Y +D
Sbjct: 643 GKASAEEMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFD-------- 694
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
F+ + + A+ +VP+ +L ++ A + DDA+ + R
Sbjct: 695 --FQNEDEEDEDETEEWVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALR 751
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQ 229
KAGE I + GP PNS+LL YG+V + + YD + V A+L Q
Sbjct: 752 TIKAGEEIFNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL- 810
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICP 287
DK + +L E + R + + +++S++ ++ P
Sbjct: 811 DKARECLDSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVP 870
Query: 288 VSPCMERA-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
S +R +L + D +R YP ++SEDE +L ++ ++R A +
Sbjct: 871 SSVVDKRKRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLG 927
Query: 341 EKKML 345
EK+++
Sbjct: 928 EKRLI 932
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 50/264 (18%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
LAL+L+ EK SFW PYI L + + L ++ ELA L SP
Sbjct: 159 VTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLYFNSEELAELRPSPVFESS 210
Query: 82 LERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT-----EAFTFEIFKQAFVAV-- 130
L+ I R+Y N++ T+ D+P + FTF+ ++ A V
Sbjct: 211 LKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNLQDIFTFDNYRWAVSTVMT 257
Query: 131 -QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------PYK 182
Q+ +V +L A +PL C + L ++R Y+
Sbjct: 258 RQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITTDFNLELNRGECYALQDYR 308
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
E I ++ G +PNS L ++ GFV DN YD L + ++ D K + + G
Sbjct: 309 RDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSG 368
Query: 243 VQVFHVHAGREKEAISDMLPYLRL 266
V F ++ G ++ ++L ++R+
Sbjct: 369 VTNFSLYKGASPISV-ELLAFIRI 391
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 101/227 (44%), Gaps = 22/227 (9%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
+ + +A LA L+ K G+ S W PYI L Q + + + W+E+EL +
Sbjct: 97 FISEDVGNIAKLATVLLIHKNLGQDSEWHPYISCLP-----PQAEMHNTIFWNESELEMI 151
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E + + I++++ E+ V FQ P+ FT++ F A V S
Sbjct: 152 RQSSVYQETIYQKSQIEKDFLEIKPV-------FQ--PFCQSFGDFTWKDFMHACTLVGS 202
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
+ + +L+P L + +A++ + DD ++ DR Y GE +++
Sbjct: 203 -----RAWGSTKGLSLIPFAD-FLNHDGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIR 256
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
G N+ L++++GF N YD++ ++ + DP K + Q+
Sbjct: 257 YGKFSNATLMLDFGFTIPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 149/351 (42%), Gaps = 44/351 (12%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------S 65
+AL+L+ EK + + S W PY+ L S + W S
Sbjct: 125 SVALFLIREKLKKEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRS 177
Query: 66 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK- 124
E ELA L G+ L E ++RE+ +++ + P + + F I +
Sbjct: 178 EEELAELQGTQLLRTTLGVKEYMQREFAKVEEEILLPHKELFPSPITLDDFLWAFGILRS 237
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL----AYSSKCKAMLAAVDDAVQLVVDRP 180
+AF ++ Q + L L+ P + AY K + + + L
Sbjct: 238 RAFSRLRG-----QNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPIS 291
Query: 181 YKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
K+GE +++ + + N++L ++YGF+++ + + ++ DP + DK +A+ NG
Sbjct: 292 VKSGEQVLIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNG 351
Query: 240 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMER 294
F + G MLPYLRL + T ++S+ + G + P+S E
Sbjct: 352 SGETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEE 409
Query: 295 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ + D K+ L+GY T+ EDE L +L+P+ +A + EKK+L
Sbjct: 410 LICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVL 459
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/231 (25%), Positives = 112/231 (48%), Gaps = 35/231 (15%)
Query: 15 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
T++ + L+L+L+ EK +GK SFW PYIR L + +P+ ++E+EL L+
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282
Query: 75 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
S + E+A +K+E +N+L+ F+ L + FTF+ F+ A+ +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329
Query: 131 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 181
++ ++ + L+ + LVP+ L+ +S KA ++ V PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388
Query: 182 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 231
+ + + + G + N++L++ +GF +NP D + + + E PQ D+
Sbjct: 389 RKYDQVFISYGFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
+VP+ +L ++ A + +D + +V RP KAGE I+ + GP PNS+LL YG+
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129
Query: 206 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 241
V + P+D +V++ L + DP+ ++D ++ + +G+
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187
Query: 242 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 301
+ + +E +++ L+ + ++++ G + P + +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240
Query: 302 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
+ LA YP T EDEA+L NL ++R+A ++ EK++L LQ+
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQMDG 292
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 148/347 (42%), Gaps = 41/347 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL+++ E+ + S W PY+ L R + + L +S ++ L GSP+ E L+
Sbjct: 73 LALHVLCERLH-EDSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALK 124
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+ GI ++Y V+F +P +FTF+ F+ A V + ++ S
Sbjct: 125 QFRGIVKQY-----VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSN 179
Query: 143 ARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVD---RPYKAGESIVVWCGPQPNSK 198
AL+P+ + C DD+ + V +P +AG+ + ++ G + N+
Sbjct: 180 ETVKALIPM----WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNAD 235
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
L + GFV + D + ++ ++ D Y K + G L R E IS
Sbjct: 236 RLFHNGFVYTEAEEDWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPIS 294
Query: 259 -DMLPYLRLGYVSDTSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR-- 307
++ +LR+ + +T E++ ++ L +C + +L +F R
Sbjct: 295 PELRIFLRV-FSMNTGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQ 353
Query: 308 --LAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 350
L Y T EDEA+L+ D LH R +L E+ +L + L+
Sbjct: 354 LILGQYKTTKQEDEALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 15/159 (9%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
+ L +S+ EL+YL GS +R I+ +Y+ + T LF YP E F+
Sbjct: 124 TTLFFSDDELSYLEGSSLHYFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 177
Query: 120 FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
+ +K A + +S VV K L + + P + + K + AVD +
Sbjct: 178 LDAWKWALSVIWSRSFVVDEGKSGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 233
Query: 178 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYD 213
P K GE I V G N++LL++YGFV E+NP+D
Sbjct: 234 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHD 272
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 28/205 (13%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
YL+ + +GK S W PYI L + A+ P W+E ++ L G+ I E
Sbjct: 107 FYLIQQYLKGKSSLWAPYISTLTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQ 164
Query: 86 EGIKREYNELDTVWFMAGS----LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
+ +K EY + + GS + Q Y+ FT F+ + + +S ++++
Sbjct: 165 DNVKSEYKQARKILKQEGSPDYRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER-- 222
Query: 142 LARRFALVPLGPPLLAYS--------------SKCKAMLAAVDDAVQLVVDRPYKAGESI 187
L+P G + +S ++ L + A +L+ Y+ G+ +
Sbjct: 223 ------LLPEGAKIDDFSILQPLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQV 276
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPY 212
G + NS+LL+ YGFV E+N Y
Sbjct: 277 FNNYGKKTNSELLLGYGFVTENNDY 301
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/351 (21%), Positives = 155/351 (44%), Gaps = 39/351 (11%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
+ +N + LA+ L+ E + KKSFW PYI L + L W EL ++
Sbjct: 132 INSNGSDNYSDLAIKLLVELFKNKKSFWFPYIGILPEEYDLKLL-----FRWPLKELFFI 186
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
GS + +K +Y ++ +FQ+ P++ F ++ ++ + + S
Sbjct: 187 KGSRLSKASDYLKKKLKAQYEMVNK------EVFQRNRLLYPSKIFNYQNWEWSMSILLS 240
Query: 133 CVVHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESI 187
+ LQ KV L L+ P ++ S K L+ D+ ++VV D+ + +
Sbjct: 241 RTISLQETKKVVLIPYIDLLNHNPFSSSFISYRKIPLS---DSKEIVVYSDKNCNKFDQL 297
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
+ G + N +LL YGF+ E NPYD +++ +++ +D +++K+ N K +
Sbjct: 298 YISYGQKSNLELLNLYGFIAERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYP 357
Query: 248 VHAGREKEAISDMLPYLRL----GYVSDTSEMQSVISSLGPICPVSPC----MERAVLDQ 299
+ + + +M+ ++++ ++D + + I + + C +E+++
Sbjct: 358 IFLYKYPD---EMIEFIKICLFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSN 414
Query: 300 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
DY R L E+ ++D ++++ + +EKK+LN L+
Sbjct: 415 YNDYENLR----NIMLKENLLHISD-----NQKISIKYNALEKKILNRFLE 456
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 26/312 (8%)
Query: 58 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
+++PLLWSE EL L G+ ++L+ G + T + LF P P
Sbjct: 144 LDTPLLWSEDELQELQGT----QVLQTLGGYLTFFRS--TFQQLQSGLFTSNPAAFPPSI 197
Query: 118 FTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
FT F A AV+S + K++LA L L A +SK A + Q
Sbjct: 198 FTLPRFLWAVAAVRSRSHPPLDGPKIALA---PLTELVSHRRAANSKLSVRSAGLFGRGQ 254
Query: 175 LVV---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 230
++V R + GE + + GP + + +L++YG +D +P + + D D
Sbjct: 255 VLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPDSDRFIDD 314
Query: 231 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPIC 286
K + + N V+++ +++ +ML +LRL + SD ++S+ + G +
Sbjct: 315 KLDILESNDLPQSVVYNLTP--DEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDVWGFMQ 372
Query: 287 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK--KRVATQLVRM-EK 342
PVS E AV + L++ +A L GY T+ +D A L K +R A L+R+ EK
Sbjct: 373 EPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAELRAQGSRAKGSRREAALLIRLGEK 432
Query: 343 KMLNACLQVTAD 354
+ L+A + D
Sbjct: 433 EALDAVARFFED 444
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 106/247 (42%), Gaps = 31/247 (12%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L +L E +G +S W YI L + +PL ++E + A+L + + E
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSL 142
R K EY E A SL + PTE FTF+++ A V C +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179
Query: 143 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 196
+ P+ PL+ + + K K + D +VQL+ G + GP+ N
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGN 239
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREK 254
+LL+ YGF DNP+D + ++ A++ + P K + + + +LS VF + +K
Sbjct: 240 EELLMGYGFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDK 299
Query: 255 EAISDML 261
E +L
Sbjct: 300 EIFQKIL 306
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
KLS + + YL+ E+ + K SFW YI+ L + + +P+ ++ E+ +L
Sbjct: 102 KLSPMQVICTYLICERSRQKDSFWYNYIKVLPK-------SYSNPVYFTNEEINWLP-RR 153
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
K ++ + E I Y EL ++ + S F + F + F+ A+ V + V+
Sbjct: 154 IKRKVFDECEKINTAYRELKNLFSILESTFVSFK-----GIFEYSAFRWAWCTVNTRSVY 208
Query: 137 LQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIV 188
+ + +S+ R +AL P LL +++ + KA V ++ K + +
Sbjct: 209 MLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMF 267
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
++ GP N KL I YGFV N ++ VVE L+ ED
Sbjct: 268 IYYGPHDNVKLFIEYGFVLPQNQHN--VVE--LDFED 300
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 146/361 (40%), Gaps = 45/361 (12%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L LM E Q S W PY+ L Q + S + WS++ELA L S +I
Sbjct: 86 ALTSILMAEG-QVDDSKWAPYLAILPEQ-------LNSLVFWSDSELAELQASAVVKKIG 137
Query: 83 ERA-----------EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
++ +G++ E+ S+ Y +DIP + + A
Sbjct: 138 KQGAEDMFKTYITPQGLQHSSTEM---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAAD 194
Query: 132 SCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
V +K L+ ++PL L A + + A L ++ +++ +P GE I
Sbjct: 195 DLVSDDGEDEKTILS----MIPLADMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIF 250
Query: 189 VWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL----------NTEDPQ-YQDKRMV-- 234
G P S LL YG+V D + YD + A L + P+ QDK V
Sbjct: 251 NDYGQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRL 310
Query: 235 --AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 292
A+R G VH+ ++ +I D L + D S +++++ S + S
Sbjct: 311 DLAEREGVYDESFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLT 370
Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 352
LA +AR Y T+ EDE +L + +L + +A Q+ EKK+L A ++
Sbjct: 371 TELAGQVLAILLQARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREA 430
Query: 353 A 353
A
Sbjct: 431 A 431
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 100/217 (46%), Gaps = 25/217 (11%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L +++ +A LA+ + E K G+ S W PYI L Q G + S + WSE EL +
Sbjct: 105 LLGDEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMI 159
Query: 73 TGSPTKAEILERAEGIKREYNELDTV-WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
S E + + I++++ + V + +LF+ DI + F +
Sbjct: 160 QQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLKEF-----------MH 204
Query: 132 SC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIV 188
+C +V + + +L+P + + ++L +D +++ DR Y GE ++
Sbjct: 205 ACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVL 263
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
+ G PN+ LL+++GF N YD++ ++ + D
Sbjct: 264 IRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 300
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 96/211 (45%), Gaps = 11/211 (5%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 265
DN D + ++ D +++ + + SV F + G E ISD +L +LR
Sbjct: 78 CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 266 LGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDE 319
+ + +E+ + S + + V +E V + + + RL A YP TL ED
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 320 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
+L + L K++A QL EK++L+ L+
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALE 224
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 110/265 (41%), Gaps = 55/265 (20%)
Query: 4 LLFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 63
LL A + + +L L + LMYE+ +GK S W Y++ L RQ P L
Sbjct: 57 LLTTAARDAIERARLGGGLGLTVALMYERSKGKGSKWYRYLKTLPRQE-------SVPFL 109
Query: 64 WSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
WSE E+ L G+ + E +K ++ E L ++ P + P + FTFE
Sbjct: 110 WSEEEIDGLLLGTELHKALKEDKLLMKEDWEE------NIAPLTKEDPLEFPAQDFTFE- 162
Query: 123 FKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKCKA-----MLAA-------- 168
+++A +S V ++ + +VPL ++ K A ML A
Sbjct: 163 ---SYLAAKSLVSSRSFEIDAEHGYGMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDN 216
Query: 169 ---VDDA---------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
+DD +++V+ + AG I G N+ LL YGF + +N
Sbjct: 217 GLIIDDGLANGDCREISSDKSVLEMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNN 276
Query: 211 PYD--RLVVEAALNTEDPQYQDKRM 233
P+D L ++ L ++Q KR+
Sbjct: 277 PHDIVNLDMDCVLEVLLSRFQKKRV 301
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 14/182 (7%)
Query: 179 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
R A + +VVW G +PN +LL+ G + ++N D L A L D Y
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 289
K V + G + + F V+A R +L YLRL V+D + + + +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280
Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
E +L L + RLA Y + ED + +L PK+R+A +L EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340
Query: 350 QV 351
+
Sbjct: 341 EA 342
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 99/224 (44%), Gaps = 24/224 (10%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 82
A +LM + + FW PYIR L G+ + +PL + E E L +L G + A
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161
Query: 83 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 135
ER I +R Y L + F + Y +D+ A T I +AF A + S +
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
L+ + R L+PL + A + K K A D++ LVV AGE + GP
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGP 274
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
+ N +L++NYGF DNP + VV + P Q K Q
Sbjct: 275 RNNEQLMMNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQH 318
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
K V R G + VF +H+ E + +L +LR+ +++ S + + +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334
Query: 341 EKKMLNACLQVTA 353
EK++L ++ A
Sbjct: 335 EKEILEKAVKSAA 347
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 30/199 (15%)
Query: 24 LALYLMYEKKQGK----KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
LAL ++ E+ +G+ S W PYI L +++ LW +TEL+YL SP
Sbjct: 67 LALIVLMERYKGQAKLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYG 121
Query: 80 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
+ ER E I E+ ++ + LF + + E FK + V S +
Sbjct: 122 KTRERLEMITTEFGQVQNALDVWPQLFGK---------VSLEDFKHVYATVFS-----RS 167
Query: 140 VSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+++ LV + P+L + +S K + + + DR Y + I + G
Sbjct: 168 LAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 225
Query: 195 PNSKLLINYGFVDEDNPYD 213
N++L ++YGF +NPYD
Sbjct: 226 SNAELALDYGFAVPENPYD 244
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 96/237 (40%), Gaps = 33/237 (13%)
Query: 17 KLSELACLALYLMYE----KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-Y 71
L E + LA +L + S + Y+R L R+ G L W E ++
Sbjct: 62 NLQEWSVLAAFLAEQALAIDAGADGSRFATYVRALPRRTG-------GVLDWPEEDVKEL 114
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
L GSP++ +ER + +E+ +P P + AF +
Sbjct: 115 LAGSPSQRAAMERQASVDAAIDEIRA----------SFPQLTPG------ALRWAFDVLF 158
Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
S ++ L A ALVP +L + C A + AV L DR YK GE +
Sbjct: 159 SRLIRLPNRGGA--LALVPWAD-MLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASY 215
Query: 192 GPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP+P+S+LLI+YGF +NP D V ++ D K +R G V+ F
Sbjct: 216 GPRPSSELLISYGFAPAVGENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
K V R G + VF +H+ E + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412
Query: 341 EKKMLNACLQVTA 353
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 95/214 (44%), Gaps = 23/214 (10%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELA 70
L+ + L+ + L+L+YE+ +G+ SFW PYI L ++ +L P+LWS+
Sbjct: 144 LVKKSDLTPWQAMCLHLLYERARGETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRRE 200
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQA 126
+L GSP ++ ER I RE E + AG L + P I A +
Sbjct: 201 WLEGSPM-LDVTERRLAICREDYEA-MLLAGAGRLTPRGNEGEPISITETAVQ---WAAT 255
Query: 127 FVAVQSCVVHLQKVSL------ARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVV 177
+ +S ++LQ L ALVP L SS + D L
Sbjct: 256 MLLSRSFSLNLQTQKLRPGSFAEDTIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQA 315
Query: 178 DRPYKAGESIVVWCGPQPN-SKLLINYGFVDEDN 210
R Y GE + GP + S+LL++YGFVDE+N
Sbjct: 316 HRTYSEGEQVFDSYGPSCSPSRLLLDYGFVDEEN 349
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 138/351 (39%), Gaps = 40/351 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL++++E ++ SFW PY L + V P+ W++ ++ L GSP A +L
Sbjct: 48 LALFMLHELRK-PDSFWRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLA 99
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVV 135
+ + + + E + ++YP + + + +++ F+ + S
Sbjct: 100 QKKQARDWHTE------HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAF 153
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCG 192
+ +VP+ + + DD Q V +PY GE + +
Sbjct: 154 WHFDLKDTWEPHMVPMADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYC 213
Query: 193 PQPNSKLLINYGFVDEDN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 245
N +LL Y + EDN D + E + R + QR L+ Q
Sbjct: 214 TDSNFELLKTYAMMVEDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQT 271
Query: 246 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
+ V + +E D++ LRL ++ T S+ PVS E V D +A K
Sbjct: 272 YPV---KSEEFPLDLVQALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVK 326
Query: 306 ARLAGYPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 355
L+ YP T ED AML D L R+A R +K L V A+M
Sbjct: 327 ELLSQYPITAQEDAAMLAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
K V R G + VF +H+ E + +L +LR+ +++ S + + +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284
Query: 341 EKKMLNACLQVTA 353
EK++L ++ A
Sbjct: 285 EKEILEKAVKSAA 297
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 116/251 (46%), Gaps = 14/251 (5%)
Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 465
Query: 235 AQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG 283
R G + VF +H E + +L +LR+ +++ + ++ +LG
Sbjct: 466 LARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLG 524
Query: 284 PI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 342
PVS E + L D L Y T ED+A+L + +L + ++A +L EK
Sbjct: 525 NSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKLRLGEK 584
Query: 343 KMLNACLQVTA 353
++L +Q A
Sbjct: 585 EILEKAVQSAA 595
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 93/211 (44%), Gaps = 11/211 (5%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 265
+N D + ++ D +++ + + G SV F + G E ISD +L +LR
Sbjct: 78 YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 266 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 319
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 320 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
+L + L K++A QL EK++L L+
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALE 224
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 101/223 (45%), Gaps = 31/223 (13%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L +++ +A LA+ + E K G+ S W PYI L Q G + S + WSE EL +
Sbjct: 106 LLGDEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMI 160
Query: 73 TGSPTKAEILERAEGIKREYNELDTV-WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
S E + + I++++ + V + +LF+ DI + F +
Sbjct: 161 QQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLKEF-----------MH 205
Query: 132 SC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--------AVQLVVDRPYK 182
+C +V + + +L+P + + ++L +D ++++ DR Y
Sbjct: 206 ACALVGSRAWGSTKGLSLIPFAD-FVNHDGFSDSVLLGDEDKQLSESSSTLEVIADRNYA 264
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
GE +++ G PN+ LL+++GF N YD++ ++ + D
Sbjct: 265 PGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 307
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 150/375 (40%), Gaps = 62/375 (16%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 140
R + + +EL + + P +I A F A AV S + V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 190
+ A++ PL+ + A +++ + ++V R + G +I +
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LS 242
GP N LL++YGFV DNP+DR+ L + ++ RM+A R G S
Sbjct: 266 YGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPAS 321
Query: 243 VQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGP 284
QV V G +E +L LR+ + E + ++S + G
Sbjct: 322 WQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGV 381
Query: 285 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
VS E VL L + T+ EDEA L+D +L R+A Q +K++
Sbjct: 382 ESMVSSDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRL 441
Query: 345 LNACLQVTADMIMLL 359
+ L+ +M L
Sbjct: 442 VVRVLESLKKRLMDL 456
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L + +++V +P AGE I G PNS LL YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319
Query: 207 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 254
D E NP D + + A L R + G L V+V F + +
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRVDFWLEEADDDT 374
Query: 255 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
E ++L ++RL +S T + + + + G + +E +L + D K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430
Query: 308 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 350
L YP T+ EDE++L + NL KR A + EK++L LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
K V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412
Query: 341 EKKMLNACLQVTA 353
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 78/191 (40%), Gaps = 12/191 (6%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ E +G+ SFW PYI L + + P W E ++ +L G+ I E
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168
Query: 86 EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 140
IKREY + V F + Q Y FT F+ + + QS ++ +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP 195
AR + PL ++ D + QL+ Y+ G+ + G +
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKT 288
Query: 196 NSKLLINYGFV 206
NS+LL+ YGF+
Sbjct: 289 NSELLLAYGFI 299
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 76/171 (44%), Gaps = 18/171 (10%)
Query: 65 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 124
+E E++ L G+P +E + I+ +Y V +L YP DI + T + F
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPVL---QALTAAYPDDITPDLVTEDKFI 121
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 176
A S + ++ V A R LVP+ P ++ Y L A D+++L
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRY-----GRLDAATDSLRLR 176
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 225
R AGE + GP PN KLL+ YGF DNP+D + EA N D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 90/209 (43%), Gaps = 24/209 (11%)
Query: 8 YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
Y E + ++ A L+ EK +G S W YI L + + P+L +
Sbjct: 585 YDNETIVAGEVDSWDRAAARLLREKAKGSSSAWASYINILPQN-------MTVPILLEDH 637
Query: 68 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFK 124
EL + P E+++ + I+ ++ L +V +AG+ F++Y + + + AFT +F
Sbjct: 638 ELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDLAGADFEEYRWAAMMVHSRAFTLPVFA 696
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
A + ++ ++ + + P+ V++V R K G
Sbjct: 697 DDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG-------------GKVEIVARRDIKKG 743
Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
E + GP+ N L + YGFV +DNP+D
Sbjct: 744 EELFASFGPRANDNLFLYYGFVLKDNPFD 772
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 81/378 (21%), Positives = 143/378 (37%), Gaps = 55/378 (14%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
E + L L L ++YE QG S W PY L + + + W ++L
Sbjct: 86 EDMENEPLDSWGSLILVMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLE 138
Query: 71 YLTGSPTKAEI-LERAEGIKREY----------------------NELDTVWFMAGSLFQ 107
L GS ++I E A+ + R EL + GS+
Sbjct: 139 NLKGSAVLSKIGKEEADEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIM 198
Query: 108 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 167
Y +D+ E + ++ + L V +A +L ++ A +
Sbjct: 199 AYAFDLDNEEEPEQEEDDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVN 248
Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVV 217
DD + + RP AGE I+ + GP PNS+LL YG+V + P+D + V
Sbjct: 249 HGDDELTVTALRPIPAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASV 308
Query: 218 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQ 276
L D ++ + V V +G + E + +R +++
Sbjct: 309 SEHLKIGDDVWKQVQEYVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLK 368
Query: 277 SVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 332
+V+ ++ G + P + + + L+ YP + EDEA+L +L ++R
Sbjct: 369 AVLKAIKKINGDLIPDKRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQR 428
Query: 333 VATQLVRMEKKMLNACLQ 350
+A + EKK+L L+
Sbjct: 429 MAIHVRLGEKKLLKEALE 446
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 87/219 (39%), Gaps = 27/219 (12%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P+ + + A D + R ++ GE I + GP+ NS ++ GFV
Sbjct: 31 ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90
Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML----- 261
DN D + ++ D +++ + + SV F + G E ISDML
Sbjct: 91 YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148
Query: 262 ----------PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
+LR V D M C + +E V L + +A Y
Sbjct: 149 VFSMRKAELAHWLRSDKVFDLKHMD---------CALETVVEENVRKFLLTRLQLLIANY 199
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
P TL ED +L + L K++ QL EKK+L L+
Sbjct: 200 PTTLKEDLELL-ETTLPQVKKMTVQLRVTEKKILLGALE 237
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 154/365 (42%), Gaps = 42/365 (11%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LAL+LM+E+ + SF+ P+I L + P+ WSE+EL L G+
Sbjct: 83 VLALHLMHERSKRSDSFFAPFIASL-------PTTFDLPVFWSESELNELKGTNVLLLTQ 135
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQK 139
+ ++R++ + ++ + +P +PT T E + A + S
Sbjct: 136 LMKQQLQRDFENIHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAF---G 184
Query: 140 VSLARRFALVPLGPPLLAYSSKCKAML-----AAVDDAVQLV---VDRPYKAGESIVVWC 191
V+ +++ V L P + ++ + + D+ Q++ V + AG ++ +
Sbjct: 185 VTREKKYLRV-LCPAMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISY 243
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
G N+KLL +YGFV ++N + + DP + K+ V N Q +
Sbjct: 244 GQYSNAKLLFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGT 303
Query: 252 R-EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
E + +L LR+ +++ E++ + +S E AV + L + + +LA
Sbjct: 304 LFENDVDERLLATLRVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLAN 361
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQL-VRMEKK--------MLNACLQVTADMIMLLPD 361
+ TL EDEA+L + R++ + VR+E K L QV A + + P
Sbjct: 362 FATTLEEDEAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPP 421
Query: 362 VTVSP 366
T P
Sbjct: 422 STTRP 426
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 96/229 (41%), Gaps = 7/229 (3%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+S + L+P+ + + K A D V VV AG+ I GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPR 250
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
N +L++NYGF NP D +V P Y K Q L+V
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 76/187 (40%), Gaps = 28/187 (14%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQYP 110
+PL WS+ E L GS YN LD W M A L Q
Sbjct: 163 TPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQHNV 207
Query: 111 Y-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKAML 166
+ D+P ++ + FK A+ + + + S R+ + P+ K
Sbjct: 208 FPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSYTF 267
Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
A +L + + GE I + + N++ L+ YGFV E NP+D + + A++ + P
Sbjct: 268 NAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGIAASIGNDQP 327
Query: 227 QYQDKRM 233
Y+DK +
Sbjct: 328 FYRDKSL 334
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 140/356 (39%), Gaps = 35/356 (9%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L ++L+ + ++ + SF+ Y L P+ WS+ EL +L GS +I E
Sbjct: 121 LMMFLLTDMERAETSFFRNYYSTLP------STLSNMPIFWSDEELGWLKGSYIIQQIQE 174
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
R I+++Y+ + V D F+ + F A + V S L +
Sbjct: 175 RKAAIRKDYDVICRV-------------DPAFARFSLDRFSWARMIVCSRNFGL-TIDGV 220
Query: 144 RRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
+ ALVP L Y + + DA + G + G + N + L+N
Sbjct: 221 KTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHRFLLN 280
Query: 203 YGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKE 255
YGF EDN + ++++ L+ D Q + DKR +G ++ +
Sbjct: 281 YGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDA 339
Query: 256 AISDMLPYLRLGYVSDTSEMQS--VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
+ + RL V+ E S + S P+S E L L D +L+ Y
Sbjct: 340 NTREGFSFARL-IVATEDEFSSMKMKSPAHSSPPISFDNEIRALQYLRDLMTHQLSLYDT 398
Query: 314 TLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 367
T+ ED +L Y L + A +R EK++ Q AD ++ L + ++ C
Sbjct: 399 TIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCR-YFQELADKVIQLFSLPLAEC 453
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
Y+ GE I ++ G + N+ L++ GFV DN + + +LN + Q++ ++ + ++ G
Sbjct: 74 YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133
Query: 241 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 291
S F+V G IS ++L + R LG+ +QS + L P C P
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188
Query: 292 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 341
+E ++ +++ Y RL TL +DEA+L + L K + Q +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248
Query: 342 KKMLNACLQ 350
KK+L+ L+
Sbjct: 249 KKILSEALE 257
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 108/231 (46%), Gaps = 35/231 (15%)
Query: 8 YAAELLTTNKLSEL-----------ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 56
Y+A++ + N L EL A LA+ L+ ++K G++S W PYI L Q G
Sbjct: 66 YSAQIASDNLLPELSDLLGDEVGSVAKLAIVLLVDQKVGQESKWAPYISRLP-QLGE--- 121
Query: 57 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
+ S + WS++EL + S E +++ I++++ + V + +P +
Sbjct: 122 -MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV-------LEHFPQ--ISR 171
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQ 174
+ TF+ F A+ V+S + + +L+P L + +A++ +D +
Sbjct: 172 SITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNHDGFSEAVVLNDEDKQVSE 225
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
+ DR Y E +++ G N+ LL+++GF N +++ VE +N D
Sbjct: 226 VAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQ--VEIQINIPD 274
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 91/205 (44%), Gaps = 22/205 (10%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L N++ +A LA+ ++ E K G S W PYI L + + + + W E+EL +
Sbjct: 111 LLGNEIGNVAKLAVVVLLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMI 165
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E L + IKRE+ + + +P I + + + F A+ V S
Sbjct: 166 RKSSLYEESLNQRSQIKREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS 216
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVW 190
+ +L+P L + +AML DD ++V DR + GE +++
Sbjct: 217 -----RAWRSTEGVSLIPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIR 270
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
G N+ L++++GF N +D++
Sbjct: 271 YGKYSNATLMLDFGFALPYNIHDQV 295
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 31/320 (9%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
P+ WSE EL++L GS +I ER I+++Y+ + V D F+
Sbjct: 144 PIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPSFARFSL 190
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 179
+ F A + V S L + + ALVP L Y + + DA +
Sbjct: 191 DRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLG 249
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKR 232
G + G + N + L+NYGF EDN + ++++ L+ D Q + DKR
Sbjct: 250 TIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSPADGQLFYDKR 309
Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---PVS 289
+G ++ + + + RL + T E S + P P+S
Sbjct: 310 AYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEEEFSTMKMKSPAHSSPPIS 366
Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNA 347
E L+ L + +L+ Y T+ ED +L Y L + A +R EK++
Sbjct: 367 FDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCRY 426
Query: 348 CLQVTADMIMLLPDVTVSPC 367
Q AD ++ L + ++ C
Sbjct: 427 -FQELADKVIPLFSLPLAEC 445
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 89/211 (42%), Gaps = 31/211 (14%)
Query: 21 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKA 79
LA A L E K + Y++ L R G S L W E+++ L GSP+
Sbjct: 169 LAETAQNLSTEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLF 221
Query: 80 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
LER + E+ + Q+ +DI +F S ++ L+
Sbjct: 222 SALERRASVAAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE- 264
Query: 140 VSLARRFALVPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
SL ALVP +L + C+A L V L DR Y+ GE + G +P+S
Sbjct: 265 -SLGGNLALVPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSS 322
Query: 198 KLLINYGFVDE--DNPYDRLVVEAALNTEDP 226
+LLI+YGF DNP D + ++ EDP
Sbjct: 323 ELLISYGFAPAVGDNPDDEYALNLQIDEEDP 353
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 77/178 (43%), Gaps = 19/178 (10%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+++ N + + +A YL+ E S W+ YI L RQ S L W+ EL
Sbjct: 135 GDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PYSLLYWTRAEL 187
Query: 70 -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
AYL SP + ++R + YN+L +F ++P P E + E F +F
Sbjct: 188 DAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVYNIETFLWSFG 241
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 184
+ S +V L S+ R ALVP +L +S + + L + DR Y+ G
Sbjct: 242 ILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 62/321 (19%)
Query: 58 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
+ S L +S+ E L GSP +++ + K + L SLF++ + A
Sbjct: 130 LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGRL-------KSLFKEQQFS--KCA 180
Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVD---- 170
T++ K A+ V S ++ + +L F V L P L Y + + AA D
Sbjct: 181 MTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAP-FLDYFNHAEDAQAAYDFDYD 239
Query: 171 -DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLV---VEAALNT-- 223
A+++V +P K GE I + G Q NS LLI+YGF+D+ + V VE LNT
Sbjct: 240 ESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIP 299
Query: 224 -EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 280
DPQ +K + + + + +++F E IS L Y
Sbjct: 300 ASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKISKYLSYKNF-------------- 345
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQL 337
++L L ++ YP T+ ED A++ T++ ++ + +
Sbjct: 346 --------------SLLPYLKSLIDMKMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSII 391
Query: 338 VRMEK----KMLNACLQVTAD 354
+R+++ K + A +QV D
Sbjct: 392 MRLQEKETLKEIGALIQVKID 412
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 143/355 (40%), Gaps = 43/355 (12%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPT 77
EL LAL+L E+ +G S W PY+ L V+ PL W E+E+ + L GSPT
Sbjct: 133 GELVGLALWLCCERAKGSLSDWAPYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPT 187
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + RA + EY + + Y+ TE F A V + V L
Sbjct: 188 CEQAVGRAVDAREEYASIRAAIAADADAYPADAYEFLTEL----AFTDALATVLARAVWL 243
Query: 138 QKVSLARRFALVPL--------------GPPLLAYSSKCKAMLAAV-----DDAVQLV-V 177
++ +ALVPL P A + + + AAV D A + V V
Sbjct: 244 NAANV---YALVPLVDLLPVVGAPPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAV 300
Query: 178 DRPYKAGESIVVWCGP---QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
A ++ V C + L ++ G V+ + D L + D Y K+ +
Sbjct: 301 VSANDARQTAPVVCADALGRNAGDLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQI 360
Query: 235 AQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 293
+ G Q F V A R + Y+R V + SE+ +V I VSP E
Sbjct: 361 LEGMGYSADAQAFPVFADRMP---LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNE 415
Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 347
+L L + LA Y + E E + L + N+ ++ A +L EK+++NA
Sbjct: 416 YEILQLLMGDAREMLAEYENSSEEFELLQLKETNISERQMTAAKLRLGEKRLINA 470
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 93/219 (42%), Gaps = 22/219 (10%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 84
A++LMY K KSFW PY+ L ++ + + +SE EL L S K + R
Sbjct: 116 AIFLMY-CKLNDKSFWYPYVSVLPKE-------FTTSIYFSEEELDELQSSKLKEFTIIR 167
Query: 85 AEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVAVQSCVVHLQKVS 141
+GI+R YN T G + + P T + +T E+F A V S L
Sbjct: 168 KDGIERHYNSTFTRLSNRG-IAEFSPTSTQTLQQKGYTLELFTWALSCVWSRAFSLSDSD 226
Query: 142 LARRFALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNS 197
+VPL A SK K D + + GE I G P +S
Sbjct: 227 ----GGMVPLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSS 282
Query: 198 KLLINYGFV-DEDNPYDRLVVEAAL-NTEDPQYQDKRMV 234
++L++YGFV D P D + + + + ++P Q K+ +
Sbjct: 283 QMLMDYGFVFDHGTPSDNVAISVPIFHPDEPNIQVKQSI 321
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + ++ GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
K V R G + VF +H + + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
PVS E + L D L Y T+ ED+++L +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412
Query: 341 EKKMLNACLQVTA 353
EK++L ++ A
Sbjct: 413 EKEILERAVRSAA 425
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/336 (22%), Positives = 136/336 (40%), Gaps = 25/336 (7%)
Query: 37 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 95
KSFW PY+ L + ++ + ++ E+ L GSP + L I R+Y+
Sbjct: 167 KSFWKPYLDALPS-------SYDTVMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFY 219
Query: 96 ----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 151
V +L + Y+ A + + +Q + Q + K L AL+PL
Sbjct: 220 SLLQKNVDPALSNLRANFTYNDYRWAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPL 279
Query: 152 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
+ R + GE + ++ G + ++ I+ GFVD +N
Sbjct: 280 WDFCNHQDGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNA 339
Query: 212 YDRLVVEAALNTEDPQYQDK-----RMVAQRNGKLSVQV-FHVHAGREKEAISDMLPYLR 265
+D L ++ L+ DP + ++ + K+S + F + AG + +L +LR
Sbjct: 340 HDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLR 398
Query: 266 LGYVSDTSEMQSVISSLGPICPVSPC-MERAVLDQLADYFKAR----LAGYPATLSEDEA 320
L ++ S + + S C +E V D+ + KAR L YP T D
Sbjct: 399 LFCMTKDSLDRWLQSDNASNLMHEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLK 458
Query: 321 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
ML + +L +R+ L EK++L + ++ A I
Sbjct: 459 MLEE-DLSSHRRMCVLLRLAEKRILLSAIECAAQRI 493
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/226 (21%), Positives = 86/226 (38%), Gaps = 36/226 (15%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L + + + + LA L+ E+ G +S W PYI+ L + + + +LW EL +
Sbjct: 94 LLDHAVGDTSKLAALLIMEQHLGNESGWAPYIKSLPTKD-----QMHNMVLWDLNELHAV 148
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E +E E K+E+ L LF + A + V
Sbjct: 149 QNSSIYDEAIEHKEQAKKEFLALKPALDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSV 208
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
+ QK D +++ DR Y GE +++ G
Sbjct: 209 LIYDEQK-------------------------------DVCEIIADRNYAVGEQVMIRYG 237
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
N+ L +N+GF N YD+ ++ + +DP Y+ K + Q++
Sbjct: 238 KYSNATLALNFGFTLARNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 86/367 (23%), Positives = 153/367 (41%), Gaps = 46/367 (12%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 141
R + + EL + + + + + A + + A V+ ++ +H K+
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213
Query: 142 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWCGPQPNSK 198
A L+ + ++ + L+ A D + +++V R + G +I + GP N
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDL 273
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH--- 247
LL++YGFV DNP+DR+ L + ++ RM+A R G S QV
Sbjct: 274 LLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLKQ 329
Query: 248 -----------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPCM 292
V G +E +L LR+ + E + ++S + G VS
Sbjct: 330 LGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVSSDN 389
Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 352
E VL L + T+ EDEA L+D +L R+A Q +K+++ L+
Sbjct: 390 EERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLESL 449
Query: 353 ADMIMLL 359
+M L
Sbjct: 450 KKRLMDL 456
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 122/286 (42%), Gaps = 35/286 (12%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
E+ LA +L EK +G +S + P+I L PLLW++ E+ L G+
Sbjct: 104 EVVALATFLAREKMKGSESSYKPFIDVL-------PWDSLHPLLWTDEEVDLLEGTYAHR 156
Query: 80 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQS--- 132
EIL E ++ + V G ++Q+ I TE T E F + AF +V S
Sbjct: 157 EILAFREQVEVATELFEPVLNPKG--WKQFFQTIETEKMTPEEFGFMMRGAFASVLSRAF 214
Query: 133 -CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY---------- 181
+ L R + P L ++ D A++ ++ +
Sbjct: 215 DSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSITFDTALERDNEKGFPVRVADKGKS 270
Query: 182 -KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
+ GE + + G +PN +L YGFV + + +++ +DP + K + + G
Sbjct: 271 IEEGEELFGFYGDKPNWNMLTTYGFVSPNPKCQETTLSVSIDEKDPYFAQKEEILKARGM 330
Query: 241 LSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 285
++V Q+F + + + + ++ Y R+ +S+ +++ V ++ G +
Sbjct: 331 VAVEQLFDIR--HDTDPMGPLINYFRIREISNEADLTKVQTNYGEM 374
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 7/133 (5%)
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 178
E FK +F + S +V L S+ + ALVP +L +S + L + D
Sbjct: 2 ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQ 236
RPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K + +
Sbjct: 59 RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLK 118
Query: 237 RNGKLSVQVFHVH 249
+ G Q F +
Sbjct: 119 KYGLSGSQCFPIR 131
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 86/209 (41%), Gaps = 22/209 (10%)
Query: 15 TNKLSELA---CLALYLMYEKKQ-GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
T KL EL+ L + +MYE K +S W Y L L + W++ EL+
Sbjct: 126 TGKLMELSQWDSLIICMMYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELS 180
Query: 71 YLT--------GSPTKAEILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAF 118
+LT G + R E+NE D + GS+ F P I +F
Sbjct: 181 FLTPSLVVNRVGKGDAETMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSF 239
Query: 119 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 178
EI ++ +++PL L A + KC A L D+++++
Sbjct: 240 DVEIKNDDDENEGDEEFDEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAI 299
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVD 207
+P K GE + G PNS+LL YG+V+
Sbjct: 300 KPIKKGEQVYNTYGELPNSELLRKYGYVE 328
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 81/199 (40%), Gaps = 28/199 (14%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ E +G+ SFW PYI L + A+ P W E ++AYL G+ I E
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164
Query: 86 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
+K E+ + L F A + Q Y FT F+ + V L + +
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLV--------LSEPA 216
Query: 142 LARRFALVPLG---------PPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESI 187
+ L+P G PL ++ A D A QLV Y+ GE +
Sbjct: 217 KQQMAELLPPGCQLDDFSILQPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQV 276
Query: 188 VVWCGPQPNSKLLINYGFV 206
G + NS+LL+ YGF+
Sbjct: 277 YNNYGLKTNSELLLAYGFI 295
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 142/346 (41%), Gaps = 36/346 (10%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTK 78
LALYL++ K R L + R +A + + +++ EL GS
Sbjct: 77 LALYLLFVKS-----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLY 125
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
A + + + +Y +L +L Q+ P + FT E +K A ++ S +
Sbjct: 126 ALTPQLEQRVHDDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFA 179
Query: 139 KVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
A + PL +L +S +C A D + ++ + Y+ G+ I ++ G P
Sbjct: 180 VSETASVRLVAPLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVP 237
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
N++LL YGFV DNP D + + P Y+ K + G S + K+
Sbjct: 238 NNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KD 294
Query: 256 AI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYP 312
+ +++L YLR+ + D S + + L G V+ E VL L D + L G+
Sbjct: 295 PLPNNVLRYLRIQRL-DESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFG 353
Query: 313 ATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
L + EA L DY A + E+++L + D++
Sbjct: 354 IPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVLTRAKKTAEDLL 399
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 148/354 (41%), Gaps = 45/354 (12%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ EK + + S W PY+ L + S + WSE ELA L G+ + L
Sbjct: 123 SVALFLIREKLK-EDSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTL 174
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
++RE+ +++ + Q +P + T + F AF ++S +
Sbjct: 175 GVKSYLRREFLKVEEEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR--- 224
Query: 143 ARRFALVPLG----------------PPLLAYSSKC----KAMLAAVDDAVQLVVDRPYK 182
+ L+PL P + A L + D L K
Sbjct: 225 GQNLVLIPLADLCNFLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSRDLIFSLRSPISLK 284
Query: 183 AGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 241
AGE +++ N++L ++YGF++ + + + ++ DP + DK +A+ NG
Sbjct: 285 AGEQVLIQYNLNLSNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLG 344
Query: 242 SVQVFH-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVL 297
+ F V + L + LG +D+ ++S+ + G + PVS E +
Sbjct: 345 EIADFDIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELIC 403
Query: 298 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 351
+ D K+ L+GY T+ EDE L L+P+ +A + EKK+L ++
Sbjct: 404 RVVRDACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 90/211 (42%), Gaps = 18/211 (8%)
Query: 23 CLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
LA++++ E ++ KKSFW Y+ L + +P+ + +L L GSP
Sbjct: 163 ALAIFILVESLRKDKKSFWHSYLTTL-------PVTYSTPVYFDVADLEALKGSPAFEAA 215
Query: 82 LERAEGIKREYNELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQA---FVAVQSCVVHL 137
L+ I R+Y ++ ++ P I + FT+E ++ A ++ Q+ V
Sbjct: 216 LKLNRNIARQYAYFKKLFQLSND-----PASVILKDTFTYEYYRWAVSTLMSRQNTVPSS 270
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
S AL+PL S + + D Y A E + ++ G + N+
Sbjct: 271 DNPS-ENVSALIPLWDMFNHRSGRLSTDFVKSSNVCVCYADGDYAADEQVYIFYGVRTNA 329
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
L++ GFV DN +D + + ++ DP Y
Sbjct: 330 DFLVHNGFVYPDNEHDAVKIRLGVSRSDPLY 360
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 128/305 (41%), Gaps = 20/305 (6%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
+ + ++E EL GS A + + ++ +Y +L L Q+ P + FT
Sbjct: 107 ASIFFTEDELQVCAGSSLYALTRQLEQRVRDDYRQLLV------PLLSQHRDLFPLDQFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLV 176
E +K A ++ S + VS LV +L +S +C A D + ++
Sbjct: 161 IEDYKWALCSIWSRAMDF-AVSGTTSVRLVAPLADMLNHSPDVKQCHAYDPTSGD-LSIL 218
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
+ Y+ G+ + ++ G PN++LL YGFV DNP D + + P Y+ K +
Sbjct: 219 AAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWA 278
Query: 237 RNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 293
G S + K+ + +++L YLR+ + D S + + L G VS E
Sbjct: 279 LAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGKVSDGNE 334
Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQV 351
VL L D + L G+ L + EA L DY A + E+++L +
Sbjct: 335 MQVLQFLVDSIGSLLEGFGIPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVLTRAKRT 394
Query: 352 TADMI 356
D++
Sbjct: 395 AEDLL 399
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/206 (23%), Positives = 90/206 (43%), Gaps = 16/206 (7%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
KL+ + L L+++ +K G+ S W P++ L R +P+ L
Sbjct: 100 KLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTTPVFLRRKVFESLP-KD 151
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
+ E+ I+R + +L + + G + ++ + FT+ F A+ AV + +
Sbjct: 152 LREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTWNNFVWAWTAVNTRCIF 209
Query: 137 LQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q + + + AL P L + K A V + +++ + + A E + +
Sbjct: 210 AQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISY 268
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +L ++YGFV DNP D +VV
Sbjct: 269 GPHSNRRLFLDYGFVLPDNPNDVVVV 294
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 83/203 (40%), Gaps = 32/203 (15%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L L+YE QG+ S W PY+ L ++P+ WS TEL+ L S A++
Sbjct: 109 LILVLIYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-G 160
Query: 84 RAEG----------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIF 123
RAE + R + E V+F G L + I AF E
Sbjct: 161 RAEADRMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLEND 217
Query: 124 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
++ +VP+ +L ++ A + DDA+ RP +A
Sbjct: 218 DSDNDEADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRA 276
Query: 184 GESIVVWCGPQPNSKLLINYGFV 206
G+ I+ + GP PN +LL YG+V
Sbjct: 277 GDEILNYYGPLPNGELLRRYGYV 299
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 85/207 (41%), Gaps = 20/207 (9%)
Query: 7 YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
Y A + K S L L+LMYEK K SFW PYIR L +P ++
Sbjct: 113 YVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD-------TFNTPCYFTR 165
Query: 67 TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
EL +L + + E+ IK+ Y + + Q + + FE FK A
Sbjct: 166 KEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF-WRTVDFESFKWA 220
Query: 127 FVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDR 179
+ V + V H + A+ AL PL LL + K + + ++ V
Sbjct: 221 WCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRFNSSSKNYEINVIT 279
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV 206
Y+ G + + GP N++L + YGFV
Sbjct: 280 EYQKGTQVFINYGPHDNTRLFLEYGFV 306
>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
Length = 625
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 29 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 84 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 202
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
N +L++NYGF NP D +V P Y K
Sbjct: 203 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)
Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 214
LL +SS+ +++ +++V ++ + GE++V+ GP N LL++YGFV NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177
Query: 215 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 257
LV + +++ +DP ++ + N G S Q+ V G +
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235
Query: 258 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
+L +R+ + D E+ V + + P+ ER + L LA +P
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295
Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+ ED++ L ++ R+A Q ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327
>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 699
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 276
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
N +L++NYGF NP D +V P Y K
Sbjct: 277 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 14/193 (7%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
L L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 90 LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147
Query: 86 EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ +KRE+ E + + L + Y+ F+ F+ + V ++ L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207
Query: 138 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
+ VS+ L+PL G +A + + A A QL V R ++ G+ I P
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSP 267
Query: 194 QPNSKLLINYGFV 206
+ N++LL+ YGF+
Sbjct: 268 KTNAELLLGYGFM 280
>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
1015]
Length = 668
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 250
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
N +L++NYGF NP D +V P Y K
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 91/200 (45%), Gaps = 23/200 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA ++++E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVS 141
+K EY+ +A S+ + Y + E++T++IF A+ +A ++ +
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIASRAFTSRVLDAY 214
Query: 142 LARRFAL-----VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
L+ +L + PL+ +S+ K A ++L V P GE + GP
Sbjct: 215 LSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEVHNNYGP 274
Query: 194 QPNSKLLINYGFVDEDNPYD 213
N +L+ YGF DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 99/237 (41%), Gaps = 25/237 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + P+ W E E+ L P +
Sbjct: 105 SPLLALCTFLVLEKHAGDQSSWKPYLETLPK-------TYTCPVCW-EPEVVNLLPRPLR 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
A+ E+ ++ + + LF + +I FT+ A+ V + V+L+
Sbjct: 157 AKAQEQRTRVQEFFTSFRDFFSSLQPLFSEAVENI----FTYSALLWAWCTVNTRAVYLR 212
Query: 139 KVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
L R F+ P L P LL +S + KA ++V + E + +
Sbjct: 213 HRQL-RCFSAEPDTCALAPYLDLLNHSPDVQVKAAFNEKTRCYEIVAVSSCRKHEEVFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+ H
Sbjct: 272 YGPHDNHRLLLEYGFVSTRNPHACVYVSRDILVKYLPSTDKQM----NKKISILKDH 324
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 85/216 (39%), Gaps = 44/216 (20%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L L ++YE G S W PY L + + + W+E ELA L S ++
Sbjct: 85 SLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKVG 137
Query: 83 ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
ERA+ +E L+ + M GSL Y +D+
Sbjct: 138 KESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKMGSLIMAYAFDV 197
Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
T E+ ++ A + L K +VPL L A + +C A L D +
Sbjct: 198 EPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCL 249
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
++ +P +AGE I GP P S LL YG+V ++
Sbjct: 250 EMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 119/298 (39%), Gaps = 39/298 (13%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G+KS W PY+ L + P+ E ++ L P
Sbjct: 104 VSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVSLLPEPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + E+ + Y + LF + I F + + A+ + + +++
Sbjct: 156 RKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTINTRTIYM 211
Query: 138 QKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
K S + F+L P L P LL +S + KA + ++ + K E + +
Sbjct: 212 -KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEVFI 270
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
GP N +LL+ YGFV DNP+ + V +A + DK QRN K+S+ H
Sbjct: 271 CYGPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSILKDH-- 324
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
D+L L G+ + + + + L C R + L D AR
Sbjct: 325 ---------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDVISAR 370
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 29/230 (12%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEIL 82
+LM + + ++ FW PYI+ L G + +PLL+ E +LA+L + A
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 132
R + K Y + A S+ Q Y +D+ A T I +AF A + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ LQ R ++ L P + A + K K A D++ LVV +AG+ +
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNN 302
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
GP+ N +L++NYGF DNP + VV + P Q K Q K
Sbjct: 303 YGPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 89/210 (42%), Gaps = 17/210 (8%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L++ L L+++YEK + SFW P+ L + + ++ TEL L G+
Sbjct: 809 LNDDTILFLFVIYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNL 860
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
AE L+ +K + + F L +QYP P F++E F A S + L
Sbjct: 861 FAETLQ----VKEHLQSIRDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL 914
Query: 138 QKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
K+ LVP+ + + + + D ++V I + G
Sbjct: 915 -KIDDKITNCLVPMADMINHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQ 973
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTED 225
N +L + YGFV +DNPYD +++ L ED
Sbjct: 974 NRELALYYGFVIQDNPYDSMLIGFDLPDED 1003
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/355 (22%), Positives = 140/355 (39%), Gaps = 42/355 (11%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G++S W PY+ L + P+ E ++ L P
Sbjct: 103 VSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCPVC-LEQDVVQLLPEPL 154
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + E+ + Y + SLF + I F + + A+ + + +++
Sbjct: 155 RKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYSALEWAWCTINTRTIYM 210
Query: 138 QKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
K S F+L P L P LL +S + KA ++ + K E + +
Sbjct: 211 -KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQCKKYEEVFI 269
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
GP N +LL+ YGFV DNP+ + V + + DK Q+N KLS+ H
Sbjct: 270 CYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDK----QKNAKLSILKEH-- 323
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
D+L L G+ + + + + L C RA+ L D AR
Sbjct: 324 ---------DLLENLTFGWDGPSWRLLTALKVLSLGGDEFTCWRRAL---LGDVISARNE 371
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 364
++ + + + + V Q+ +M++ N Q+T + L D+ +
Sbjct: 372 QQALNIT---TKICHFLIEETQHVLLQISQMKRDKENLKEQLTLVEALRLEDLKI 423
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 27/203 (13%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ + QG+ FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 97 FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151
Query: 86 EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 134
E +K Y + V + + FQ +Y +D I + AF+ ++ +AF +
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+ VS+ L+P L+ + K A V +V AG+ I GP+
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPR 260
Query: 195 PNSKLLINYGFVDEDNPYDRLVV 217
N +L++NYGF DNP D +V
Sbjct: 261 NNEQLMMNYGFCLPDNPCDYRIV 283
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/369 (23%), Positives = 143/369 (38%), Gaps = 51/369 (13%)
Query: 15 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
+ L L L ++YE QG+ S W Y+ L + A E+P+ W+ EL L G
Sbjct: 100 SEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEG 152
Query: 75 SPTKAEILERAEGIK--REY--------------------NELD--TVWFMAGSLFQQYP 110
+ E + + E + RE NE D ++ GS Y
Sbjct: 153 TSLTTEKIGKKESDRMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212
Query: 111 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 170
+D+ E E + ++ + SL +VP+ +L +++ A + D
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGD 262
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTE 224
+ AG I+ + GP P+S+LL YG+V E + YD +V AL E
Sbjct: 263 QLQVTSLRESIPAGSEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEE 322
Query: 225 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS----DMLPYLRLGYVSDTSEMQSVIS 280
+D +R + ++ F V E S P LR + ++ +
Sbjct: 323 LKLSEDTIADIERKLESELEEFFVIERDAGEPSSYGTLTQPPVLREISTELEEQTKAFLK 382
Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
+L P E L + RL YP + +DE++L+ L + R+A ++
Sbjct: 383 ALKKRDPKRKRSETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRLG 442
Query: 341 EKKMLNACL 349
EK++L L
Sbjct: 443 EKRLLQEAL 451
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 105/251 (41%), Gaps = 32/251 (12%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAY 71
L E AL+L+ E+ G+ S W I L G G+ + +P+ W E A
Sbjct: 231 LGEDTIAALWLIAERALGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDAL 289
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAV 130
L G+P A+ + +E + R++ L F A ++ D+ P A+T + F++A A
Sbjct: 290 LGGTPLLADAIAASEKLARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAW 342
Query: 131 QSCVVHLQKVSLARRFALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 181
S + +Q +P L P ++ YS D ++L V R
Sbjct: 343 NSYGMTVQASPGEPAATCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSV 395
Query: 182 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGK 240
AGE + V G + N++LL+ YGF NPYD + + L E R A
Sbjct: 396 HAGEEVFVSYGAKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAG 455
Query: 241 LSVQVFHVHAG 251
L++ V AG
Sbjct: 456 LTLSPHAVRAG 466
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 133/333 (39%), Gaps = 49/333 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEIL 82
L + LM+E Q K SFW Y+ + G PL W E E G P +I+
Sbjct: 186 LLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDII 239
Query: 83 ERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVV 135
E I+ EY E ++ + GSL E ++ E FK+ AFV S
Sbjct: 240 VDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFTE 289
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
+ S+ VP+ +L + S A L +Q++ R K GE + G
Sbjct: 290 DEESPSM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKLG 342
Query: 196 NSKLLINYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLSV 243
N++LL YG+V+ N YD L++ + +DP K + R G V
Sbjct: 343 NTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAEV 402
Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQL 300
F + D++ +L++ + SD E++ ++ + P S + + L +
Sbjct: 403 DAFFMFDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSKK 461
Query: 301 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 333
+ K L ++ED+ + N + +K V
Sbjct: 462 TE--KNSLGMTVIDITEDDTEMDIENFNKRKNV 492
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 88/198 (44%), Gaps = 22/198 (11%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
+ L + +M+E +G +S W PY+ L + E+P+ WS EL L S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595
Query: 82 LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 128
+A+ + + ++ V +F Q Y D + ++ F+ +
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
+ ++ +VP+ +L ++ A + DDA+ + R KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713
Query: 189 VWCGPQPNSKLLINYGFV 206
+ GP PNS+LL YG+V
Sbjct: 714 NYYGPHPNSELLRRYGYV 731
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)
Query: 40 WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 88
WL I L R+R R G L +S + WSE EL L GS + E +
Sbjct: 125 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 184
Query: 89 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 146
K E +L+ + P + + F I + +AF +++ +V + L
Sbjct: 185 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 244
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 205
A V AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 245 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 302
Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 265
++ + + ++ DP + DK VA+ NG F + R +LPYLR
Sbjct: 303 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 360
Query: 266 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 320
L + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 361 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 420
Query: 321 MLTDYNLHPKKRVATQLVRMEKKML 345
L + NL + +A + EK +L
Sbjct: 421 -LKEGNLDSRLAIAVGIREGEKMVL 444
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 135/325 (41%), Gaps = 24/325 (7%)
Query: 40 WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 88
WL I L R+R R G L +S + WSE EL L GS + E +
Sbjct: 77 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 136
Query: 89 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 146
K E +L+ + P + + F I + +AF +++ +V + L
Sbjct: 137 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 196
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 205
A V AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 197 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 254
Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 265
++ + + ++ DP + DK VA+ NG F + R +LPYLR
Sbjct: 255 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNR--TLPPGLLPYLR 312
Query: 266 LGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 320
L + T ++S+ G + VS E + + + K+ LAGY T+ +D
Sbjct: 313 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 372
Query: 321 MLTDYNLHPKKRVATQLVRMEKKML 345
L + NL + +A + EK +L
Sbjct: 373 -LKEGNLDSRLAIAVGIREGEKMVL 396
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 128/316 (40%), Gaps = 34/316 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
LS A+++MY K +KSFW PY+ L + + L +S+ EL L S
Sbjct: 114 LSPTNLQAVFIMYSK-LNEKSFWHPYVTVLPE-------SFSTSLYFSDNELDELQASQL 165
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA-FVAVQSCVVH 136
K + R +GI+R Y S F + +P E ++ Q F SCV
Sbjct: 166 KEFTIIRKDGIERHYE----------STFSRLSKLVP-EFSNLALYNQELFTWALSCVWS 214
Query: 137 LQKVSLARR-FALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG-- 192
+ SLA +VPL A SK K + D + GE I G
Sbjct: 215 -RAFSLAENDGGMVPLADMFNAEDRSKSKVLPKVTDTTLDYYASDDIAEGEQIFTPYGVY 273
Query: 193 -PQPNSKLLINYGFV-DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
P +S++L++YGF+ DE D + + + ++P K+ + + N ++ +VF +
Sbjct: 274 KPLSSSQMLMDYGFIFDEGTVSDNVAITVPVFHNDEPNLSTKQEILEENDIIN-EVFLLQ 332
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP---ICPVSPCMERAVLDQLADYFKA 306
A D+L Y R+ + Q+ L P P++ E+ L L +
Sbjct: 333 KTDPLPA--DLLLYARVKNLIAKECDQAKKHFLSPNTRNTPLNTRNEKVSLRFLENLIHR 390
Query: 307 RLAGYPATLSEDEAML 322
L Y L D+ +L
Sbjct: 391 YLDSYGTNLESDKNLL 406
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)
Query: 40 WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 88
WL I L R+R R G L +S + WSE EL L GS + E +
Sbjct: 81 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 140
Query: 89 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 146
K E +L+ + P + + F I + +AF +++ +V + L
Sbjct: 141 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 200
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 205
A V AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 201 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 258
Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 265
++ + + ++ DP + DK VA+ NG F + R +LPYLR
Sbjct: 259 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNR--TLPPGLLPYLR 316
Query: 266 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 320
L + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 317 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 376
Query: 321 MLTDYNLHPKKRVATQLVRMEKKML 345
L + NL + +A + EK +L
Sbjct: 377 -LKEGNLDSRLAIAVGIREGEKMVL 400
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 133/333 (39%), Gaps = 54/333 (16%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L L ++YE G S W PY L ++ + W+E EL L S +I
Sbjct: 85 SLILVMLYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIG 137
Query: 83 ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
ERA+ + +E + + M GSL Y +D+
Sbjct: 138 KEGANEVFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDV 197
Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
A + + + A + L K +VPL L A + +C A L D +
Sbjct: 198 EP-AISDKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGL 249
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQY 228
++ +P +AGE I GP P S LL YG++ E+ YD + + A L ++ D +
Sbjct: 250 EMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLW 309
Query: 229 QDKRMVAQRNGKLSVQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 285
+KR+ ++ + + A +E++S L L + + E + + S G +
Sbjct: 310 HEKRIEYLDEQEIVDTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL 368
Query: 286 CPVSPCMERAVLDQLADYFKARLAGYPATLSED 318
P + M + L +AR+A YP TL +D
Sbjct: 369 -PKAEKMTGKAAEILYKIVQARIAQYPTTLEQD 400
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)
Query: 181 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 235
++ GE +++ G + N +L ++YGFV+ + + D + ++ DP + DK +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195
Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 288
+ NG + F + G + ML +LRL + T SV L PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251
Query: 289 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
S E A+ + D ++ L+GY T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 21/211 (9%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L E L+L+L+ EK + ++S W +IR + + ++ WSE +A L P
Sbjct: 19 QLCERQLLSLHLLVEKWKAERSRWWRFIRSI-------PPSYDTLENWSEQSVARLQYKP 71
Query: 77 TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
A R + E+++L + W + + + +F+ E + A
Sbjct: 72 FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131
Query: 129 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 182
V + H ++ S + R V P+L + + A +AA DA+ ++ R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
GE +++ G N+ LL +YGFV EDNP D
Sbjct: 192 EGEQVMIHYGNWSNAGLLEHYGFVLEDNPLD 222
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 90/236 (38%), Gaps = 75/236 (31%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L+++ L L L L+YE +G S W PY+ L E+P+ W+ EL L
Sbjct: 96 LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
T ++ RE E DT + + + +P ++F+ +
Sbjct: 149 QAGATAGKV-------GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDE 190
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYS---------------------SKCKAMLAAV-- 169
+V +LA R +G ++AY+ KAM+ V
Sbjct: 191 ALV-----ALAHR-----MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPM 240
Query: 170 -----------------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 208
D+ + + RP KAGE I+ + GP PNS+LL YG+V E
Sbjct: 241 ADILNADAEFNAHVNHGDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 105/253 (41%), Gaps = 51/253 (20%)
Query: 16 NKLSELAC---LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
++LSEL+ L L +MYE ++ SFW PY L R+ +P+ W++ +L L
Sbjct: 74 SELSELSGWSPLILCMMYEIEK-PDSFWKPYFDVLPRE-------FTTPMFWNQEDLKEL 125
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAV 130
G+ ++I + E + +NEL+ + ++YP + T E+F + +
Sbjct: 126 EGTDIISKI-GKKESEELFHNELEPI-------IKKYPNLFDEQKHTIELFHICGSLIMA 177
Query: 131 QSCVVHLQKVSLARR------------------------FALVPLGPPLLAYSSKCKAML 166
S LQK ++VP+ L + A L
Sbjct: 178 YSFNDELQKAPKENNKEEEKEEEEEEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARL 237
Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR------LVVEAA 220
D++Q+ + K GE I G N+ LL YGFVDE N +D L+VE
Sbjct: 238 FHEPDSLQMRAIKDIKEGEQIYNTYGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVC 297
Query: 221 LNTEDPQYQDKRM 233
+D +++++
Sbjct: 298 CEDQDEALKERKI 310
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 121/301 (40%), Gaps = 38/301 (12%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
P+ WS+ EL++L GS +I ER I+++Y+ + D F+
Sbjct: 227 PIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA-------------DPSFSRFSL 273
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 179
E F A + V S L + + ALVP L Y + + D +
Sbjct: 274 ERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGFTITSLE 332
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRM 233
+G + G + N + L+NYGF EDN + ++V+ L+ D Q +
Sbjct: 333 SICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQLDPGDGQ-----L 387
Query: 234 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDTSEMQSVISSLGPIC-- 286
+ + L + R + SD P R G+ ++ T + S + P
Sbjct: 388 LYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAATEDEFSSMKMRSPAHAS 445
Query: 287 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNLHPKKRVATQLVRMEKK 343
P+S E A L+ L +L YP +L E EA+L +Y L+ + A +R EK+
Sbjct: 446 PPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFFIRGEKQ 505
Query: 344 M 344
+
Sbjct: 506 V 506
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/359 (21%), Positives = 141/359 (39%), Gaps = 56/359 (15%)
Query: 27 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
+L+ E ++ + SFW PYI L + P + + A L GSPT ++ + +
Sbjct: 122 HLLQENRR-QNSFWKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRK 174
Query: 87 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 146
+ EY+ L A FQ+Y Y T+ F V + ++++R F
Sbjct: 175 TFQEEYDNLKE----AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSF 212
Query: 147 -----------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGP 193
LVPL + + + DA + R + GE + G
Sbjct: 213 PVYIGENEQQQLLVPLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQ 271
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHA 250
N +NYGF NP ++ + L+ D ++ K + N G V
Sbjct: 272 WSNKYFFMNYGFASLTNPMNQFDFDICLDRNDRMFKMKVELTGGNICWGNRLVNETDHDT 331
Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYF 304
R+ A + + ++ + D +++ + + P +P +E+A L D
Sbjct: 332 FRQSLAT---VRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLL 388
Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 363
+ LA + +T+ +D+ L D + +R L EK+++ ++V DM++ + D T
Sbjct: 389 VSELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 149/367 (40%), Gaps = 53/367 (14%)
Query: 20 ELAC-LALY-LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
E+ C LAL ++ + Q + S W PY+ L + ++S + WSE+EL L S
Sbjct: 79 EMPCWLALTAIILTEGQQEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTV 131
Query: 78 KAEILE-RAEGIKREY-------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
+I AE + E+ N + S+ Y +DIP K+
Sbjct: 132 VNKIGRASAEQLFLEHISPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDD 183
Query: 130 VQS-----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
+S +V + +++PL L A + A L ++ +++ +P G
Sbjct: 184 PESPEDGDDLVSDNEEEENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKG 243
Query: 185 ESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDP---------- 226
E I+ G P S LL YG++ D+ YD V E A+L+TE P
Sbjct: 244 EEILNDYGQLPRSDLLRRYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPL 301
Query: 227 ---QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 283
+ + + +AQR G H G + +I D L L + D + ++ +S
Sbjct: 302 SREKLEQRVELAQREGVYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHA 361
Query: 284 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 343
+ S V L ++R Y T+ D+A+L NL +KR+A ++ EK
Sbjct: 362 SLPSRSKLATSLVGQILTKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKL 421
Query: 344 MLNACLQ 350
+L +Q
Sbjct: 422 VLEKAIQ 428
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 98/221 (44%), Gaps = 35/221 (15%)
Query: 21 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
A + L++ K+QG +S P+I +L G PL WS+ +LA L A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190
Query: 81 ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 122
+ E+ +RE+ L DT+ W+ G + + + P T
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245
Query: 123 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 180
+ V + VV L + SL +++A+ PL L ++S ++ ++ D+ +V R
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+K GE + + G Q N L+ YGF + DNP D V+ L
Sbjct: 303 FKKGEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 125/298 (41%), Gaps = 30/298 (10%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 119
P+ W+E E+ L GS ++ ER + I+ +Y G + YP P + T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVD 178
E FK A + V S L ++ R ALVP L Y + K +
Sbjct: 259 LEEFKWARMCVCSRNFGLD-INGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTL 317
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKR 232
G + G + N + L+NYGF E+N + + + L+ DP Q K
Sbjct: 318 HRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKA 377
Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVS 289
+ +G +V + G + +A+ L LR+ V+D +EM + + ++ + P+S
Sbjct: 378 RFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLS 434
Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 345
E A +++L L YP TL ED A L + L P +R A V EK +L
Sbjct: 435 VRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)
Query: 31 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 90
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 91 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 150
E+ ++ + LF + + E F + V S + +++ LV
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603
Query: 151 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
+ P+L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661
Query: 206 VDEDNPYDRLV 216
+N YD ++
Sbjct: 662 TVPENRYDEVM 672
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 14/204 (6%)
Query: 38 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-D 96
SFW PYI L ++ PL W +L L GS E + + + R+Y L +
Sbjct: 118 SFWRPYIACL-------PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHN 170
Query: 97 TVWFMAGSLFQQYPY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVP 150
+ A +P + EAFTFE ++ A V + + + + AL+P
Sbjct: 171 KLSVRANPSCSCFPLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIP 230
Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
L + + + + ++ V P K G I +W G + N + L++ GF +
Sbjct: 231 LWDMINHANHPMSTQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGFFFAGH 290
Query: 211 PYDRLVVEAALNTEDPQYQDKRMV 234
D + V +L+ D Y+ K ++
Sbjct: 291 ANDYVNVPFSLDETDSLYKIKALL 314
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 85/202 (42%), Gaps = 23/202 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
+AL++ E+ +G+K+ ++ L R + ++ WSE EL LTG+ E +
Sbjct: 70 VALHVCAERFRGEKATRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMN 122
Query: 84 RAEGIKREYNELDTVW--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
E K +Y L G +++ D +E + A + S L
Sbjct: 123 LREETKNDYETLTKKMEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPD 175
Query: 142 LARRFALVPLGPPLLAYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQP 195
R A+VP + +S+K L A + V + YKAGE I G
Sbjct: 176 GKRTRAMVPT-FDIFNHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAA 234
Query: 196 NSKLLINYGFVDEDNPYDRLVV 217
NSKLL YGF +DNPY+ L V
Sbjct: 235 NSKLLTWYGFCIDDNPYEELDV 256
>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 692
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 94/211 (44%), Gaps = 12/211 (5%)
Query: 7 YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
++ +++L+ ++ + LA ++ +E+ +G+ S W PY+ L R + S L + +
Sbjct: 92 FHCSDILS-ERVRKSIILAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQD 145
Query: 67 TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL-FQQYPYDIPTEAFTFEIFKQ 125
++L +L G+ +K EY+ ++ G L + Y +DI A+T I +
Sbjct: 146 SDLDWLQGTNLYQTHQAYRNTVKEEYDSAISILRDEGCLAVESYSWDIFCWAYTL-IASR 204
Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYK 182
AF + + +L ++ + PL+ S+ K A + L V P
Sbjct: 205 AFTS-RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTF 263
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
GE I GP N +L+ YGF DNP D
Sbjct: 264 TGEEIHNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 87/202 (43%), Gaps = 13/202 (6%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
E+ L + +M+E+ +G +S W PY+ L PL W E L G+
Sbjct: 116 EVQGLVIAVMWERWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYD 169
Query: 80 EILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHL 137
++L RA+ ++ +W + G ++P +P +E+++ A AV S L
Sbjct: 170 KMLGRAQHPSDAPTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFIL 229
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
+ A+VP+ L + L + +Q++ R AG +V G
Sbjct: 230 GD---DKYQAMVPVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELS 286
Query: 196 NSKLLINYGFVDEDNPYDRLVV 217
N++LL YGFV+ N Y+ + V
Sbjct: 287 NAELLRGYGFVERANRYNHIPV 308
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 154/386 (39%), Gaps = 70/386 (18%)
Query: 21 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
A +A L++ K+QG +S P+I +L G P+LW E ++A L A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187
Query: 81 ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
+ E+ ++ Y +L D W M S + + P T + +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245
Query: 126 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 158
A +A + V+ L QK A +++A+ PL L +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304
Query: 159 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 216
SS ++ +A D+ +V R +K GE + + G Q N L+ YGF + +NP D V
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYV 364
Query: 217 VEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 274
+ L T R+ A + L+ + V R S+ L +R +D SE
Sbjct: 365 MTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQAVRFLLAAD-SE 422
Query: 275 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-- 332
+ +SS SP E + + +A+ + L ++L ED A+L+ R
Sbjct: 423 AGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLSSTGASAGGRKG 480
Query: 333 -------VATQLVRMEKK-MLNACLQ 350
VA R+EKK +L A LQ
Sbjct: 481 GTAAAAAVAAVAFRVEKKRLLTAVLQ 506
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 54/116 (46%), Gaps = 12/116 (10%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
K+ + ++ L+YE G +S W YI+ L L+V+ P+LW++ +L L G+
Sbjct: 95 KIKDATATSIALLYEASIGSQSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTS 147
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
+ + E E + YN+ ++ L +P F+ + FK+A V S
Sbjct: 148 IETVVYENKETVDATYNK-----YIKSKLIANHPDVFNEHVFSLDNFKRASCLVSS 198
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 89/221 (40%), Gaps = 38/221 (17%)
Query: 9 AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
E T++ L L L+YE QG S W PY+ L A ++P+ WS TE
Sbjct: 98 GGEDGTSSSQDSWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTE 150
Query: 69 LAYLTGSPTKAEI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
LA L S ++ E A+ + R D V+F G Q+ D FE+
Sbjct: 151 LAELQASALVTKVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFEL 203
Query: 123 FKQAFVAVQSCVVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAM 165
+ A+ + L+K A +VP+ +L ++ A
Sbjct: 204 AHRMGSAIMAYAFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAY 262
Query: 166 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+ D++ R KAGE I+ + GP PN +LL YG+V
Sbjct: 263 INHGADSLTATALRTIKAGEEILNYYGPLPNGELLRRYGYV 303
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 88/201 (43%), Gaps = 20/201 (9%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
YL+ + +GK SFW PYI L + A+ P W+E ++ L G+ I E
Sbjct: 107 FYLIQQYLKGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQ 164
Query: 86 EGIKREYNELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV- 140
+K EY + + G + Q Y+ FT F+ + V +S +++++
Sbjct: 165 SNVKSEYKQARKILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLL 224
Query: 141 ---SLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
S F+++ PL ++ + L + A +L+ + Y G+ + G
Sbjct: 225 PEGSKIDDFSIL---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYG 281
Query: 193 PQPNSKLLINYGFVDEDNPYD 213
+ NS+LL+ YGF+ NP D
Sbjct: 282 FKTNSELLLGYGFI--INPKD 300
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 102/247 (41%), Gaps = 57/247 (23%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEIL 82
L + +MYE+ +GK S W Y++ L Q P LWSE E+ L G+ +
Sbjct: 53 LTVAVMYERSKGKGSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALK 105
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVS 141
E +K ++ E L ++ P + P + FTFE +++A +S V ++
Sbjct: 106 EDKLLMKEDWEE------NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEID 155
Query: 142 LARRFALVPLGPPLLAYSSKCKA-----MLAA-------------VDDA----------- 172
+ +VPL ++ K A ML A +DD
Sbjct: 156 AEHGYGMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISS 212
Query: 173 ----VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDP 226
+++V+ + AG I G N+ LL YGF + +NP+D L ++ L
Sbjct: 213 DKSVLEMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLS 272
Query: 227 QYQDKRM 233
++Q KR+
Sbjct: 273 RFQKKRV 279
>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
Length = 692
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/223 (22%), Positives = 92/223 (41%), Gaps = 36/223 (16%)
Query: 7 YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
++ +++L+ ++ + LA ++ +E+ +G+ S W PY+ L R + S L + +
Sbjct: 92 FHCSDILS-ERVRKSIILAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQD 145
Query: 67 TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
++L +L G+ +K EY+ ++ G L E+++++IF A
Sbjct: 146 SDLDWLQGTNLYQTHQAYRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWA 197
Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-------------- 172
+ + S + +R P L + + ML VD +
Sbjct: 198 YTLIAS------RAFTSRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEA 251
Query: 173 --VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
+ L V P GE I GP N +L+ YGF DNP D
Sbjct: 252 TEIGLKVIEPTFTGEEIHNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 116 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 172
E F+ F+ A++ V S CV + L + F LVPL + +SS C D A
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180
Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 219
+L + RP K GE IV G + L YGF++ NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERIDLTA 227
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
DD + V + + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 42 DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101
Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 281
K V R G + VF +H E + +L +LR+ +++ + +I +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160
Query: 282 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
LG PVS E +L + +AR L Y T+ ED++ L +L +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216
Query: 337 LVRMEKKMLNACLQVTA 353
L EK++L ++ A
Sbjct: 217 LRLGEKEILEKAVKSAA 233
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VPL L A S A L + + + + GE I PN+ LL YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319
Query: 207 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 248
D++NP+D RL E +L+ DPQ Q+ + K +++V +
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378
Query: 249 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
+ RE KE + D L + +S E Q+ S G + P M + R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435
Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 343
L Y T+ D+ +L D +L ++++ + VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 138/343 (40%), Gaps = 48/343 (13%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 82
L L LMYE+ S W Y+ + +S + WS+ EL L GS +I
Sbjct: 83 LILCLMYEQID-PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGR 134
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
E AEG Y+ L +F+ P EA++ ++ + + S H+
Sbjct: 135 EEAEG--SYYSILVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDD 186
Query: 143 ---------------ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
+VP+ L A S A L DA+ + + AGE I
Sbjct: 187 SASDASDDDDAAYHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQI 246
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVF 246
PN+ LL YG VDE N D + + A L +D +R L ++
Sbjct: 247 FNTYNDPPNADLLRRYGHVDEVNLNDNVEISADLIGCKD---------LERVDWLLDRLD 297
Query: 247 HVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
V+ + E + D + +++ + SE + + + P ++ A ++ + +
Sbjct: 298 DVYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQ 353
Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 347
RLA Y +T+ EDE++L + + A LVR+ EK++L A
Sbjct: 354 MRLAQYSSTIEEDESLLASSTMLTSRSRAALLVRLGEKRILAA 396
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 84/200 (42%), Gaps = 19/200 (9%)
Query: 27 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT--KAEILER 84
+L+ E+ +GK SFW PYI L + +P S T+ A PT ++ + +
Sbjct: 125 FLLTERSKGKSSFWYPYINVLPKD-------FTTPAFGS-TKQADFDVLPTIARSRAINQ 176
Query: 85 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQSCVVHLQKV 140
+ I+ + ++ F QY ++F + F + ++ C K
Sbjct: 177 LQDIRAAFESASCLFEDIERTFPQYRIFFSLDSFVWAWFVINSRSVYIEPSGCEAFDPKA 236
Query: 141 SLARRFALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
S FAL P LL +S ++ A V + ++ Y A + + + GP N
Sbjct: 237 S--DDFALAPFLD-LLNHSPGAEVTAGFDPVSNCYRIKTLDSYHAYDQVFIHYGPHDNVN 293
Query: 199 LLINYGFVDEDNPYDRLVVE 218
LL+ YGFV NP+D + E
Sbjct: 294 LLLEYGFVIPSNPHDAVSFE 313
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 143/371 (38%), Gaps = 65/371 (17%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 138 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
+ +VP+ L A +++ A + + + KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262
Query: 184 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
G+ I+ + GP P S+LL YG+V E + YD + V L E V + LS
Sbjct: 263 GDQILNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSSLSLS 313
Query: 243 VQVF-HVHAGREKEAISDMLPYLRLGYVSDTSE--------MQSVISSLGP--------- 284
+ + V + + E I D R D+ E +V+ + P
Sbjct: 314 AEAWKQVESQIDDEEIEDYFVIER-----DSGEPGPDGRFTAPAVLREVSPELVEQLKEF 368
Query: 285 ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 334
+ V + D+ +A+ K RLA YP ++ DE +L + +L ++R+A
Sbjct: 369 LKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRRMA 428
Query: 335 TQLVRMEKKML 345
+ EKK+L
Sbjct: 429 VVVRLGEKKLL 439
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 71/344 (20%), Positives = 132/344 (38%), Gaps = 33/344 (9%)
Query: 27 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
+L+ E ++ K SFW PYI L + P + + A L GSPT ++ + +
Sbjct: 123 HLLQENRR-KYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVINQRK 175
Query: 87 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 146
K EY L A FQ+Y Y T++ F + + S +Q ++
Sbjct: 176 VFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEKEQQQ 224
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYG 204
LVPL + + + DA + R + GE + G N +NYG
Sbjct: 225 LLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYG 283
Query: 205 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAISDML 261
F NP ++ ++ LN D + K + + N G V R+ A +
Sbjct: 284 FASLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLA---TV 340
Query: 262 PYLRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGYPATL 315
+ ++ + D +++ + + P + +E+A L L + +T+
Sbjct: 341 RFTQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNFASTI 400
Query: 316 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 359
+DE L D ++ L EK+++ + + M+ ++
Sbjct: 401 EDDERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444
>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 704
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 90/203 (44%), Gaps = 27/203 (13%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 124 FFLIGQYLKGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRM 178
Query: 86 EGIKREYNELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCV 134
K +Y T AG FQ QY +D+ + AF+ ++ +A V+
Sbjct: 179 GVWKEKYKNGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE--- 233
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+ VS+ L+P L+ + K A V VV +G+ I GP+
Sbjct: 234 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPR 287
Query: 195 PNSKLLINYGFVDEDNPYDRLVV 217
N +L++NYGF DNP D +V
Sbjct: 288 NNEQLMMNYGFCLPDNPCDYRIV 310
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)
Query: 36 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 95
KKS W PY+R L R + + + W EL + S E +ER E +E++ +
Sbjct: 38 KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92
Query: 96 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 155
LF + E F A S +V + +R +L+P
Sbjct: 93 KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137
Query: 156 LAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
L + ++L D +++ DR Y GE ++V G N+ L +N+GF N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197
Query: 214 R 214
+
Sbjct: 198 Q 198
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 21/186 (11%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + +SE EL G+ + + IK +Y +L A LF Q+P P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160
Query: 120 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 166
E +K A V S + + L P +L +SS +C A
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219
Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
D + + + Y+ G+ + ++ GP PN++LL YGFV DNP D + + P
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLTTHPMAP 278
Query: 227 QYQDKR 232
Y+ K+
Sbjct: 279 FYEQKQ 284
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
R AL+P+ L ++ C + +A + DR Y+AGE + G N LL YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650
Query: 205 FVDEDNPYDRLVVEAAL 221
FV DNP+D+L ++ L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 83/186 (44%), Gaps = 20/186 (10%)
Query: 40 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 99
W P +R L + S L W+ +L + G + E +K +Y+ L
Sbjct: 77 WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLANRLREYRRSVKVQYDAL---- 125
Query: 100 FMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQK-VSLARRFALVP----LGP 153
F A L +Q P P AF + F++A+ S + +Q SL R +VP
Sbjct: 126 FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKVQDPDSLQIREVIVPGVFLCNH 183
Query: 154 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
L A+S + ++ A +L + R GE+I + G N+ LL+ YGF E+NPYD
Sbjct: 184 SLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYGRLDNADLLMFYGFSLENNPYD 242
Query: 214 RLVVEA 219
R+ + +
Sbjct: 243 RVSLHS 248
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 141/330 (42%), Gaps = 19/330 (5%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ E+ + S W Y L ++ +S + WSE EL+ L G+
Sbjct: 125 AVALFLIRERSR-SNSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTR 176
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKV 140
+ ++ EY L+ + P + + F I + +AF +++ +V +
Sbjct: 177 SVKQYVENEYRRLEEEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFA 236
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKL 199
A V AY K A L + D L KAG+ + + + + N++L
Sbjct: 237 DFINHSARVTTEDH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAEL 294
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAIS 258
++YGF++ + + + ++ DP + DK +A+ NG F + ++ +
Sbjct: 295 ALDYGFIEPNADRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLL 354
Query: 259 DMLPYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATL 315
L + LG +D ++S+ S G + PVS E + + + K LAGY T+
Sbjct: 355 PYLRLVALG-GTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTI 413
Query: 316 SEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
ED+ L + L + +A + EK++L
Sbjct: 414 EEDQK-LKEAKLDSRHAIAVGIREGEKQLL 442
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/206 (23%), Positives = 84/206 (40%), Gaps = 28/206 (13%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
++ LS L ++++ E+ G+KS W PYI + P ++ +
Sbjct: 83 MSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI-------------PDVFDGLQCRKS 129
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
P + A+ I R E V+ F+ D+ FE F A+ AV +
Sbjct: 130 VRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----NFETFSWAWSAVNT 179
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
++++ L P LL + K + V++ + + Y+AG + + G
Sbjct: 180 RCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNVGYEAGSEVFIGYG 234
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVE 218
N L +NYGFV ++NP D + VE
Sbjct: 235 SHDNRTLFLNYGFVLDENPNDCITVE 260
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/207 (23%), Positives = 91/207 (43%), Gaps = 19/207 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ E+ G++S W PY+ + P+ W E E+ +L +P
Sbjct: 104 VSPLLALCTFLIAERFAGERSQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + LE+ K E EL T + Q D + +T++ + A+ V + V++
Sbjct: 156 RQKALEQ----KTEVQELHTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYM 211
Query: 138 QKVSLARRFA---LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ R A + L P LL +S + + A + ++ + + + +
Sbjct: 212 KHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +LL+ YGFV +NP+ + V
Sbjct: 272 YGPHDNQRLLLEYGFVAANNPHRSVYV 298
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 93/226 (41%), Gaps = 41/226 (18%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
KL+ L LY+M K G + W PY L Q ++S ++W++ EL L GS
Sbjct: 80 KLNSWLQLILYMM---KAGSMTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSM 129
Query: 77 TKAEILERAEGIKREYNE---------------LDTV---WFMAGSLFQQYPYDIPTEAF 118
+I G + +Y E DT + G L Y +D P ++F
Sbjct: 130 IVKKI--GKAGAEEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSF 186
Query: 119 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVV 177
+ + L L + A+VPL L A++ C A L A DD +
Sbjct: 187 S----EDEEDDEDIEHDDLYNEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTA 240
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 222
+P K GE + G PN L YG+V +E +D +VE +++
Sbjct: 241 IQPIKKGEQVYNTYGELPNCDFLRRYGYVENEGTEFD--IVEFSMD 284
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/345 (21%), Positives = 129/345 (37%), Gaps = 45/345 (13%)
Query: 38 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 97
S W PYI+ L +Q + W E GSP + G R+Y
Sbjct: 164 SEWGPYIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY----- 211
Query: 98 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLG 152
++ + + +P +FT++ F A VQS Q A AL+P
Sbjct: 212 -CYLYSMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFW 266
Query: 153 P--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
+ SK + + + + +K GE + ++ GP+ N++LL++ GF + N
Sbjct: 267 DFCNHSSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTN 326
Query: 211 PYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 268
+D E L + ++ ++ +R + V V E +++P+ R+ Y
Sbjct: 327 LHDSYPFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-Y 385
Query: 269 VSDTSEMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGY 311
E +++ L P+ ++ E L K +LA Y
Sbjct: 386 ALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASY 445
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
P TL EDE L N +R EKK+L+ ++ +I
Sbjct: 446 PTTLEEDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/200 (20%), Positives = 79/200 (39%), Gaps = 22/200 (11%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA+ L+ E+ +G +SFW PY+R L + P+ ++ +E + +
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQS 313
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
R + ++ +Q + P E + + C + +
Sbjct: 314 RCRMLVN----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNI 363
Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 197
PL P++ + + + D ++QLV R + G+++ + G N
Sbjct: 364 PGLGSTPLMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNP 423
Query: 198 KLLINYGFVDEDNPYDRLVV 217
+LL++YGF DNP+DR V
Sbjct: 424 QLLLDYGFTLSDNPHDRFEV 443
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 83/199 (41%), Gaps = 28/199 (14%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
YL+ + +GK SFW PYI L + A+ P W+E ++ L G+ I E
Sbjct: 107 FYLIQQYLKGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQ 164
Query: 86 EGIKREYNELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
+K EY + + G + Q Y+ FT F+ + V +S ++++
Sbjct: 165 NNVKSEYKQARKILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER-- 222
Query: 142 LARRFALVPLGP---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
L+P G PL ++ + L + A +L+ + Y G+ +
Sbjct: 223 ------LLPEGTKIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQV 276
Query: 188 VVWCGPQPNSKLLINYGFV 206
G + NS+LL+ YGF+
Sbjct: 277 FNNYGFKTNSELLLGYGFI 295
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 83/218 (38%), Gaps = 48/218 (22%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI- 81
L L ++YE G S W PY L + + + WSE ELA L S A+I
Sbjct: 85 SLILVMLYEHLNGDASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIG 137
Query: 82 ------------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPY 111
++AE ++ E N T+ GSL Y +
Sbjct: 138 REGANEAFLGQLVPVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAF 195
Query: 112 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 171
DI + V + + +L + ++PL L A + +C A L
Sbjct: 196 DIEPAT------PRKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQK 247
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
+++ +P KAGE I GP P S LL YG+V E+
Sbjct: 248 YLEMKALKPIKAGEEIFNDYGPLPRSDLLRRYGYVTEN 285
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L D +Q++ +P GE I G PNS LL YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119
Query: 207 DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
D +P D + + A E + YQD+ G V V +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174
Query: 257 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 316
+ D + L + + + + S P P ++ + L + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231
Query: 317 EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 350
E E ML N L+ K+R A + E+++L+ L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 102/238 (42%), Gaps = 35/238 (14%)
Query: 17 KLSELACLALYLMYEK---KQGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AY 71
+L E + LA +L + + G +S + Y++ L R+ G L W E ++
Sbjct: 63 ELQEWSLLAAFLAEQALDIENGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTL 115
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
L GSP++ ER + E+ ++P P + AF +
Sbjct: 116 LAGSPSQRAAYERQASVDGAIEEIRA----------EFPQLTPG------ALRWAFDVLF 159
Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
S ++ L + ALVP +L + C A + V L DR YK GE +
Sbjct: 160 SRLIRLP--NRGGELALVPWAD-MLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASY 216
Query: 192 GPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVA-QRNGKLSVQVF 246
G +P+++LLI+YGF E +NP D + ++ D +Y D + A ++ G V+ F
Sbjct: 217 GQRPSAELLISYGFAPEVGENPDDEYEITLGIDPND-RYADAKAAALEKIGLRPVESF 273
>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 674
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 88/201 (43%), Gaps = 23/201 (11%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 94 FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148
Query: 86 EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 136
K +Y T AG QY +D+ + AF+ ++ +A V+ +
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205
Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
VS+ L+P L+ + K A V VV +G+ I GP+ N
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNN 259
Query: 197 SKLLINYGFVDEDNPYDRLVV 217
+L++NYGF DNP D +V
Sbjct: 260 EQLMMNYGFCLPDNPCDYRIV 280
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/206 (22%), Positives = 89/206 (43%), Gaps = 13/206 (6%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+++ E+ L + +MYEK +G++S W PY+ + PL W E L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232
Query: 76 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 133
+++ + + ++ +W + Q++P ++P +++++ A AV S
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292
Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 191
L A+VP+ L + + L A + ++ R GE +V
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
G N++LL YGFV+ N + + V
Sbjct: 350 GELSNAELLRGYGFVEARNRNNHVQV 375
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 91/414 (21%), Positives = 153/414 (36%), Gaps = 100/414 (24%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
A L L +M+E+ QG S W Y+ L + ++P+ WS +L L G+ +I
Sbjct: 79 AGLILCMMWEEAQGASSRWSTYLASLPS-------SFDTPMFWSPDDLEELKGTSVVDKI 131
Query: 82 ---------------------------------------------------LERAEGIKR 90
+ER EG
Sbjct: 132 GRDGAEEDYRSKVVPTLQSRPDLFAPEALSRHYSLENYHLMGSRILSRSFSVERWEGHAA 191
Query: 91 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFE-------IFKQAFVAVQSCVVHLQKVSLA 143
+ E D+ + D+ TEA T + + +FV +
Sbjct: 192 DKQE-DSASSPVADTGRDEAMDVDTEAVTATAPEAEDGVDEPSFVVDDENDSDDEDEEDP 250
Query: 144 RRFALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
A+VP+ L A Y S+ + +D ++++ +P GE I G PNS LL
Sbjct: 251 ANVAMVPMADMLNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRR 309
Query: 203 YGFV--------DEDNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSV 243
YG V D NP D +VE A + + Q A+R G V
Sbjct: 310 YGHVDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDV 367
Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 303
+ + E +++P+ RL S + ++ S P V + ++L +A+
Sbjct: 368 FILETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANA 419
Query: 304 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
+ RLA YP ++ ED+ +LT+ +L+ K V +L EK++L+ L + +
Sbjct: 420 LERRLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 81/206 (39%), Gaps = 35/206 (16%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA ++++E+ +G+ S W PY+ L R S L + + +L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
++ EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208
Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 187
R P L + + ML VD + + L V P +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYD 213
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 99/242 (40%), Gaps = 21/242 (8%)
Query: 16 NKLSE--LACLALYLMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
N LSE L C L + + K+ G + L + +D R QL +PL ++ ELA
Sbjct: 44 NSLSERELVCTYLAMHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELA 101
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
L G+ A +R + E V G Y + + F ++
Sbjct: 102 LLKGTNMAAATTDREADWRSECERCRAVLGHWGEHLTWEHYLTASTHLSSRAFPSTLLSP 161
Query: 131 QSCVVHLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGES 186
+ ++ + + LVPL L A + ++ D+ +V P AG
Sbjct: 162 EPALI----PTPSSHPVLVPLIDSLNHARAHPVSWSVSPADNGAHTLSIVQHAPVAAGAE 217
Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
++ GP+PN++L++ YGF DNP D LV++ + D+R + + G L +F
Sbjct: 218 VLNNYGPKPNAELVLGYGFALPDNPDDTLVLKVS------GAADRREIWRAGGGLQRILF 271
Query: 247 HV 248
V
Sbjct: 272 DV 273
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 143/332 (43%), Gaps = 23/332 (6%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL+L+ E+ + S W Y L ++ +S + WSE EL+ L G+
Sbjct: 125 AVALFLIRERSR-SDSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTR 176
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQ 138
+ ++ E+ L+ + + +P I + F F I + +AF +++ +V +
Sbjct: 177 SVKQYVQNEFRRLEEEIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIP 234
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
L A V AY K A L + D L KAG+ + + + + N+
Sbjct: 235 LADLINHSARVTTDDH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNA 292
Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEA 256
+L ++YGF++ + + + ++ DP + DK +A+ NG F + +
Sbjct: 293 ELALDYGFIEPNTDRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPG 352
Query: 257 ISDMLPYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPA 313
+ L + LG +D ++S+ S G + PVS E + + + K LAGY
Sbjct: 353 LLPYLRLVALGG-TDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHT 411
Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
T+ ED+ L + L + +A + EK +L
Sbjct: 412 TIEEDQK-LKEAKLDSRHAIAVGIREGEKNLL 442
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 112/271 (41%), Gaps = 24/271 (8%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + +++ EL G+ + + I+ +Y +L + QYP P + FT
Sbjct: 107 SSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRDLVV------RVLVQYPDLFPLDKFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 176
+K A AV S + Q + L P +L +SS +C A+ D + ++
Sbjct: 161 LHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD-MLNHSSESKQCHVYDASSGD-LSVL 218
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
+ Y+AG+ + + G PN +LL YGF+ NP D + A + P ++ K+ +
Sbjct: 219 AGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWA 278
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPC 291
G S + ++D LP YLR+ + D S++ S+ +S
Sbjct: 279 LAGLDSTCTISL-------TLTDPLPKNVIRYLRIQRL-DESDLASIALGQAADEKISNS 330
Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAML 322
E VL L + + L + L + E L
Sbjct: 331 NEVQVLQSLVESIASLLGSFGTRLEKLEEQL 361
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 82/212 (38%), Gaps = 32/212 (15%)
Query: 38 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA-EILERAEGIKREYNELD 96
S WLPY+ L + PL+W+ + L G + ++ER E I+ N
Sbjct: 146 SHWLPYLASLPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTK 195
Query: 97 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----------- 145
V G F PT A T + + A ++ S K SL +
Sbjct: 196 VVENACGHYF-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWI 248
Query: 146 -FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
+ + L P L ++ K + + V + G ++ GP+ N LL N
Sbjct: 249 GLSEICLFPILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSN 308
Query: 203 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
YGFV E+NP D V L EDP Y K+ V
Sbjct: 309 YGFVIENNPEDYFKVFLGLQQEDPLYTAKKAV 340
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 134/340 (39%), Gaps = 60/340 (17%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP---LLWSETELAYLTGSPTK- 78
LAL L+YE+++G KS W P+I L P L WSE ELA L GS
Sbjct: 100 ALALRLLYERRKGAKSRWGPHI---------ALLPATPPHALLRWSEAELAELAGSDALE 150
Query: 79 -------------AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
+EI++++ E + + A ++P+ + E F++ +
Sbjct: 151 LANRWRSQVSSDFSEIVDKSRAAVEESDPGKQL-SAAVKASLRFPW-LDLEGFSWAV--- 205
Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY--------SSKCKAMLAAVDDAVQLVV 177
++ + VS++R+ A PP+ A+ DDA V
Sbjct: 206 -------SMIWSRCVSVSRKGA-----PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFV 253
Query: 178 DR---PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
R K G+ + + PN+ LL+ YGF + + + A L+ E P Y+ KR
Sbjct: 254 LRRTGNAKKGDELKLCYDGLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAA 313
Query: 235 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 294
+ KL + A + A D LP RL ++ ++ + L + S R
Sbjct: 314 LE---KLGLGATADGAAPFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEATAR 367
Query: 295 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 334
A L A LA Y + ED A L D P+ R+A
Sbjct: 368 AAAGDLVAACDALLAAYRGSEDEDAAALADPATPPRLRLA 407
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 16/143 (11%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
A +L +L E ++ LMYE +G +S W YI+ + ++ P+LW + +
Sbjct: 73 APILEKYELEEAVATSIALMYETSKGVQSKWYSYIQSM-------PTVIDLPILWDKESI 125
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
YL G+ + ++E E ++ +Y E + + +P FT E FK A
Sbjct: 126 EYLVGTDLEEIVIENIETLEEQYRE------DVEPIIKNHPETFKENIFTLESFKIASTI 179
Query: 130 VQSCVVHLQKVSLARRFALVPLG 152
V S ++ + +LVPL
Sbjct: 180 VSSRAFNIDQYHGE---SLVPLA 199
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 85/214 (39%), Gaps = 34/214 (15%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELA 70
+L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 100 MLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSISHPLLWPAETIQE 156
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L GSP I R + + ++ L T F+ G E +
Sbjct: 157 LLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLSEGD 200
Query: 128 VAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVV 177
V S V+ + SL LVP L SS + D + L
Sbjct: 201 VRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEA 260
Query: 178 DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 210
+ Y G+ + GP S+L ++YGFVD++N
Sbjct: 261 HKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
Length = 482
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 83/214 (38%), Gaps = 44/214 (20%)
Query: 22 ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+ L L L+YE Q G W PY+ L A +P+ WS EL L S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154
Query: 76 PTKAEI-LERAEGIKREY--------------------NELDTVWFMAGSLFQQYPYDIP 114
P A++ E A+ + R ++ + GS Y +D+
Sbjct: 155 PAVAKVGRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLD 214
Query: 115 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
E E + +V + + V +A +L ++ + DD +
Sbjct: 215 KEEDEDEDGEDGWVEDRDGKALMGMVPMA----------DILNADAEFNVHVNHGDDDLT 264
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 208
+ RP +AGE I+ + GP PNS+LL YG+V E
Sbjct: 265 VTALRPIRAGEEILNYYGPHPNSELLRRYGYVTE 298
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L + +++V +P KAGE I G PNS LL YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323
Query: 207 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 252
D NP D + V A L + K + +R + VF +
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381
Query: 253 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
+ E +++ ++RL + ++ S P P +++ VL D + RL YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437
Query: 313 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 350
TL EDEA+ L KR A + EK++L L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 106/219 (48%), Gaps = 35/219 (15%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 76
+S LAL+L+ ++ G KS W ++ L DR+ G ++ PL WS+ + LT P
Sbjct: 79 MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132
Query: 77 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 134
+L++ + + +Y++ T F+A +Y D P +A+ + A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179
Query: 135 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 182
++ ++ A + L P L+ +S + C+ +++ + L R Y
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
A E I + GP+ NS LL YGF +NP+D + + AL
Sbjct: 239 ADEEIFLCYGPRSNSVLLCEYGFTVPENPWDDVDISDAL 277
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/208 (24%), Positives = 85/208 (40%), Gaps = 24/208 (11%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L+ LA++LM EK + +KSFW PYI L + P ++E + L S
Sbjct: 91 RLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE-------YSCPTFFTEDDFRLLPNS- 142
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV- 135
+ + + +EY EL + M LF +AF F+ FK A+ A+++ +
Sbjct: 143 LRGKAKAKKYECHKEYKELAPFFKMLADLFPDQ-----EDAFNFKDFKWAWSAIKTRALD 197
Query: 136 ---------HLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 185
HL+ + PL + A +K + ++ + Y+
Sbjct: 198 VPIGRESCRHLRDAEDTPTPTMFPLVDSINHAAQAKIRHRYNEKSRCLESRTETVYRRHA 257
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYD 213
++ G N LL+ +GFV NP D
Sbjct: 258 EVMNSYGRADNDNLLLEFGFVVPGNPED 285
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 130/337 (38%), Gaps = 74/337 (21%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPT 77
EL LAL+LM E+++G++S W P++ L A SP+LW E + L SPT
Sbjct: 31 GELTGLALWLMAERQKGEESRWAPFLECLPE-------ATLSPVLWPEEVQDELLKNSPT 83
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
E R +++E++ +A +
Sbjct: 84 LKECRARRAALQQEWD---------------------------------VIAQRIATGDA 110
Query: 138 QKVSLARRFAL-VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
++ S L + LG P + +L A+ D +PN
Sbjct: 111 RRFSGGDELKLWITLGSP--GWGGTSDKLLMAIYDG---------------------RPN 147
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
+L + G V++DN D L V L D + K+ + + G VQ F + R
Sbjct: 148 GELAMATGRVEDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR---M 204
Query: 257 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP---- 312
+ +L YLRL ++D + + V S ++P E VL L + RL Y
Sbjct: 205 PTQLLAYLRLARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGMHM 262
Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
+ ED +L L ++R+A +L + EK +L L
Sbjct: 263 GSAEEDVKLLQRPGLTAQERLAARLRKAEKAILQGTL 299
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 94/221 (42%), Gaps = 25/221 (11%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L D +++V +P K GE I G PN++LL YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304
Query: 207 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
D NP D + ++A L V K + + + G E I
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLII-SAVSSTPEAVKDDEAKERID-WWLEEGGEDVFIL 362
Query: 259 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
D M+ +++L ++ ++ S P +E + D L + RLA
Sbjct: 363 DYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEKRLAE 418
Query: 311 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 350
YP T+ D+A+LT D L+ K + +L EK++L+ LQ
Sbjct: 419 YPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 83/216 (38%), Gaps = 44/216 (20%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L L ++YE G S W PY L + + + W+E ELA L S +I
Sbjct: 85 SLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKIG 137
Query: 83 ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
E+A+ +E + M GSL Y +D+
Sbjct: 138 KESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNLELMHKMGSLIMAYAFDV 197
Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
T E+ ++ A + L K +VPL L A + +C A L D +
Sbjct: 198 EPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCL 249
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
++ +P +AGE I GP P S LL YG+V ++
Sbjct: 250 EMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285
>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
Length = 468
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 80/199 (40%), Gaps = 29/199 (14%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ E + K S+W PYI L + A+ P +W E ++ L + + E
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161
Query: 86 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
IK+EY L V F + Q Y FT F+ + + Q H+
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217
Query: 142 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
L P G PLL +S+ + L VD QL+ + Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQV 272
Query: 188 VVWCGPQPNSKLLINYGFV 206
G + NS+LL+ YGF+
Sbjct: 273 FNNYGLKSNSELLLGYGFI 291
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 26/235 (11%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPT 77
S L L ++L+ E+ +G+ S W PYI L + P +++T +A L +G
Sbjct: 105 SSLVALCVFLVCERHRGEASDWFPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRR 157
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+AE E+ EG++ Y + +FM+ +P P E T+E + A+ ++ + V +
Sbjct: 158 RAE--EQREGLQHLY-AVHQDFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFM 211
Query: 138 QK-----VSLARRFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ +S +AL P LL + + KA ++ + +
Sbjct: 212 DRPSSSFLSGPDNYALAPF-LDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFIN 270
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKL 241
G N +LL+ YGFV NP+ + VE L E D +K + NG L
Sbjct: 271 YGSHDNQRLLLEYGFVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 89/207 (42%), Gaps = 19/207 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ E+ G S W PY+ + P+ W E E+ +L +P
Sbjct: 104 VSPLLALCTFLIAERFAGDCSQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + LE+ K E EL T S Q D + +T++ + A+ V + V++
Sbjct: 156 RKKALEQ----KTEVQELHTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYM 211
Query: 138 QKVSLARRFA---LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ R A + L P LL +S + + A + ++ + + + +
Sbjct: 212 KHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +LL+ YGFV +NP+ + V
Sbjct: 272 YGPHDNQRLLLEYGFVAANNPHRSVYV 298
>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
Length = 472
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/329 (20%), Positives = 133/329 (40%), Gaps = 67/329 (20%)
Query: 59 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 118
++ L + E E+ YL GSP +I+ E + Y++L F + + +
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220
Query: 119 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 170
++ F+ A + + +++ S ++ L P+ PP++ Y + A +D
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280
Query: 171 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGF----VDEDNPYDRLVVE-- 218
+V + + K GE I V +CG S LL++YG+ +D+ + + L+ E
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYLPNQIDDKSCVNVLMEELL 336
Query: 219 AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 273
+N +DP DK +++ ++ KL +
Sbjct: 337 ETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------------- 367
Query: 274 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYN-LHPK 330
M S+ L I + ++L+ L ++ YP T+ +D+ L +YN L +
Sbjct: 368 -MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEYNQLSAR 426
Query: 331 KRVATQLVRMEKKMLNAC---LQVTADMI 356
++A L EK++L+ LQ D I
Sbjct: 427 SKLAFNLAFQEKQILSNVYTKLQENIDTI 455
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 15/126 (11%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
+L +N++ + LA L+ EKK G+KS W+PYI L + + S + W E EL+
Sbjct: 105 VLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSM 159
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
+ S E +++ I+++++ F+A + Q P I TE E F A+ +
Sbjct: 160 IRCSAVHQETVKQKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALGE 211
Query: 132 S--CVV 135
C+V
Sbjct: 212 KVLCIV 217
>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L + +++V +P +AGE I G PNS LL YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316
Query: 207 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
D NP D + V A L ++ +Y + V + VF +
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374
Query: 253 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
+ E +++ + RL ++S+ +++ S P V P VL D ARL YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430
Query: 313 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 350
++ EDE +L+ +L+ K V +L EK++L L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 146/365 (40%), Gaps = 64/365 (17%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LAL L++EK G S W YIR L R A+++PL WS ELA L G+ ++L
Sbjct: 157 ALALQLLHEKSLGDDSRWAAYIRCLPRVE-----ALDAPLFWSSEELAELAGT----QLL 207
Query: 83 ERAEG----IKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
A G ++ + L F +LF D AF+ F AF ++S L
Sbjct: 208 ANAAGYDSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWAFGVLRSRA--L 265
Query: 138 QKVSLARRFALVPLGPPLLAYSSKC-------KAMLAAV---------DDAVQLVVDRP- 180
V AL+P G + + C +AAV +V L V++
Sbjct: 266 PPVDQGESIALIP-GIDMANHDGLCSQTWQLNNGGIAAVFGGRGGADGGGSVLLRVEKTK 324
Query: 181 ---YKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEA-ALNTEDPQYQDKRMVA 235
K GE I GP +S+ ++YGFVD V+ ++ +D DK V
Sbjct: 325 AGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFCSRPGYVLGPLSIPEDDVNAFDKMDVL 384
Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS----------EMQSVISSLGPI 285
G F + A ++ +M+ ++RL + + E ++IS
Sbjct: 385 SVAGLKESPAFTIRA--FEDPPPEMVVFMRLLNLKNDDAFLLEAIFRQECWALISD---- 438
Query: 286 CPVSPCMERAVLDQLADYF-KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
PVSP D AD + L Y + +D + D + P+ R+A ++ EK+
Sbjct: 439 -PVSP-------DNEADAGCEEALGAYATKIEDDRGVADDADASPRLRLAARVRMGEKQA 490
Query: 345 LNACL 349
L L
Sbjct: 491 LEEVL 495
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 85/214 (39%), Gaps = 34/214 (15%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELA 70
+L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 100 MLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAETIQE 156
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
L GSP I R + + ++ L T F+ G E +
Sbjct: 157 LLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLSEGD 200
Query: 128 VAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVV 177
V S V+ + SL LVP L SS + D + L
Sbjct: 201 VRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEA 260
Query: 178 DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 210
+ Y G+ + GP S+L ++YGFVD++N
Sbjct: 261 HKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 100/240 (41%), Gaps = 46/240 (19%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW----SETELAYLTGSPT 77
+ LAL+LM E+++G SFW Y+R L V++PL W +E E L G T
Sbjct: 83 SVLALHLMAERRKGDGSFWKQYLRTLPDD-------VDTPLRWLVEQAEEEFRLLDG--T 133
Query: 78 KAEILER--AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
+L R +++++ E L + +P + TFE + A ++ S
Sbjct: 134 MVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG--GVTFEDYLWAMSSIWSRSF 185
Query: 136 HLQK----VSLARRFALVPL------GPP-------LLAYSSKCKAMLAAVDD------A 172
Q+ S R A+VP+ P ++ + ++ + + +
Sbjct: 186 DYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGGLSMGIGEPGRARGT 245
Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
+++ R Y A E + G N+KLL +YGFV NPY L + DP + K+
Sbjct: 246 LRVSAGRDYAAREQFFILYGRYSNAKLLYSYGFVLASNPYGGLDYWVRVPQTDPGFAWKQ 305
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 13/135 (9%)
Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYS 159
+ QYP P + FT E +K A V S + + + L FA +L +S
Sbjct: 146 MLGQYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHS 199
Query: 160 SKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
S+ K + A + ++ + Y+AG+ + + GP PN++LL YGFV NP D +
Sbjct: 200 SEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVPGNPNDSYDL 259
Query: 218 EAALNTEDPQYQDKR 232
A + P ++ K+
Sbjct: 260 VLATHPMAPFFKQKQ 274
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 82/177 (46%), Gaps = 13/177 (7%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S +L++E +L G+ + + I+ ++ L LF Q+P P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 175
E +K A V S + LA ++ L P +L ++S+ K + + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGNLSV 217
Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
+ + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 218 LAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 13/177 (7%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S +L++E +L G+ + + I+ ++ L LF Q+P P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 175
E +K A V S + LA ++ L P +L ++S+ K + + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSV 217
Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
+ Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 218 FAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 75/365 (20%), Positives = 139/365 (38%), Gaps = 53/365 (14%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 138 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
+ +VP+ L A +++ A + + + +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262
Query: 184 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTE------------------ 224
G+ ++ + GP P S+LL YG+V E + YD + V L E
Sbjct: 263 GDQVLNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKEVIVSCLSLSAEAWKQVES 322
Query: 225 ---DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 281
D + +D ++ + +G+ +E +++ L+ E +
Sbjct: 323 QIDDEEIEDYFVIERDSGEPGPDGRFTAPAVLREVSPELVEQLK--------EFLKAVKK 374
Query: 282 LGPICPVSPCMERAVLDQL-ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
L + D + A+ K RLA YP ++ DE +L + +L ++R+A +
Sbjct: 375 LDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRRMAVVVRLG 434
Query: 341 EKKML 345
EKK+L
Sbjct: 435 EKKLL 439
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 155/392 (39%), Gaps = 56/392 (14%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
L N+L + +AL+LM E+ + SF+ P+I+ L Q + P+ W++++ A
Sbjct: 74 LQDKNELDQDEIVALFLMIERFKSSDSFFEPFIQSLPSQ-------FDLPIFWNDSDFAE 126
Query: 72 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
L G T +L + I R+ E D F A + Y+ T EI +
Sbjct: 127 LEG--TNVALLAK---IMRKQIEAD---FQAIHIPLLRAYEERLNLRTSEISISDYEWAL 178
Query: 132 SCVVHLQKVSLARRFALVPLGP------PLLAYSSKCKAMLAAVD---------DAVQLV 176
S + R F + G P L + + +D D +
Sbjct: 179 SII-------WTRAFGITRYGEYLRVLCPALDMFNHSVLVQEPLDEFIKYDHMKDVLAHC 231
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMV 234
V A + + G ++KLL +YGFV +E N ++ + + + DP ++ K+ +
Sbjct: 232 VVMETSANDPFYISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAI 291
Query: 235 AQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC 291
+ N Q + + + + L R+ +S + E + S VS
Sbjct: 292 LEGNAATRDQTYDFRGTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVR 347
Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACL 349
E AV + D + RLA +P +L +D L + ++ R + VRME KK+L +
Sbjct: 348 NELAVYAAIHDVCEKRLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVC 407
Query: 350 QVTADMIMLL--------PDVTVSPCPAPYAP 373
++ + LL PDVT P P
Sbjct: 408 RLMKEWRNLLENDSNIYPPDVTRQQQPQLSMP 439
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 103/245 (42%), Gaps = 41/245 (16%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L+ + + +L+Y+K G+ SFW PY+ L + P+ + E + YL S
Sbjct: 100 QLTTIQAITTFLIYQKHIGETSFWKPYLDILPNE-------YTHPVYFGEEDFLYLPHS- 151
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCV 134
+A I + + + Y EL + L + FTF+ ++ A+ V +S
Sbjct: 152 LRANIKAKKQECIKSYEELKPFFPSLEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLY 206
Query: 135 VHLQKVSLARRF--------ALVPLGPPLLAYSSKCKAML------AAVDDAVQLVVDRP 180
V + ++ R +LVP+ LL +S + L D + +
Sbjct: 207 VDDKGSTVLRNLDKSGLGVTSLVPM-VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDD 265
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQ 229
YK G+ ++ N LL+NYGFV DN D + ++E EDP+++
Sbjct: 266 YKRGDQVLFCYRRADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFR 325
Query: 230 DKRMV 234
++++
Sbjct: 326 RRKVL 330
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 139/356 (39%), Gaps = 26/356 (7%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
++ + + + LYL+ E+ S W P+++ L + L +SE ++
Sbjct: 88 HMIENTTIGRITAICLYLISERADSS-SHWKPWLQSLPPRFFHA-------LSYSEDDML 139
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIPTEAFTFEIFK 124
+ S K + + +++EY + F P ++ E FT+E F+
Sbjct: 140 HFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVTREDFTYEAFE 199
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPL---GP---PLLAYSSKCKAMLAAVDDAVQLVVD 178
A+ V + + + R VPL GP + +S K A +
Sbjct: 200 WAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDAQEHRCVFSAL 259
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
I + G N +LL N GF+ ++N + ++++ L+ + R +
Sbjct: 260 HKVAKNSPISIGVGMSSNMELLANRGFMMQNNGNNFVLMKFQLDRNSDMHASARESMMKQ 319
Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 298
LS + +V R E +L LR+ +S E S +L PV+ E
Sbjct: 320 LNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGKALA--TPVTLENEWRAYR 374
Query: 299 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
L + LA YP T+ EDE +LT R A L+R E+K++ ++ A+
Sbjct: 375 LLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRREEKLIYESIKTWAN 429
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 151/354 (42%), Gaps = 51/354 (14%)
Query: 24 LALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LA+ L+YEK +G KS W +I L R + L + EL L GS
Sbjct: 97 LAIALLYEKFVRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQ 149
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
+ E + +Y L + LF+ P D+ E F+ E +K A + S
Sbjct: 150 QMEEKVAHDYARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFG 204
Query: 139 KVSLARRF--ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGP 193
V +A++ A+VP+ +L + + + M D + Q LV + + AG + + GP
Sbjct: 205 DVPVAKQSFKAMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGP 262
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHA 250
N KLL YGFV NP+D VE L ++ +Q+K + NG HA
Sbjct: 263 LSNHKLLALYGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHA 313
Query: 251 GREKEAISD-----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADY 303
E ++D +L R+ + E + + + +S E+ L +L
Sbjct: 314 TNPFELVADESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYT 373
Query: 304 FKARLAGYPATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 351
+ L +P ++ ED+ +L TD NL+ +R+A + R +K +L+ + +
Sbjct: 374 LEKMLESFPTSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 144/358 (40%), Gaps = 56/358 (15%)
Query: 24 LALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LAL L+ E+ + K S W PY+ L DR +PL ++ ++ L + L
Sbjct: 151 LALALIMERFRAK-SDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPAL 201
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQ 138
+ + I R+Y + F Q D + FT+++F+ A V + V+L
Sbjct: 202 KLCKHIARQYGFIRR--------FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLA 253
Query: 139 KV-SLARRFALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESI 187
+ + AL+PL P A ++C A A ++ ++ + R A I
Sbjct: 254 EFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPI 313
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
+ G + +++ L++ GFV NP+ + L P Y+++ + + G + F
Sbjct: 314 FIVYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFS 373
Query: 248 VHAGREKEAISDMLPY----LRLGYVSDTS------------EMQSVISSLGPICPVSPC 291
RE A + P + L VS + + + + + P C
Sbjct: 374 FGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELC 433
Query: 292 --MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
ER LA K L YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 434 ARTER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 90/416 (21%), Positives = 165/416 (39%), Gaps = 83/416 (19%)
Query: 1 MFHLLFYYAAELLTTNKLSELAC-----------LALYLMYEKKQGKKSFWLPYIRELDR 49
+FHL ++ ++ T++L+++ L + ++YE G++S W Y + L
Sbjct: 62 LFHLP-HHIVLMVKTSRLNQILADDLKNLGPWLSLVVVMIYEYSLGEQSNWKQYFQVLPS 120
Query: 50 QRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEGIKREYNEL----- 95
+ ++ + WSE E + L S + +I E+ + R + +L
Sbjct: 121 K-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLPLVRAHPDLFPPID 173
Query: 96 -------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
DT + GSL Y +DI + ++ L K
Sbjct: 174 GVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYLTDDEE--QLPK-- 229
Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
+VPL L A + + A L + A+ + +P KAG+ I G P S LL
Sbjct: 230 -----GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIFNDYGELPRSDLLR 284
Query: 202 NYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
YG+V DN VVE L N ED +Y +++ Q ++ + +
Sbjct: 285 RYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQLKLLDQL--EILEDGYCILRP 341
Query: 252 REKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
++ + D+LP L D+ E+Q ++S P+ E + L D +++
Sbjct: 342 SPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGAREARI---LLDAAQSK 398
Query: 308 LAGYPATLSEDEAMLTDY-------NLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
+ Y T+ ED+ +L + ++ +A Q+ EK++L A L + D +
Sbjct: 399 MGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQALLMMLQDFL 454
>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
[Ostreococcus tauri]
Length = 446
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 138/347 (39%), Gaps = 53/347 (15%)
Query: 35 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 94
G++S + P++R L + ++ + W E EL L GS A + + EY+
Sbjct: 85 GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137
Query: 95 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 146
LD F D P EAF TFE F A V S + L+ S +
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTS--A 185
Query: 147 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
++ + P+L A S K A +AV++ +K + +P+ L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245
Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDM 260
YGF+ E NP + + V L+ D + K + +R+G + + F + D+
Sbjct: 246 QYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKV---RGLDYDL 302
Query: 261 LPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 315
L R+ D SE+ S S + + +AVL + L GY TL
Sbjct: 303 LAAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLITSLDGYGTTL 358
Query: 316 SEDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 357
ED + + +N PKKR L+RM +K L +AD +
Sbjct: 359 GEDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 30/205 (14%)
Query: 24 LALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TG 74
L + L++E K G+KS W PY++ L ++ QL + W++ EL L G
Sbjct: 90 LIIALLFEWKVVGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVG 144
Query: 75 SPTKAEILERAEGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFK 124
+ E+ E I K E D+ W + S+ Y +D+ + + E K
Sbjct: 145 ADKAKEMFENVVDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--K 200
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
+ + + V + ++PL L + + KC A L + +++ + K G
Sbjct: 201 EGGTDEEEDDNESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKG 258
Query: 185 ESIVVWCGPQPNSKLLINYGFVDED 209
E I G PNS++L YG+++ D
Sbjct: 259 EQIYNIYGDHPNSEILRRYGYIEPD 283
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 95/402 (23%), Positives = 163/402 (40%), Gaps = 58/402 (14%)
Query: 10 AELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
AELL NK A L + +MYE K SFW PY+ L +++P+ W
Sbjct: 82 AELLLENKKEACALVGWMPLVVAMMYEI-TNKDSFWRPYLDLLPE-------TLDTPMFW 133
Query: 65 SETELAYLTGSPTKAEI-LERAEGIKRE----------------------YNELDTVWFM 101
++ +L L G+ T + + E AE I E Y+ + +V M
Sbjct: 134 NDDDLELLEGTSTLSHLGKEDAETIFTEQIVPFMKLHPTHFDLKVHNMALYHRVASV-IM 192
Query: 102 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 161
A S + D E E A C ++ + + A+VPL +L + +
Sbjct: 193 AYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRM-EKIAMVPLAD-MLDHKTG 250
Query: 162 CK-AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
C A L + + P AG + G NS+LL YGF+D+ N ++ + +
Sbjct: 251 CNNARLFYGKTTLAMSCIEPCAAGHELYNTYGDLSNSELLRKYGFIDDVNEHNSVDIPVE 310
Query: 221 L---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----REKEAISDML-----PYLRLGY 268
+ E + ++ M A + FH+ A +E EA +L L
Sbjct: 311 MLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQELEASIALLFQSPKQVRALRA 370
Query: 269 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 328
+ D E++S +++L V+ C R V + L + + R Y T EDE L + +L
Sbjct: 371 LDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRAEEYTTTREEDEERLKESDLT 424
Query: 329 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 370
++++A ++ E+ +L+ + + + P + PAP
Sbjct: 425 HRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEPAP 466
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 67/317 (21%), Positives = 125/317 (39%), Gaps = 34/317 (10%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA L+ EK +G S W PYI L + + L ++ ++ L G+ + L
Sbjct: 159 LANALVVEKSRGADSIWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALR 211
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ I R+Y +L + S + +E+++ A V + +L +A
Sbjct: 212 QCRMIARKYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIA 270
Query: 144 RR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
+ AL+P K + + ++ KAG ++ G +PN
Sbjct: 271 TKDDGNSPISALIPCWDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPN 330
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-- 254
+ LL++ GFVD +N D + + L+ D +A++ +L ++ H G +
Sbjct: 331 ADLLVHNGFVDPNNNKDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVL 383
Query: 255 ---EAIS-DMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYF 304
E IS +L ++R+ +S S+++ + L C + +E F
Sbjct: 384 PAPEYISGQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRF 443
Query: 305 KARLAGYPATLSEDEAM 321
K L ATL E + +
Sbjct: 444 KLLLGVLEATLREADEL 460
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 88/207 (42%), Gaps = 30/207 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L LM EKK GK S W Y+ L + PL W +L L G+ ++E
Sbjct: 83 LILALMTEKKLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIE 132
Query: 84 RAEGIKR----------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 133
+ G K ++N + + F+ + + P++ + + ++ A V+ S
Sbjct: 133 KMNGCKAMPDRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSF 190
Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 191
+ + A+VP+ L + L A A++++ GE ++
Sbjct: 191 TIGEDRFQ-----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSY 245
Query: 192 GPQPNSKLLINYGFVDED-NPYDRLVV 217
G PNS+LL YGFV+ D NP+D L V
Sbjct: 246 GDLPNSELLRRYGFVETDPNPHDCLEV 272
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 121/302 (40%), Gaps = 18/302 (5%)
Query: 62 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 121
+ ++E EL GS + + ++ ++ +L L Q+ P + FT E
Sbjct: 993 IFFAEDELQVCEGSSLHTLTTQLEQRVQDDFRQLLV------QLLSQHRDLFPLDQFTIE 1046
Query: 122 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLVVD 178
+K A + S + + PL +L +S +C A D + ++
Sbjct: 1047 DYKWALCTIWSRAMDFAVSDTTSVRLVAPLAD-MLNHSLDVKQCHAYDPTSGD-LSILAA 1104
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
+ Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K +
Sbjct: 1105 KDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALA 1164
Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAV 296
G S + A + ++L YLR + D +++ + L G V+ E V
Sbjct: 1165 GLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVNDGNEIQV 1221
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
L L D + L G+ L + EA L Y A Q+ E+ +L + D
Sbjct: 1222 LQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRAKKTAED 1281
Query: 355 MI 356
M+
Sbjct: 1282 ML 1283
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)
Query: 36 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 95
+ S W PYI L G +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 74 QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128
Query: 96 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 155
+ LF + + E F + V S + +++ LV + P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172
Query: 156 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229
Query: 211 PYDRLV 216
P +RL+
Sbjct: 230 PENRLI 235
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 72/355 (20%), Positives = 136/355 (38%), Gaps = 54/355 (15%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG---------RGQLAVESPLLWSE 66
N LS L++ L+YE + KKSFW PY+ + R + L VE + +E
Sbjct: 92 NSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATE 151
Query: 67 TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
A +A L + +K ++ W A + +P ++
Sbjct: 152 KATAKCQSEWKEAGSLMKELELKPKFRSFQ-AWLWASATISSRTLHVPWDS--------- 201
Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ--- 174
V L ++ P GP L+ + + ++ V
Sbjct: 202 -AGCLCPVGDLFNYDAPGDYSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYC 260
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKR 232
L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 261 LYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWPKDS 320
Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 292
+ ++GKLS ++ LRL + + +SV+ + +S
Sbjct: 321 LYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQISVKN 365
Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 366 EILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 11/116 (9%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
K E L+++L+ E G SFW PYI L R + + L +S EL LTGS
Sbjct: 76 KSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVLYFSVDELQLLTGSS 128
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
E L+ I R+Y + F L + PY + FT+++++ A AV +
Sbjct: 129 VLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMT 180
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 6/140 (4%)
Query: 82 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
LE E +++ E W + F+ D+P E +T+ + + K
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168
Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
R AL+P+ CK +A D +V DR YK GE + + N +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226
Query: 202 NYGFVDEDNPYDRLVVEAAL 221
YGF+ +++ D + ++ A+
Sbjct: 227 EYGFIPDESLDDDVYIDDAV 246
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 9/175 (5%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + ++E EL G+ + + I+ +Y +L + P P FT
Sbjct: 107 SSIFFTEDELEVCAGTSLYTITKQLKQRIEDDYKDL------IARVLGPRPDLFPLNKFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA--MLAAVDDAVQLVV 177
+K A V S + + + L P +L +SS+ K + A + ++
Sbjct: 161 IHHYKWALCTVWSRAMDFELYDGSSMRLLAPFAD-MLNHSSESKQCHVYDASTGNLSILA 219
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
+ Y+AG+ + + G PNS+LL YGFV DNP D + A + P ++ K+
Sbjct: 220 GKDYEAGDQVYIHYGSIPNSRLLRLYGFVIPDNPNDSYDLVLATHPMAPFFEQKQ 274
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 77/333 (23%), Positives = 130/333 (39%), Gaps = 80/333 (24%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N+ E + + LYL K +K F PYI L PL +SE EL L G+
Sbjct: 78 NRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT-------TFSCPLSYSENELKMLKGT 129
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
++L E K +L + +L QYP T F+ F Q V
Sbjct: 130 ----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP----TRFQQFDDFYQRLVWAH---- 174
Query: 136 HLQKVSLARRFALV---PLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKA---- 183
+V +R F ++ P+G P +S+ + V V +R +
Sbjct: 175 ---QVFWSRAFLVIYPDPIGDVASLIPFADFSNH------NTETKVTYVSNRQTQTFSLQ 225
Query: 184 GESIVVWCGPQ--------PNSKLLINYGFVDEDNPYDRLVV-----EAALNTEDPQYQD 230
V+ CG Q PN K+L+ YGFV +NPYD +++ E + + ++
Sbjct: 226 TNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDEVLLRINFKERHFEKQVEESEE 285
Query: 231 KRM--------------------VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 270
+M + Q + V F + REKE +D+L LR+ +S
Sbjct: 286 SKMEVENKENERMEVEEEDNEDEITQILKREGVDRFDYYLTREKELPTDLLRVLRIVNLS 345
Query: 271 --DTSEMQSVISSLGPICPVSPC-MERAVLDQL 300
+ ++ + L + P++ R++++Q+
Sbjct: 346 LVEANQYSQALLDLSYVSPINEIKATRSLMEQI 378
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 97/233 (41%), Gaps = 23/233 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK +KS W PY+ L + S + E ++ L P
Sbjct: 104 ISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSCPVCLEQKIVNLFPEPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
+ + E+ + ++ + +F LF P D+ + F ++ FK A+ + + V
Sbjct: 156 RRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQAFKWAWCTINTRTVYM 211
Query: 136 -HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
H Q+ +R L P LL + + + KA ++ + +
Sbjct: 212 KHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYEITTVTQCHHYNEVFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
GP N +LL+ YGFV DNP+ + V ++ +DK QR KLS+
Sbjct: 272 YGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDK----QRPKKLSI 320
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
A LA+ LM + G + W Y L AV+S ++WS+ EL L GS +
Sbjct: 47 ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99
Query: 82 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 140
+ R + KREY+ L F A L + P EA++F++F+ A+ V
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 185
+AR F L L +C A+L +D DA + VV+R A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184
>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
Length = 427
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 80/343 (23%), Positives = 139/343 (40%), Gaps = 46/343 (13%)
Query: 15 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
+N+ +E A LA+ L E+++G S + P++ L+ T
Sbjct: 79 SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121
Query: 75 SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
S E L AE + R Y+E D V W A F+ +P + FT F++A V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180
Query: 131 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
++ + + R LVPL L+ +S + VDD + VD ++AG+ +
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239
Query: 191 CGPQPNSKLLINYG----FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL--SVQ 244
G +++ +G + +E+N D + + D+ + + G +
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARDVITF---------TFPDEVHLKEEIGSCGPAED 290
Query: 245 VFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPIC--PVSPCMERAVLDQL 300
+ G A ++++ LRL + T SEM+ L + P+S E AV D L
Sbjct: 291 IGFTRDG----ASAELMCALRLVSANATEWSEMRKPNFDLQSLKNRPLSEESEVAVYDAL 346
Query: 301 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 343
L YP + +DE +L L +R A ++ EK+
Sbjct: 347 FATLTDLLNSYPYSDVDDEHLLRGDRLADDERRAVKIRLREKR 389
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 86/224 (38%), Gaps = 46/224 (20%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
A +L+ + G KS+W PYI L Q E+ LLW L G+ KA
Sbjct: 128 AFFLLEQLVLGDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFA 180
Query: 83 ERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQ 138
A K Y + G L Q + A+T+E F+ A +S +
Sbjct: 181 AEAARWKEMY--------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVL 232
Query: 139 KVSLARRFALVP------------LGP----------PLLAYSSK---CKAMLAAVDDAV 173
+L AL+ LG PL+ S+ K A V
Sbjct: 233 DATLPADKALLQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFV 292
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
L V PY++G+ I GP+ N LL+ YGF DNP+D +V+
Sbjct: 293 GLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336
>gi|426197159|gb|EKV47086.1| hypothetical protein AGABI2DRAFT_203917 [Agaricus bisporus var.
bisporus H97]
Length = 491
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 97/225 (43%), Gaps = 33/225 (14%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L D +++V +P ++GE I G PN++LL YG V
Sbjct: 244 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIRSGEQIWNTYGDLPNAELLRRYGHV 303
Query: 207 D--------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
D NP D + ++A L + P+ V K + + + G E
Sbjct: 304 DFLSLPSEGHGNPGDVVEIKADLIISAVSSIPE-----AVKDDEAKERID-WWLEEGGED 357
Query: 255 EAISD--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
I D M+ +++L + ++ S P +E + D L +
Sbjct: 358 IFILDYEYDLPPVMISFVKLLLLPQADWEKAREKS----KPPKSKLEGILYDILISTLEK 413
Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 350
RLA YP T+ D+A+LT N P + +VR+ EK++L+ LQ
Sbjct: 414 RLAEYPTTIETDKALLT--NDTPLNKKNAIIVRLGEKEILHGILQ 456
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
A+VP+ L + A L D+ V +++ KAGE I G PNS LL YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330
Query: 206 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
+D NP D LVVEAA + QD+ V + VF V
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
G + E +M+ RL + +E + + G + P M+ + D ++RL
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442
Query: 310 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 350
YP ++ EDE +L D + L +++A + EK++L L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484
>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
Length = 523
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 99/247 (40%), Gaps = 61/247 (24%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N++ + + ++L+ E+ +G+ SFW PY+ L G +PL + + EL L G+
Sbjct: 91 NEIDDRLLVMIFLIIERARGRASFWAPYLEMLPSGFG-------TPLWFEDEELMELDGT 143
Query: 76 P----TKAEI--------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
TKA++ L R + + E+ E +W A +F +IP A
Sbjct: 144 TLFEATKAQVFFPSTFVSTCMSLYLFRPDDRELEFQEF--LW--ANCIFWTRALNIPCPA 199
Query: 118 -----------------FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 160
+ F++ + + VS LVP G ++
Sbjct: 200 SFVTSSSPEVAKDDGNRLVIYVLPHPFISCSA-----KDVSTIWIEGLVP-GIDFCNHTR 253
Query: 161 KCKAML---------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
+ + + V ++ L+ D + G +++ G + N +LL YGFV+EDN
Sbjct: 254 RASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINYGDKGNEELLFLYGFVEEDNS 313
Query: 212 YDRLVVE 218
D ++V
Sbjct: 314 NDYVMVH 320
>gi|358380690|gb|EHK18367.1| hypothetical protein TRIVIDRAFT_47382 [Trichoderma virens Gv29-8]
Length = 479
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 49/207 (23%), Positives = 85/207 (41%), Gaps = 32/207 (15%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
L+L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 91 LFLIKELLRGQESFWYPYIQALPQPEDFDDWAL--PPFWPEEEAELLEGTNVEIGLEKIR 148
Query: 86 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-------CVVHLQ 138
E + RE+ + + + Q+ D ++ T E+++ A+ S + Q
Sbjct: 149 EDLGREFRDARNLLIAS----QKDAEDDFSDHLTRELYQWAYCIFSSRSFRPSLVLSEEQ 204
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAV-------------------DDAVQLVVDR 179
+ SL ++ L + M V AVQL V R
Sbjct: 205 QQSLPDGVSVNDFSVLLPLFDIGNHDMTVHVRWDLAAGDEAAAGAGVRGSGAAVQLKVGR 264
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV 206
+K G+ I P+ N++LL+ YGF+
Sbjct: 265 EHKPGQQIFNNYSPKTNAELLLGYGFM 291
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 37/72 (51%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
++PL S+K + V++ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 206 VDEDNPYDRLVV 217
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 82/216 (37%), Gaps = 44/216 (20%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L L ++YE G S W PY L ++ + W+E ELA L S +I
Sbjct: 85 SLILIMLYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIG 137
Query: 83 ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
ERA+ +E + + M GSL Y +D+
Sbjct: 138 KEGANEVFIEQLLPVIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDV 197
Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
A + + + A + L K +VPL L A +C A L D +
Sbjct: 198 EP-AISDKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGL 249
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
++ +P +AG+ I GP P S LL YG++ ++
Sbjct: 250 EMKALKPIQAGDEIFNDYGPLPRSDLLRRYGYITDN 285
>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
Length = 442
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 79/195 (40%), Gaps = 20/195 (10%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ E +GK SFW PY+ L + P W E ++AYL + I E
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170
Query: 86 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 137
+K E+ + L F + + Y FT F+ + + + H+
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230
Query: 138 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWC 191
Q V L F+++ PLL ++ + + D DA LV Y G +
Sbjct: 231 PQSVEL-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNY 286
Query: 192 GPQPNSKLLINYGFV 206
G + NS+LL+ YGF+
Sbjct: 287 GLKTNSELLLGYGFI 301
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 140/360 (38%), Gaps = 50/360 (13%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI- 81
L + + E G +S W PY L ++ ++ + WS+ EL +L GS +I
Sbjct: 85 SLIVVMALEYLDGSQSRWKPYFDVL-------PVSFDNLMFWSDRELRHLEGSTVVGKIG 137
Query: 82 ---------------LERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
+ER K NE L + GS Y +D+ T + + +
Sbjct: 138 KEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMAYGFDLETSSDQAKNDGE 197
Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 185
+ L K +VPL L A + + A L DD V + +P KAGE
Sbjct: 198 EWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFYEDDKVVMKTIKPVKAGE 250
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDN---------PYDRLVVEAALNTEDPQYQDKRMVAQ 236
+ G P + LL YG++ DN P D + A L T+D ++ A+
Sbjct: 251 ELYNDFGSLPRADLLRRYGYL-TDNYAQYDVVEIPADLIKERAGLRTQD--VDERWQYAE 307
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
G L A E+ + L L +E + V + P + +
Sbjct: 308 EQGVLDDGYDVSRASSEEGQFPEELCVLLNLLALPRAEFEKVKNKDKIPKPDLTTNAKKL 367
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-VATQLVRMEKKMLNACLQVTADM 355
L + Y R A YP + + M +D +L+ ++R +A +++ EK++L + +++
Sbjct: 368 LRTILVY---RYAAYPGNVDQ---MHSDVSLNDRRRKMAIVVIQGEKQVLQEAVDAISEI 421
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 11/164 (6%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + ++E EL TG+ A + I+ +Y L L Q+ P FT
Sbjct: 107 SSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKALVV------RLLIQHRDLFPLSKFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVV 177
E +K A V S + L P +L +SS + A + + ++
Sbjct: 161 IEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFAD-MLNHSSDVRQCHAYDPLSGNLSILA 219
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEA 219
+ YKAG+ + ++ G PN++LL YGF+ NP D LV+E
Sbjct: 220 GKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPNDNYELVLET 263
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 113/271 (41%), Gaps = 28/271 (10%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L+ + L L+L+ EK +G S W +I L ++ +P+ LA L +
Sbjct: 97 QLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTPVFLGSKLLARLPETM 149
Query: 77 T-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 134
KAE + I+R + L + + +L +E FT+ +F A+ AV + C+
Sbjct: 150 CRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTWHLFVWAWTAVNTRCI 205
Query: 135 VHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVV 189
K F L P L + KA + + ++V + Y+ + + +
Sbjct: 206 --FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFEIVTNNNYEPNDQVFI 263
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
G N KLL+ YGFV DNP D + + T+ Y ++ +Q+N + +
Sbjct: 264 SYGSHDNKKLLLEYGFVLADNPNDVVAI-----TKGHLY---KLNSQQNDTVLYFATKLS 315
Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVIS 280
EK+ ISD + G + + V+S
Sbjct: 316 FLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 22/202 (10%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
PL W++ EL +L G+ I ER ++ ++ + V L ++ P P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298
Query: 121 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKC------------KAMLA 167
E + AF S L+ V+ +G P+ +C A +
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358
Query: 168 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
D +V+ + GE + GP+ N +LL+ YGF +N D + ++ + D
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQLTVGN-D 417
Query: 226 PQYQDKRMVAQRNGKLSVQVFH 247
P + K + + +G H
Sbjct: 418 PDGEAKLAILRWHGLSLTHFLH 439
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 78/339 (23%), Positives = 134/339 (39%), Gaps = 50/339 (14%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + +++ EL GS A I+ +Y L L Q+ P + FT
Sbjct: 125 SSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRALVV------RLLVQHQDLFPLDKFT 178
Query: 120 FEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDA 172
E +K A V S + + + L FA +L +SS+ + A +
Sbjct: 179 IEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA------DMLNHSSEVRQCHAYDPLSGN 232
Query: 173 VQLVVDRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 227
+ ++ + Y+AG+ + ++ G PN++LL YGFV NP D + + P
Sbjct: 233 LTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLYGFVMPGNPNDSYDLVLETHPMAPF 292
Query: 228 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV----SDTSEMQSVISSLG 283
++ KR + G S + ++D LP LGY+ SD S++ S+
Sbjct: 293 FEQKRKLWDLAGFDSTSTISI-------TLTDPLPKNVLGYLRIQRSDESDLASIARQR- 344
Query: 284 PICP----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---VATQ 336
I P +S E VL L + F L + L E L + ++P + A
Sbjct: 345 -IDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE-GVYPSRGNAWAAAH 402
Query: 337 LVRMEKKMLNACLQVTADMIMLLPDVTVS-----PCPAP 370
+ E+++L + DM+ + + + P PAP
Sbjct: 403 VSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPAPAP 441
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 133/316 (42%), Gaps = 28/316 (8%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL--LWSETELAYLTGS 75
LS LALY+++ + +G+ + +RQ L E L +++ EL GS
Sbjct: 71 LSVEDTLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYTLSMYFTDEELRVCAGS 124
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
+ +Y +L T FM LF P + F+F+ +K A ++ S
Sbjct: 125 SLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKFSFQHYKWALSSIWSRG 177
Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCG 192
+ +S L+ +L ++S K A L V R Y+ G+ + ++ G
Sbjct: 178 MDF-TISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVLACRDYEVGDQVFIYYG 236
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHA 250
NS+LL YGFV DNP D + ++ P Y+ K QR KL+ ++ +
Sbjct: 237 NVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMAPLYEQK----QRLWKLAGLDEISTIPL 292
Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLG--PICPVSPCMERAVLDQLADYFKARL 308
+ +L YLR+ + D S++ ++ + +S E +L L+ +A L
Sbjct: 293 SLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIATESYTKISDENESQILLFLSQSIEALL 351
Query: 309 AGYPATLSEDEAMLTD 324
G+ +L + E L +
Sbjct: 352 EGFEISLEKLETQLAE 367
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 47/198 (23%), Positives = 81/198 (40%), Gaps = 27/198 (13%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L + + +AL L+YE G KS + YI+ L R GQ + PL WS E L S T
Sbjct: 137 LGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHPLFWSTAEQGVLAKSST 193
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + + +Y W + +L + + ++F F+ A V S
Sbjct: 194 RNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLSDFEWAVAVVLSRSFFA 247
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDAVQLVVDRPYKAGESIV 188
+ R A PLL +++ + + ++++ DR G+ IV
Sbjct: 248 EN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKGLKVIADRDTDKGQEIV 298
Query: 189 VWCGPQPNSKLLINYGFV 206
+ GP+ + L ++GFV
Sbjct: 299 ISYGPKSGIEFLEDHGFV 316
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 47/198 (23%), Positives = 81/198 (40%), Gaps = 27/198 (13%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L + + +AL L+YE G KS + YI+ L R GQ + PL WS E L S T
Sbjct: 131 LGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHPLFWSTAEQGVLAKSST 187
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + + +Y W + +L + + ++F F+ A V S
Sbjct: 188 RNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLSDFEWAVAVVLSRSFFA 241
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDAVQLVVDRPYKAGESIV 188
+ R A PLL +++ + + ++++ DR G+ IV
Sbjct: 242 EN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKGLKVIADRDTDKGQEIV 292
Query: 189 VWCGPQPNSKLLINYGFV 206
+ GP+ + L ++GFV
Sbjct: 293 ISYGPKSGIEFLEDHGFV 310
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 118/274 (43%), Gaps = 49/274 (17%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L+ LAL L+ E+++G++S + Y++ L + SPL W+ EL+ + G+
Sbjct: 136 LARWTTLALTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT-- 186
Query: 78 KAEILERAEG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQ 131
++L+ A G ++ Y L T +F ++ +D+ AF+ + F+ AF ++
Sbjct: 187 --QLLDNAAGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILR 237
Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQL 175
S ALVP G L+ +SS +A + AV DD V
Sbjct: 238 S---RTMAPCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSA 293
Query: 176 VV--DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
V DR E + V P+ ++ +++GFVD P + ++ +DP DK
Sbjct: 294 RVECDRALNVNEPLYVNYNPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKL 353
Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 266
V G F + A + + D+ +LRL
Sbjct: 354 DVLDVCGLGETPTFTLRAYSDPD--PDLRTFLRL 385
>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
Length = 417
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 74/309 (23%), Positives = 133/309 (43%), Gaps = 37/309 (11%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 84
A+YLMY K S+W Y L +Q + + + + E+ L S + R
Sbjct: 93 AVYLMY-SLLNKDSYWHQYTSILPKQ-------FTTSIYFDQDEMKELQLSKLRYFTESR 144
Query: 85 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
GI+R YN ++ SL ++ + +TFE+FK A + S L +
Sbjct: 145 LSGIERHYN---VIFKKLSSLNDEFK----KKEYTFELFKWALSCIWSRAFSLS----SD 193
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQP---NSKL 199
+VPL A K K+ + A QL+ + + GE + G N+++
Sbjct: 194 DGGMVPLADMFNAIE-KAKSKVRPDSRADQLIYYASKDIERGEQVFTPYGVYKTIGNAQM 252
Query: 200 LINYGFVDEDNPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
L++YGF D+P + ++ L+ ++D Y D ++ + V+ F++ + +
Sbjct: 253 LMDYGFA-FDDPSEGDTIQLTLDNFSDDELYIDTKIDLLEQLDI-VREFNL---KRNQLP 307
Query: 258 SDMLPYLRLGYVSDTSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
++L Y R+ + + +E+Q + PVS E+ L L++Y L Y
Sbjct: 308 QELLIYARVKNLKE-NELQLAKEHYRNDDNRNKPVSRRNEKTALRYLSNYLSRYLDSYET 366
Query: 314 TLSEDEAML 322
TLS+D +L
Sbjct: 367 TLSDDLELL 375
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 92/239 (38%), Gaps = 44/239 (18%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
+AL L+YE++ G S PY+ L + +PL WSE +L L E+
Sbjct: 152 MALNLLYERQLGPASKLAPYVAALP-------VDFSTPLSWSEAQLQALCYPQLIREVAT 204
Query: 84 RAEGIKREYNEL------------DTVWFMAGSLFQQY--PYDIPTEAFTFEIFKQAFVA 129
+ EG+KR + EL D +W + + + PY PT + F
Sbjct: 205 QREGLKRLHAELAVSTPGTPITEQDLIWALQAVRSRAFSGPYAGPTWRSRLKTFGALGAL 264
Query: 130 VQSCVVHLQKVSLARRFALVPL-GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKA 183
+ + ++ A AL L +L+ K AM VD VQ V+ Y A
Sbjct: 265 AAASITVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFA 324
Query: 184 -------------GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV----EAALNTED 225
GE + + G Q N LL YGFV+ P+D + AAL D
Sbjct: 325 DRFSVRCQSYFSKGEQVFISYGKQSNDSLLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 105/231 (45%), Gaps = 27/231 (11%)
Query: 24 LALYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+ LYL+ + Q + + W YIR L R + P +W+E E L G+
Sbjct: 201 IMLYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSL 253
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+A + + + +E++ L V + +P+ E+ + +V V +
Sbjct: 254 EAALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RS 305
Query: 138 QKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVVDR--PYKAGESIVVWCG- 192
+ + L R A+VP G ++ +SSK A D D V L++ P ++GE + + G
Sbjct: 306 RCLELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGD 364
Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
+P S++L +YGF+D +N D+L + +DP + K ++ L++
Sbjct: 365 AKPASEMLFSYGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 130/313 (41%), Gaps = 43/313 (13%)
Query: 22 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
AC L L+ E+ Q FW PYIR L + ++PL +++ E+ L G+
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159
Query: 79 A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 130
A ++L R + E+ F+ G ++Y +D+ A +T F V +
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217
Query: 131 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
S + +S F L+PL +L + K + + L+ G +
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276
Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 247
GP+ N +LL+ YGFV +NP D L ++ ++ P+ Q ++ QR K + +VFH
Sbjct: 277 NNYGPKGNEELLMGYGFVIPENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFH 333
Query: 248 VHAGREKEAISDMLPYLRLGY-----------VSDTSEMQSVISSLGPICPVSPCMERAV 296
+ + + +P L + V++ +E+ + + P+S E AV
Sbjct: 334 LTKSADSGQKTSTVPALESDWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAV 393
Query: 297 LDQLADYFKARLA 309
L K +LA
Sbjct: 394 ALGLKAAIKQKLA 406
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 111/277 (40%), Gaps = 24/277 (8%)
Query: 62 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 121
+ +S+ E+ GS + I +Y +L T M ++ P F E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168
Query: 122 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 178
+K A V S + VS L+ +L +SS +C A D + ++
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
+ Y G+ + ++ GP PN++LL YGFV +NP+D + + P Y+ K + +
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERLWKLA 286
Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPICPVSPCM 292
G + + A +D LP L Y+ +S++ ++ G +S
Sbjct: 287 GLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADEKISDDS 339
Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 329
E +L L D A L G+ L A L +++P
Sbjct: 340 ETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 88/205 (42%), Gaps = 20/205 (9%)
Query: 24 LALYLMYEKKQ-GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------G 74
L + ++YE + ++S W PY + ++ L + W + EL L G
Sbjct: 90 LIICILYEMEVLQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIG 144
Query: 75 SPTKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAV 130
E+ ER + IK+ E T S F + Y I + +F E+ +
Sbjct: 145 KKEAKEMHERIIKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNEN 204
Query: 131 QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+ +++ R +++PL L A +SKC A L + +++V R + E +
Sbjct: 205 EEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYN 264
Query: 190 WCGPQPNSKLLINYGFVDED-NPYD 213
G PNS+LL YG+V+ D + YD
Sbjct: 265 IYGEHPNSELLRRYGYVEWDGSKYD 289
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 14/151 (9%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
P+ W EL +L GS +I ER IK +Y + +W P I + T
Sbjct: 141 PIFWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTL 188
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDR 179
E FK A + V S + V+ AR A+VP L + + K A + +
Sbjct: 189 EEFKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQ 247
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
G I G + N + L+NYGF EDN
Sbjct: 248 KISVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 36/72 (50%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
++PL S+K + V+ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 206 VDEDNPYDRLVV 217
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 151/359 (42%), Gaps = 48/359 (13%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
+AL+L++E+ + + W ++ + A + PL WSE EL+ L G+ + +
Sbjct: 126 IALFLLHERSK-PDAAWQGFLDSI-------PAAPDVPLFWSEEELSQLEGTQLLSSVQG 177
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ + +Y EL+ LF + P ++ + F A V+S V
Sbjct: 178 YRQFFEAKYAELEE------QLFAPHREAFPPKSHQLDDFLWAVATVRSRV---HSPLDG 228
Query: 144 RRFALVPLGPPLLAY----SSKCKAMLAA--VDDAVQLVVD--RPYKAGESIVVWCGP-- 193
ALVPL L+ + ++ + LA A LVV+ R Y GE + + G
Sbjct: 229 EDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQALVVEAQRDYAEGEVVTMDFGAPL 287
Query: 194 ------QPNSKLLINYGFVDEDNPY-DRLVVEA------ALNTEDPQYQDKRMVAQRNGK 240
+ +S++L++YG +D D P D VV+ AL +D Y DK + + NG
Sbjct: 288 TEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFILSLALPEDDKYYDDKADILELNGL 347
Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG---PICPVSPCMERAVL 297
F + A E L D ++ + + + PVS ERAV
Sbjct: 348 SEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEPLFRNEAWGHMLAPVSEANERAVY 407
Query: 298 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ-LVRM-EKKMLNACLQVTAD 354
+ + + +A L GY ++ +D L D P R+ LVR+ EK+ L+A L D
Sbjct: 408 ESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRLEKAILVRLGEKETLDATLAFFED 464
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 94/207 (45%), Gaps = 19/207 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ E+ ++S W PY+ L + P+ W E+E+ L +P
Sbjct: 104 VSPLLALCTFLIAERVARERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
+ + LE+ +K + E W SL + +I T+ +T+ + A+ V + V
Sbjct: 156 RQKALEQQTEVKELHTE---SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYM 211
Query: 136 -HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
H ++ L+ + + + P LL +S + A ++ ++ + + + +
Sbjct: 212 KHPRRHGLSAQQDVYAMAPYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +LL+ YGF+ +NP+ + V
Sbjct: 272 YGPHDNQRLLLEYGFIAANNPHRSVYV 298
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L +++++ +P GE + G PNS+LL YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272
Query: 207 D-EDNPYD 213
+ E + YD
Sbjct: 273 EWEGSKYD 280
>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
Length = 486
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 118
S W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369
Query: 119 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 171
+ E K + V S + +++ LV + P+L + +S K + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
+ DR Y + I + G N++L ++YGF +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 46.2 bits (108), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 27/202 (13%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L L +++E +G S W PY+ L + E+P+ WS ELA L SP A +
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155
Query: 83 ERAEG---IKRE----YNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 125
RAEG I+ E + ++F AG L + I AF E
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215
Query: 126 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
A + + R +VP+ +L ++ A + ++A+ + R AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274
Query: 185 ESIVVWCGPQPNSKLLINYGFV 206
E I+ + GP PN +L YG+
Sbjct: 275 EEILNYYGPLPNGQLCRRYGYT 296
>gi|219125216|ref|XP_002182882.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405676|gb|EEC45618.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 488
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 63/161 (39%), Gaps = 17/161 (10%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
P WSE+EL L GSP + + EGI+++YN + W + + P+ F
Sbjct: 138 PRQWSESELKALRGSPLLIRVRKAREGIQKDYNLIREAW------KEIHSSHSPSTTANF 191
Query: 121 ---EIFKQAFVAVQSCVVH-----LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV--- 169
E F A V S K ++PL + +C +
Sbjct: 192 PKLEEFSSAMAVVSSRAFSGMAGFETKGGAVNDTTMIPLLDLCNHHRGRCVTKNVSYRFK 251
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
D V + G+++ + G Q N++L +NYGF DN
Sbjct: 252 DGTVMVKAVTDIAIGDTLKITYGAQGNAQLFLNYGFCVADN 292
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 89/214 (41%), Gaps = 19/214 (8%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS------ETELAY 71
LS L+ Y+ +EK++G SFW P+I L + LA PL+W EL
Sbjct: 143 LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA---PLVWKVLKVDHYEELLK 198
Query: 72 LTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFT----FEIFKQA 126
L + TK + + + + +YN + D + + + T+A E++ +
Sbjct: 199 LLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNERSNDLTDAIRHLVPIELYLWS 258
Query: 127 FVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
++ + S +++ Q + A F + P L +S + L Q+ Y
Sbjct: 259 WMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQCGLKIDGTGFQVYTTCSYNP 317
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
E + + GP N LL YGF +N ++ L V
Sbjct: 318 DEQLFLSYGPHSNEFLLCEYGFTLPENKWNDLDV 351
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 81/199 (40%), Gaps = 24/199 (12%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAE 80
LA+ L+YE + KS W +I L + ++S + WS E L P K +
Sbjct: 252 ALAMALLYESYE-PKSMWREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQ 303
Query: 81 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQK 139
ILER +++ YN L +P+ +++E+FK A++ V S +
Sbjct: 304 ILER--HLQQLYNA------TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTG 355
Query: 140 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQ 194
+ L PL L + L + V L R K GE +V G
Sbjct: 356 PDTLPQIMLAPLVDLLHHDPVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGEL 415
Query: 195 PNSKLLINYGFVDEDNPYD 213
PN +LL+ +G NPY+
Sbjct: 416 PNHQLLLRFGLAMPRNPYE 434
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 81/189 (42%), Gaps = 21/189 (11%)
Query: 24 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L +Y+++++K G+ SF+ PY + L + P+ W+ EL L GS +I
Sbjct: 177 LMIYILWDRKVNGETSFFAPYYKILP------ETLRNMPIFWTREELDALEGSYLLLQIA 230
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+RAE IK +Y + ++ G + T E F+ A + V S L ++
Sbjct: 231 DRAEAIKEDYISICSIAPEFGDI------------ATLEEFQWARMIVCSRNFGLL-ING 277
Query: 143 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
R ALVP L + K + + + + GE + G + N + L+
Sbjct: 278 HRTSALVPHADMLNHLRPRETKWTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLL 337
Query: 202 NYGFVDEDN 210
NYGF E N
Sbjct: 338 NYGFCVERN 346
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 36/74 (48%)
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
R AL+PL + YS M D A+ L V + G+ GP N LL YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274
Query: 205 FVDEDNPYDRLVVE 218
FV++DNP D V+E
Sbjct: 275 FVEQDNPSDTYVLE 288
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 92/212 (43%), Gaps = 30/212 (14%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+LS L ++L+ E+ +G SFW PY+ L + L W+ E+ L
Sbjct: 50 RLSPTETLVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF- 101
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
TK + + +N L + L +Q P AFT+++FK A+ +V + V+
Sbjct: 102 TKRRACDLRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVY 157
Query: 137 L---QKVSLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKA 183
+ Q L+ + AL P LL ++ + A DD+ L +PY
Sbjct: 158 MSQPQNSVLSPDEEDKSALAPFL-DLLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY-- 213
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
+ + + GP N KLL+ YGF NP++ +
Sbjct: 214 -DQVFINYGPHSNEKLLLEYGFTLPCNPHNNI 244
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 105/270 (38%), Gaps = 21/270 (7%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + ++E EL G+ + + I+ ++ L LF Q+ P + F+
Sbjct: 116 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 169
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 177
E +K A V S + Q L P +L +SS+ K + + ++
Sbjct: 170 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 228
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
+ Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K +
Sbjct: 229 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 288
Query: 238 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCM 292
G S + ++D LP YLR+ + + + VS
Sbjct: 289 AGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSDSN 341
Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAML 322
E +L L + F L G+ L + E L
Sbjct: 342 EVEILQFLVESFGHLLGGFGTPLEKLEEQL 371
>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
Length = 607
Score = 45.8 bits (107), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 93/224 (41%), Gaps = 40/224 (17%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L+ A LAL++++E +S Y+ L G+ +V PLLW+ T++A L G+P
Sbjct: 145 ELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQVATLRGTP 201
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
T +L RA+ + + L +P E F + A +V S
Sbjct: 202 THGRVLRRAKFVSDAHAAL---------FGSGGGGGVPLEKFAW-----ALSSVLSRAAS 247
Query: 137 LQKVSLARRFALVPLGPPLLAYSS-KCKAMLAAV--------------DDAVQLVVDRPY 181
++ +A +P G LL + L+AV D V V D P
Sbjct: 248 GDRMP----YAFLP-GVDLLNHGGVDANCELSAVKLAPGGNEENVTWGDVEVTCVKDTP- 301
Query: 182 KAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTE 224
AGE + + G + N +LL YGF N +DR +E L +
Sbjct: 302 -AGEQLTISYGDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344
>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
Length = 504
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 82/369 (22%), Positives = 143/369 (38%), Gaps = 67/369 (18%)
Query: 24 LALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
LAL L+ E+ + K S W PY+ L DR +PL ++ ++ L + L
Sbjct: 151 LALALIMERFRAK-SDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPAL 201
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCV 134
+ + I R+Y + F Q D + FT+++F+ AV + +
Sbjct: 202 KLCKHIARQYGFIRR--------FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVM 253
Query: 135 VHLQKV--------SLARRFALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVD 178
KV + AL+PL P A ++C A A ++ ++ +
Sbjct: 254 TRQNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLT 313
Query: 179 RPYK--AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
R A I + G + +++ L++ GFV NP+ + L P Y+++ + +
Sbjct: 314 REVSDIASVPIFIVYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLE 373
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPY----LRLGYVSDTS------------EMQSVIS 280
G + F RE A + P + L VS + + +
Sbjct: 374 LLGMPTTGTFSFGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLR 433
Query: 281 SLGPICPVSPC--MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 338
+ + P C ER LA K L YP T+ +DEA+L N H +R+ +
Sbjct: 434 TYQALLPAELCARTER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYR 488
Query: 339 RMEKKMLNA 347
EK++L +
Sbjct: 489 LGEKQILRS 497
>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 691
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 15/193 (7%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
+L+ + +G + FW PYI L + G L +PL + +L +L G+ + A E+
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168
Query: 86 EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 140
E + +E + EL F ++Y +++ A T + + V + VV ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
L+P +L + K A + V VV AGE + GP+ N +L+
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLM 284
Query: 201 INYGFVDEDNPYD 213
+NYGF ++NP D
Sbjct: 285 MNYGFCLQNNPCD 297
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 112/271 (41%), Gaps = 46/271 (16%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
+AL L+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+
Sbjct: 128 VALLLLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLD 176
Query: 84 RAEG----IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHL 137
A G ++ Y EL +F +Y + AF F+ AF ++S + L
Sbjct: 177 NAAGYDAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPL 230
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYK 182
++A LVP G L+ +SS A A + DR Y
Sbjct: 231 DGANVA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYD 285
Query: 183 AGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 241
G I V P+ +SK ++YGF+D NP + ++ +D DK V + G
Sbjct: 286 EGAEIFVNYDPEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLP 345
Query: 242 SVQVFHVHAGREKEAISDMLPYLRLGYVSDT 272
F + + + ++ +LRL + DT
Sbjct: 346 EAPTFTLRPYSDPD--RELRTFLRLLHCKDT 374
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 27/212 (12%)
Query: 42 PYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE---- 94
PYIR L D+ R +PL +S+TEL L GS L+R + E+ +
Sbjct: 117 PYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSEWEQCQKT 168
Query: 95 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 154
+ TV G F Y + + F ++ +V S + ++ G
Sbjct: 169 IATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYPVLLPGID 223
Query: 155 LLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 208
L +S S ++ + D + + LV+ + AG ++ GP+PN++L++ YGF
Sbjct: 224 SLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELILGYGFSLP 283
Query: 209 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
+NP D +V++ N+ Q K V RN +
Sbjct: 284 ENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 63/271 (23%), Positives = 111/271 (40%), Gaps = 23/271 (8%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + ++E EL G+ + + I+ ++ L LF Q+ P + F+
Sbjct: 107 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 177
E +K A V S + Q L P +L +SS+ K + + ++
Sbjct: 161 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 219
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
+ Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K +
Sbjct: 220 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 279
Query: 238 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSLGPICPVSPC 291
G S + ++D LP YLR+ +D S++ ++ + + VS
Sbjct: 280 AGLDSTSTIPL-------TLTDPLPKNVLRYLRI-QRADASDLAAMALQNAKADEKVSDS 331
Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAML 322
E +L L + F L G+ L + E L
Sbjct: 332 NEVEILQFLVESFGHLLGGFGTPLEKLEEQL 362
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 73/351 (20%), Positives = 134/351 (38%), Gaps = 44/351 (12%)
Query: 28 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 87
L+ EK +G S W P+I L + + L ++ ++ L G+ + +
Sbjct: 251 LIMEKLRGADSPWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRV 303
Query: 88 IKREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVH 136
I R Y + D+V +LF Y Y++ A + +Q V Q
Sbjct: 304 IARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSD 363
Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
+ ++ AL+P S K + ++ YK+GE ++ G + N
Sbjct: 364 AANLPIS---ALIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSN 420
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAG 251
+ L++ GFVD NP D + + L+ D + + ++ +R +L V H
Sbjct: 421 ADRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHIS 480
Query: 252 REKEAISDMLPYLRLGYVSDT------SEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
E +L ++R+ +S S+++ + L C + +E L K
Sbjct: 481 GE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLK 534
Query: 306 ARLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
L ATL E EA+ + + Q R+E+++L+ LQ
Sbjct: 535 LLLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 45.4 bits (106), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
A+VPL L A + A L D +++ R K GE I G PNS LL YG
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404
Query: 206 VDEDNPYDRLVVEAALNT 223
VD+ N +D VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420
>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
Length = 418
Score = 45.4 bits (106), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 22/172 (12%)
Query: 64 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 120
W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F +
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 175
E K + V S + +++ LV + P+L + +S K + + +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311
Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 227
DR Y + I + G N++L ++YGF +NPYD E L T+ P+
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD----ETELLTQFPE 359
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
++ L+R FA G L +S Q+V ++ +K G+S+ + G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241
Query: 199 LLINYGFVDEDNPYDRLVV 217
LL++YGF+ DNP D V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260
>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
Length = 1233
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 77/334 (23%), Positives = 127/334 (38%), Gaps = 86/334 (25%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPT 77
SEL LAL+LM E+ + A +P+LW + E L GSP
Sbjct: 913 SELVGLALWLMQERAK----------------------ATLTPILWPDEERQQLLRGSPV 950
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-H 136
E R + +++E+ ++ + + YP + E QAF+ S V+ H
Sbjct: 951 LEEARTREQALRQEWQDIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVLAH 1002
Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+ A+ FAL+PL L S A+L +AV +V R G+ + ++C
Sbjct: 1003 AAYLPKAQCFALLPLVGGLCRTGSSSGALLDYDLEREAVTVVAQR--TPGQEVALYC--- 1057
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG---KLSVQVFHVHAG 251
L + A+L D Y KR + + G K +F
Sbjct: 1058 --------------------LFMAASLVAADRLYTTKREILEELGLGVKAEFPIFE---- 1093
Query: 252 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
++ A ++ + E ++I SP E +L L + R+ Y
Sbjct: 1094 -DRLATQQLINF----------EQDTII---------SPENEYEILQLLMGDLRDRIQAY 1133
Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
+D L +L P++R+A QL EK++L
Sbjct: 1134 ATEFDDDIKDLQRTDLTPRQRLAAQLRLGEKRIL 1167
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 80/202 (39%), Gaps = 21/202 (10%)
Query: 18 LSELACLALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
LSE + +Y MY K K+SFW PY + + SP+ WSE EL L G
Sbjct: 129 LSEKNKVLMYFLLQMYNPK--KESFWKPYFDIM-------PTNLTSPIFWSEDELQELAG 179
Query: 75 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
S + ++ Y+EL +F+ +AFT + + A S V
Sbjct: 180 SEVSNMARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRV 233
Query: 135 VHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWC 191
+ L + + +PL + S+ K + A + DR G +
Sbjct: 234 IQLNRQTGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESY 293
Query: 192 GPQPNSKLLINYGFVDEDNPYD 213
G + N + L+ GFV EDNP D
Sbjct: 294 GNKSNYEYLLYNGFVMEDNPND 315
>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
Length = 452
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 77/192 (40%), Gaps = 36/192 (18%)
Query: 58 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
V+ PL W + + L G ++ R Y+++ F A + F + EA
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHG---ARFYSQVYKHLFGANNQF------VNAEA 193
Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY---SSKCKAMLAAVDDAVQ 174
F + I S ++ + FAL+P S C+ L + D+ VQ
Sbjct: 194 FFWAI---------SILMSRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQ 243
Query: 175 ---------LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL-- 221
+ R Y+ GE + + G N +LL NYGF +NPYD L + AAL
Sbjct: 244 DFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQ 303
Query: 222 -NTEDPQYQDKR 232
N DP + KR
Sbjct: 304 PNEADPAFAQKR 315
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 107/248 (43%), Gaps = 24/248 (9%)
Query: 7 YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
Y L T L LALY+M E + ++SF+ PY+R L ES LL +
Sbjct: 83 YGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQD 142
Query: 67 TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
+L T + ++ ++L + RE T+ F++ S YP + +TFE+F A
Sbjct: 143 HKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDFA 188
Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKA 183
+ +Q+ + S ALVP L + + K + + + Y
Sbjct: 189 WRTIQARAFGKRLKS----SALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNNRYPR 244
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGK 240
++ G + N LL++YGF DN +D V +L + + P + ++ + +G+
Sbjct: 245 NSEVLNSYGRRANDNLLLDYGFAMLDNEWDAAEVICSLPPSHDQSPLDRRRKACLRASGQ 304
Query: 241 LSVQVFHV 248
+V++ V
Sbjct: 305 HTVRILRV 312
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 45.1 bits (105), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 90/444 (20%), Positives = 162/444 (36%), Gaps = 128/444 (28%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
+ +LT + A+ L+YE G+KS W YI L L V+ P+LW +
Sbjct: 111 SNILTKYTMERNIATAIALIYEASIGEKSKWYGYISSL-------PLKVDIPILWDKESQ 163
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
L G+ + I + I Y ++ + L + +P E F+ EIF
Sbjct: 164 QLLNGTVMEDVIQDDNILINHAYADI-----VESLLIKNHP-----EYFSKEIFSFENFK 213
Query: 130 VQSCVVHLQKVSLARRF--ALVPLG----------------------------------- 152
+ + +V + + +LVPL
Sbjct: 214 IANSIVSSRAFCIDSYHGDSLVPLADIFNHKTGRENVHIESNGDVCNKCGSIKTCKHRKV 273
Query: 153 PPLLAYSSK---------------------------CKAMLAAVDDAVQLVVDRPYKAGE 185
PL+ S+K C + D+ + + V + KA +
Sbjct: 274 TPLITKSAKSYKKLTNKKKMELIEKQKQQQINDEENCGDIAEEDDEHLYIKVVKAVKANQ 333
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----------NTEDPQYQDKRM- 233
+ G N+ LL YGF++ DNP D L VE +L N D KR+
Sbjct: 334 EVYNTYGDHSNATLLSKYGFIEMDNPCDNLPVEKSLVDTNLISLCKENGFDSNELSKRIS 393
Query: 234 -------VAQRNGKLSVQVFHVHAGREKEA-----------ISDMLPYLRLGYVSDTSEM 275
+ RN S+++ +GR +A +S+ +L++ ++
Sbjct: 394 FYASLFDIDSRNTH-SIEI----SGRLDDALVCSVGIALAPLSEFEGWLKMS----EHKL 444
Query: 276 QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY--NLHPKK 331
+ L I + +++A++ L + +L+ YP TL +D+ L + N +K
Sbjct: 445 EKYFEKLEAEDIVKQNAQVKKAIVQILNN----KLSNYPTTLEQDQNKLKELKENEENRK 500
Query: 332 RVATQLVRMEKKMLNACLQVTADM 355
++T L EKK++ ++ D+
Sbjct: 501 IISTSLNICEKKLIYKSIKYYEDL 524
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 45.1 bits (105), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 88/383 (22%), Positives = 153/383 (39%), Gaps = 53/383 (13%)
Query: 12 LLTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
+L +LS+ +A +L+YE + + S WLPY+ L QL + + E+
Sbjct: 113 ILDDFRLSDAEAMAFWLIYELTRPERASPWLPYLESL--PASIKQLT----MFYDPFEMK 166
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQ 125
L SP R ++ ++ + + + P + P E T + F
Sbjct: 167 RLQASPVAEFTSRRTVKMRNKFGKYRE------QISKHRPAHLAEIEFPVELITVDDFLW 220
Query: 126 AFVAVQSC------VVHLQKVSLARRFALVPLG-----PPLLAYSSKCKAMLAAVDDAVQ 174
A +AVQ V H R LVPL P + +C L + +
Sbjct: 221 A-MAVQFTRLITVQVKHPADGEWERTKCLVPLADLLNTAPADQINVECATNLDSTH--FE 277
Query: 175 LVVDRPYKAGESIVV-WCGPQ--PNSKLLINYGFVDEDNPYDRLVV------EAALNTED 225
RP G+ ++ + G + N +L+++YG +NP D + + E A+ +
Sbjct: 278 CATIRPVAEGQELLTPYGGAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDS 337
Query: 226 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSL 282
M R +L + V KE +L + R+ YVS D S+++ V+ +
Sbjct: 338 KMRLLMAMSLDRFDRLQLPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELM 392
Query: 283 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-R 339
++P ER L+ L + Y T+ EDE ML + + P +V R
Sbjct: 393 KEHRAINPSNERRALELLLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLR 452
Query: 340 M-EKKMLNACLQVTADMIMLLPD 361
+ EK++L++ Q+ I LP+
Sbjct: 453 LGEKRILSSLWQLLDSAIEALPE 475
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 51/275 (18%)
Query: 32 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 91
+K S WLPY+ L + +PL +S EL L+ SP E L + R+
Sbjct: 167 QKLSPDSSWLPYLDALPQ-------TFSTPLYFSALELRKLSPSPAYEESLIMYRNVARQ 219
Query: 92 YNELDTVWFMAG----------------------SLFQQYPYDIPTEAFTFEIFKQAFVA 129
+ V+F+A LF P+ + FTF++++ A
Sbjct: 220 F-----VYFLAAVQRSERSRSAKKDKNHAAVGMEPLFLNAPFTVSN--FTFDLYRWAVAC 272
Query: 130 VQSCVVHL----QKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRP 180
V + + + K S + A VP PLL + + D + +
Sbjct: 273 VTTRINFIPSQYAKDSNGQPVA-VPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKATKD 331
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMV---AQ 236
YKAG+ + ++ G + N + ++ GFV D +N D ++ D Q + + + A
Sbjct: 332 YKAGDEVTIFYGIRTNRQFFLHNGFVPDGENKNDTYKLKIGFPRGDKQVRARLKLMHDAG 391
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 271
N + V VF V+A +S +L + R+ V +
Sbjct: 392 FNAESRVFVFEVNASERPVPLS-LLDFARVFLVEN 425
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 68/145 (46%), Gaps = 17/145 (11%)
Query: 9 AAELLTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
A + +T+ +++ A +AL+L++E Q +KS W P++ L R VE+PLLW+
Sbjct: 185 ALKTITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPR 237
Query: 68 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
ELA L GS ++ + + + + F L Q++P P E F E + A
Sbjct: 238 ELAQLEGS----NLIGFRDAVLKGWTTQRDALF--PKLTQKFPSLFPEEHFRTERWAWAM 291
Query: 128 VAVQSCVVHLQKVSLARRFALVPLG 152
V S V + R A+ P G
Sbjct: 292 AIVWSRAA---DVPVPRPEAIFPSG 313
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 78/206 (37%), Gaps = 41/206 (19%)
Query: 38 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 97
S W PY L R P+L S +LA TG+ E++ + R+Y L
Sbjct: 652 SLWQPYFDLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQH 704
Query: 98 VWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL 151
+ F+ P IP FTF F A V + C + + + L+PL
Sbjct: 705 I-------FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPL 757
Query: 152 --------GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
G + +K C +A+ D GE I + G + N ++L+
Sbjct: 758 FDMFNHKPGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLL 807
Query: 202 NYGFVDEDN-PYDRLVVEAALNTEDP 226
GFVD N YD + + ++ DP
Sbjct: 808 YSGFVDPTNIEYDHIKLSVSIPQSDP 833
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 159 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
S CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251
Query: 219 AAL 221
+
Sbjct: 252 EVI 254
>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
Length = 424
Score = 45.1 bits (105), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 71/302 (23%), Positives = 118/302 (39%), Gaps = 56/302 (18%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTK 78
LAL+L+ E +SF+ PY LAV ES W++ EL L
Sbjct: 90 VLALFLVRELLLADRSFFHPY------------LAVLPYPESVQDWTQAELGELHD---- 133
Query: 79 AEILER-AEGIKREYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
ER + R +E+D + Q +YP + P +TF+ FK A+ +Q+
Sbjct: 134 ----ERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA---- 185
Query: 137 LQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR---PYKAGESIVVW 190
+ RR ALVP L + K D+ + + + G +
Sbjct: 186 ---RTFGRRLPWTALVPFADCLNHTNVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNS 242
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
G + N +LL++YGF DN +D + VE + P R KL V
Sbjct: 243 YGRRSNFQLLLDYGFALPDNEWDYVDVEIGKDRAGP----------RGRKLRFMKRVVRI 292
Query: 251 GREKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYF 304
R+ ++ ++ P L ++D SE + +S +C + +++ +AD+
Sbjct: 293 DRQS-SLDELFPPSFLAGLADPVPDEEQSEAAAELSERTALCDALEWLRSILIETIADWG 351
Query: 305 KA 306
A
Sbjct: 352 TA 353
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L +++V +P +GE I G PNS LL YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314
Query: 207 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 250
D NP D + V A LN + + Q + + G V VF
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVFT--- 371
Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
R+ E S+++ +RL + T ++ P V +L + RL
Sbjct: 372 -RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427
Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
YP ++ +DEA+L L KR A + EK +L L
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 82/353 (23%), Positives = 146/353 (41%), Gaps = 46/353 (13%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
+AL L++E+ G+KS W Y+ L Q +++PL WS E+A LTG+ ++L
Sbjct: 134 AVALQLLHERSIGEKSRWAAYVNALPAQ-------LDAPLFWSAEEVATLTGT----QLL 182
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVS 141
+ A G + T + S F P P++AF F AF ++S C V
Sbjct: 183 DAAAGY--DSYARGTWARLKESAFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVD 237
Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV------------ 189
ALVP +A S + +++ V K+G S+++
Sbjct: 238 QGADIALVP--GLDMANHSGLSSQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAK 295
Query: 190 -------WCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKL 241
+ + +++L ++YGF D V+ A+ DP DK V + G
Sbjct: 296 GAEVFMNYGQRKIDNQLALDYGFTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLR 355
Query: 242 SVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAV 296
F + A + E ++ ++RL + D ++++ + G I PVS E+
Sbjct: 356 EAPSFVLRAFEDPE--PELRVFMRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEA 413
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
+ + + L GY + +D + D + + R+A ++ EK+ L L
Sbjct: 414 CGTMINGCEEALRGYATRVEDDRRVAEDPGVGHRLRLAARVRMGEKQALADAL 466
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 88/207 (42%), Gaps = 19/207 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G KS W PY+ L + + L+ E E+ L P
Sbjct: 105 ISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTCLVCLEPEVVRLLPRPL 156
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
+ + E+ +++ + + SLF + D+ F + F A+ + + V
Sbjct: 157 QMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYHAFLWAWCTINTRTVYM 212
Query: 136 -HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--QLVVDRPYKAGESIVVW 190
H QK L+ + L P LL +S + A ++ ++ K E + +
Sbjct: 213 KHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFIC 272
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +LL+ YGFV +NP+ + +
Sbjct: 273 YGPHDNHRLLLEYGFVASNNPHSAVYI 299
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 44.7 bits (104), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
R AL+P+ S C + + + DR Y+A E + G N LL YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235
Query: 205 FVDEDNPYDRLVVEAAL 221
F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 44.7 bits (104), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L + ++KC A L +D++++ KAGE + G PN+++L YG+V
Sbjct: 229 SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQVYNIYGNHPNAEILRRYGYV 288
Query: 207 D-EDNPYD--RLVVEAALNTEDPQY 228
+ E + YD L +E + T QY
Sbjct: 289 EWEGSKYDFGELPLEVIIETLHEQY 313
>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 465
Score = 44.7 bits (104), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 81/213 (38%), Gaps = 31/213 (14%)
Query: 14 TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 73
T + S L L L E+ G KS + Y R L R A W++ E +YL
Sbjct: 89 TKTEASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLA 142
Query: 74 GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 133
G+ + + + A K E+ + +F+++ + +FE F +A V S
Sbjct: 143 GTEVETSLRDEAAAAKNEWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSSR 191
Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-----------AVQLVVDRPYK 182
L S LVP+ + + D V++ ++ +
Sbjct: 192 AFTL---SPNAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVR 248
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
G+ I G N+KLL +YGF DNP D +
Sbjct: 249 RGDEIFNTYGFHGNAKLLNSYGFTQNDNPADEV 281
>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
Length = 440
Score = 44.7 bits (104), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 98/244 (40%), Gaps = 32/244 (13%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + + P+ E E+ L P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSVWKPYLDILPK-------SYTCPVCL-EPEVVNLLPEPLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
A+ E+ +++ + + LF++ + F++ A+ V + V+L+
Sbjct: 157 AKAEEQRMSVQQFFASSRDFFSSLQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR 212
Query: 139 KVSLARRFALVPLGPP---------LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESI 187
RR + L P LL +S + KA ++ Y+ + +
Sbjct: 213 ----TRRRDCLSLEPDTCALAPYLDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEV 268
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
+ GP N +LL+ YGFV NP+ R ++ L + D Q K + + +G L
Sbjct: 269 FICYGPHDNHRLLLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLE 328
Query: 243 VQVF 246
F
Sbjct: 329 NLTF 332
>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 693
Score = 44.7 bits (104), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 84/194 (43%), Gaps = 11/194 (5%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL++ +++ + K S W PY+ L R + S L + +L +L G+
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+K EY+ ++ G L + Y +D+ A+T I +AF + + V+L +
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220
Query: 143 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
++ + PL+ S+ K A + L V P + E I GP N +L
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQL 280
Query: 200 LINYGFVDEDNPYD 213
+ YGF DNP D
Sbjct: 281 MTTYGFCIVDNPCD 294
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 93/213 (43%), Gaps = 32/213 (15%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L + +AL++ ++K W PYI L R L ++ LL L Y
Sbjct: 86 LKQHPSIALFIAWQKIHPTPE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY------ 135
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVH 136
+I E A K + ++LDT + + Y+ IP + IFK A++ V + +
Sbjct: 136 --DIQEIA---KNQQSKLDTDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCIT 185
Query: 137 LQKVSLAR-------RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGE 185
+ ++++ + ++ L P L + A ++A D V+ + PYK G
Sbjct: 186 MNTNAISKPQLSHIHQQPIITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGS 245
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
+ + GP N+ LL YGF NP++ +V++
Sbjct: 246 QVFINYGPHDNNFLLAEYGFAILKNPFNHVVLD 278
>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
Length = 619
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 88/210 (41%), Gaps = 19/210 (9%)
Query: 21 LACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
L ++L+YE + +KS PY+ L R+ + L + E E+A L +
Sbjct: 106 LIVFYMFLIYELHVEKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYK 158
Query: 80 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
+ + +K+ Y E + M +YP + F++E F AF AV S V ++
Sbjct: 159 SVQSIRQNLKQIY-ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEY 212
Query: 140 -VSLARRFALVP-LGPPLLAYSSKCKAMLA---AVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+VP L P + + K A + D L K+G+ + G +
Sbjct: 213 PAENGEGVEIVPTLLPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAK 272
Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
N L++YGFV +N D L V+ ++ +
Sbjct: 273 SNDSFLLSYGFVIPNNSEDTLYVQFGISDD 302
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 214
L + CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D
Sbjct: 187 LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDE 244
Query: 215 LVVEAAL 221
+ ++ +
Sbjct: 245 VYLDEVI 251
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 88/388 (22%), Positives = 152/388 (39%), Gaps = 70/388 (18%)
Query: 13 LTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
L + L EL L L +MYE G +S W Y + L R+ ++ + W+ +EL
Sbjct: 81 LLSQDLEELGPWLSLMLVMMYEYLLGDQSTWAAYFKVLPRK-------FDTLMFWTPSEL 133
Query: 70 AYLTGSP---------TKAEILERAEGIKREYNEL----------------DTVWFMA-- 102
L GS ILE I R + L + +A
Sbjct: 134 LELQGSAVIDKIGRQGADESILEMIAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHT 193
Query: 103 -GSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS 159
GSL Y +DI P + + ++ + ++ L++ +VPL L A +
Sbjct: 194 MGSLIMAYAFDIEKPEDEDEEGDGEGGYMTDE------EEEQLSK--GMVPLADLLNADA 245
Query: 160 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVE 218
+ A L ++A+ + +P GE I G P + LL YG+V ++ PYD + V
Sbjct: 246 DRNNARLFQDENALVMKAIKPIAKGEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVS 305
Query: 219 -------AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVS 270
A L+ DP+ Q L +E + ++D+LP L + +
Sbjct: 306 LDVICKAAGLSDSDPEKQPPLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRT 365
Query: 271 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT------- 323
T + + P P A LA + + A Y T+++D+ +L+
Sbjct: 366 LTLSPEQLAQQRSKNKPPKPAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLNSSEV 425
Query: 324 -----DYNLHPKKRVATQLVRMEKKMLN 346
D + H ++++A Q+ EK++L+
Sbjct: 426 SSGFVDESTH-RRKMAVQVRIGEKEILH 452
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 44.7 bits (104), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 87/206 (42%), Gaps = 22/206 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 74
+ E LA +LM + G +SFW PYI+ L D Q R + L W E T L L
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179
Query: 75 SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
+ + G++ +E+ +T W F+ S I + AF+ E+ K
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233
Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
+ V L+ S+ LVPL + + + A + + L+V + GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYD 213
GP+ N +L++NYGF N D
Sbjct: 288 PNNYGPRSNERLMMNYGFCIRGNVCD 313
>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 684
Score = 44.3 bits (103), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 84/194 (43%), Gaps = 11/194 (5%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LA ++ +++ + K S W PY+ L R G+L S L + +L +L +
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165
Query: 84 RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+ +K EY+ ++ G L + Y ++I A+T I +AF + + ++ K
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223
Query: 143 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
R+ + PL+ S+ K A + L V P A E I GP N +L
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQL 283
Query: 200 LINYGFVDEDNPYD 213
+ YGF DNP D
Sbjct: 284 MATYGFCIVDNPCD 297
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 44.3 bits (103), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 113/297 (38%), Gaps = 39/297 (13%)
Query: 12 LLTTNKLSELACLALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
L L L C L + EK G+KS W PY+ L + A P E +
Sbjct: 112 FLIGQNLELLECFQFLLVNGIAEKHAGQKSPWKPYLDVLPK-------AYTCPAC-LEPD 163
Query: 69 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
+ L P + + E+ I+ + + LF + +I F F + A+
Sbjct: 164 IINLLPKPLQKKAQEQKMLIQELFQSSRAFFSSLQPLFAEDTGNI----FNFSALQWAWC 219
Query: 129 AVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRP 180
V + +++ K F+L P L P LL +S + KA + ++ D
Sbjct: 220 TVNTRTIYM-KHPHRECFSLEPDVYALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQ 278
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
K + +++ GP N +LL+ YGFV DNP+ + V A + DK QR K
Sbjct: 279 CKKYQEVLICYGPHDNQRLLLEYGFVATDNPHSSVYVSADTLLKYFSSLDK----QREAK 334
Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 297
+S+ H D L L G+ + + + + L C R +L
Sbjct: 335 VSILKDH-----------DFLENLTFGWEGPSWRLLTALKVLSLAADEFACWRRILL 380
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 44.3 bits (103), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 87/227 (38%), Gaps = 23/227 (10%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
LYL K K W PYI L + PL ++ EL + G+ A + E+
Sbjct: 42 LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92
Query: 86 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 145
+ V + L QQ+P ++F ++FK+ A QS V +
Sbjct: 93 NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKL 199
F V P +S+ C Q + K GE I + N KL
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 204
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
L+ YGFV+E+NP D L++ +D QY + + ++ S F
Sbjct: 205 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 44.3 bits (103), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 87/227 (38%), Gaps = 23/227 (10%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
LYL K K W PYI L + PL ++ EL + G+ A + E+
Sbjct: 84 LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 134
Query: 86 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 145
+ V + L QQ+P ++F ++FK+ A QS V +
Sbjct: 135 NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 186
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKL 199
F V P +S+ C Q + K GE I + N KL
Sbjct: 187 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 246
Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
L+ YGFV+E+NP D L++ +D QY + + ++ S F
Sbjct: 247 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 44.3 bits (103), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 92/227 (40%), Gaps = 37/227 (16%)
Query: 148 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 207
+VP+ L A A L + +++V +P KAGE I G PNS LL YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317
Query: 208 ----------EDNPYD------RLVVEAALNTEDPQYQDK----RMVAQRNGKLSVQVFH 247
NP D L V AA + QDK VA + F
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDD------TFV 371
Query: 248 VHAGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
+ G + + +++ + RL ++ D E S L P ++ VL D
Sbjct: 372 I--GTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSR 424
Query: 307 RLAGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 350
R+ Y T+ +DEA+L N P K+ A + EK++L+ LQ
Sbjct: 425 RINEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 44.3 bits (103), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 86/212 (40%), Gaps = 25/212 (11%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
LYL K K W PYI L + + PL ++ EL + G+ + E+
Sbjct: 84 LYLAINKTNPKCFHW-PYINVLPK-------TYDCPLSYTIDELNIMKGTKLYVAV-EKI 134
Query: 86 EGIKREYNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
+ V + L QQ+P Y P + ++FK+ A QS V +
Sbjct: 135 NAFL-----MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQ 185
Query: 145 RFALVPLGPPLLAYSSKC-KAMLAAVDDAVQLVV-----DRPYKAGESIVVWCGPQPNSK 198
F V P +S+ C +A + + + + K GE I + N K
Sbjct: 186 PFGEVGSLIPFCDFSNHCTQAKVTYISNTRTETFSFQTNEEVVKPGEQIFNNYRIRSNEK 245
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 230
LL+ YGFV+E+NP D L++ +D QY +
Sbjct: 246 LLLGYGFVEENNPCDNLLLRIYFEVDDNQYNE 277
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 44.3 bits (103), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 8/160 (5%)
Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTE 224
A A+ L DRPY AG+ + V GP+ + +LL++YGF NP+ + A++ +
Sbjct: 242 GATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDRQ 301
Query: 225 -DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSVISS 281
DP K R+G S F + E + L +L R +T E+ SV+
Sbjct: 302 GDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASVLFE 361
Query: 282 LGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 319
G P+ + V L L++ A L YP ++ D+
Sbjct: 362 SGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 30/183 (16%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
C+ L LM E SF+ PY+ L V++P+ WS+ E L GSP +
Sbjct: 73 CMLLLLMCESVN-SSSFFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAV 124
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
+ + R + E M +F +YP P F++E ++ A+ ++S
Sbjct: 125 KLRHELARSFQE------MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG------ 172
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLL 200
+ L+PL + + A D + L+ R Y VW G + ++ LL
Sbjct: 173 --NYTLMPLIDLMNHHPDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLL 224
Query: 201 INY 203
+NY
Sbjct: 225 LNY 227
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/171 (21%), Positives = 72/171 (42%), Gaps = 24/171 (14%)
Query: 56 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 115
V P++W EL +L + + +R + + ++++ F + DI
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149
Query: 116 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 170
E +T+ A++ V + + + K R AL+P+ + C+ +
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+ +V DR YK GE + + N L+ YGFV ++NP D + ++ +
Sbjct: 203 EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYIDDVI 253
>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
NRRL Y-27907]
Length = 436
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 78/210 (37%), Gaps = 19/210 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 74
LS L +Y+ E ++GK SFW P+ LD + PL+W ++ +L L
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172
Query: 75 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
P + + +YN + + Q D + F A++ + S
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224
Query: 135 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
+++ S + F + P L +S L Q+ Y E + +
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
GP N LL YGF DN ++ L V L
Sbjct: 284 GPHSNDFLLCEYGFTISDNKWNDLDVTEYL 313
>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
Length = 658
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 16/92 (17%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
+E CLALY+ Y K QG KSFW PY+ +++ + W +L L
Sbjct: 98 AEFNCLALYIFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLI 148
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYP 110
E + +E+D W+ L +YP
Sbjct: 149 DEFMYIQ-------SEMDKSWYKFKGLMNKYP 173
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 87/228 (38%), Gaps = 25/228 (10%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
LYL K K W PYI L + PL ++ EL + G+ A + E+
Sbjct: 84 LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 134
Query: 86 EGIKREYNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
+ V + L QQ+P Y P + ++FK+ A QS V +
Sbjct: 135 NAFL-----MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQ 185
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSK 198
F V P +S+ C Q + K GE I + N K
Sbjct: 186 PFGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEALVKPGEQIFNNYRIRSNEK 245
Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
LL+ YGFV+E+NP D L++ +D QY + + ++ S F
Sbjct: 246 LLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
+D + LV + + +G + GP PNS+LL+ +GF DNP++ + + A + P +
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288
Query: 230 DKRMVAQRNG 239
+K + NG
Sbjct: 289 EKNKIMLDNG 298
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 122/308 (39%), Gaps = 25/308 (8%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L++ L L+++YEK + SFW P+ L + + ++ TEL L G+
Sbjct: 1255 LNDDTILFLFVIYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT-- 1304
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ E IK + + F L QYP P FT E F A S + L
Sbjct: 1305 --NLFEETIQIKEHLESIRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL 1360
Query: 138 QKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
K+ LVP+ + + + +D +++ A I + G
Sbjct: 1361 -KIDGRIVNCLVPMADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQ 1419
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
+ +L + YGFV +N YD + + + ED P+ ++++ L+V ++H
Sbjct: 1420 SWELALYYGFVISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN-- 1477
Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
S +L LR+ +++ E + PI S E VL L L + +T
Sbjct: 1478 -IPSKLLASLRVALLAE-DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSST 1532
Query: 315 LSEDEAML 322
ED+ +L
Sbjct: 1533 CDEDQQLL 1540
>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
Length = 491
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 54/245 (22%), Positives = 93/245 (37%), Gaps = 60/245 (24%)
Query: 22 ACLALYL--MYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETEL-AYLTGSPT 77
CLAL + MYE+ +G S W Y++ L DR+ PL+W E L G+
Sbjct: 79 GCLALAVAVMYERARGTDSPWDAYLQLLPDRE--------SVPLVWPADEAECLLAGTEL 130
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + E + ++ E ++G L D+ + F+ E + A V S +
Sbjct: 131 DKIVKQDREFLCEDWKECIEPLLLSGEL------DVDPDDFSLEKYFSAKTLVSSRSFQI 184
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML------------------------------- 166
F +VPL L + + C+ +
Sbjct: 185 DSY---HGFGMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEDADDDQSDASADDESTIE 240
Query: 167 -------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 219
+ D+ +++++ R GE + G N+ LL YGF + DN YD + ++
Sbjct: 241 NPTSSSPGSKDEDLEMIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDL 300
Query: 220 ALNTE 224
AL T+
Sbjct: 301 ALVTK 305
>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
gorilla]
Length = 440
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 96/240 (40%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G++S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGRRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 101/242 (41%), Gaps = 26/242 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ E+ G+ S W PYI L + PL + + + L S
Sbjct: 105 ISPLLALCCFLISERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQ 157
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV- 135
K + ++ E + ++ T + LF Q PTE F+ + + A+ +V + V
Sbjct: 158 K-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVY 211
Query: 136 --HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
H Q L+R + L P LL + + + +A ++ K + +
Sbjct: 212 MEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFI 271
Query: 190 WCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
GP N +LL+ YGFV NP+ D ++ L+ +D Q ++K + + N L
Sbjct: 272 NYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNL 331
Query: 245 VF 246
F
Sbjct: 332 TF 333
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 43.9 bits (102), Expect = 0.12, Method: Composition-based stats.
Identities = 52/203 (25%), Positives = 85/203 (41%), Gaps = 19/203 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L++ L L+++YEK + +FW P+ L + + +S TEL L G+
Sbjct: 874 LNDETILFLFVIYEK-ENPNTFWRPFYDTLPSY-------FTTSIHYSSTELLELEGTNL 925
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
AE L +K++ F L QYP P F++E F A + S + L
Sbjct: 926 FAETL----AVKQQLQAFRDYLF--PELSNQYPDIFPESVFSWENFLWARSLLDSRAIQL 979
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQ 194
K+ + LVP+ ++ + + + D + ++V A I + G
Sbjct: 980 -KIDGKIKSCLVPMAD-MINHHTNAQISERHFDQDSNCFRMVSSCNIPANNQIFLHYGAL 1037
Query: 195 PNSKLLINYGFVDEDNPYDRLVV 217
NS L + YGFV +N YD V
Sbjct: 1038 QNSDLALYYGFVIPNNIYDSFHV 1060
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 57/260 (21%), Positives = 95/260 (36%), Gaps = 69/260 (26%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L ++L+ E+++G S+W PYI +L QR R PL ++E EL L SP E
Sbjct: 227 LRMFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFV 279
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ + R+Y + ++QY E+ + A + S H + +
Sbjct: 280 QLRNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQR 321
Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR------------------------ 179
RR +P+ L Y C A A Q+VV
Sbjct: 322 RRAQAMPVRYNELTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAF 381
Query: 180 --------------------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 219
P GE +++ G + + N FV D+P D+ ++
Sbjct: 382 GGASSFYDTQTRSLECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILL 441
Query: 220 ALNTEDPQYQDKRMVAQRNG 239
A+ +DP ++ K + Q G
Sbjct: 442 AVGKQDPLFKSKSTILQALG 461
>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
Y34]
Length = 373
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 24/163 (14%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 115
P +W + EL L PT A + E + +YN E +TV S+ FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161
Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 175
F +E V+ C S R ALVPL C+ ++ + + +
Sbjct: 162 RTFLYE------VSETECY------SWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207
Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
DR Y+AGE + + G N LL YGF+ N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250
>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
anatinus]
Length = 499
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 92/226 (40%), Gaps = 19/226 (8%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK+ G +S W PY+ L + A P+ L+ L P
Sbjct: 104 VSPLLALCTFLIAEKQAGARSLWQPYLGVLPQ-------AYTCPVGLDAAVLSLLP-QPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
E+ + RE +F SL + D+ FT + A+ V + V
Sbjct: 156 GRRAREQRTAV-RELFAASRAFF--SSLQPLFSEDV-ERVFTLDALGWAWCTVNTRTVYM 211
Query: 136 -HLQKVSLARRFALVPLGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
H Q+ + + L P LL +S ++ +A ++ + E +++
Sbjct: 212 EHAQRDCFSAEADIYALAPYLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
GP N +LL+ YGFV +NP+ +VV + DK+M +
Sbjct: 272 YGPHDNRRLLLEYGFVCSNNPHSNVVVSPDVLVRHLPSGDKQMTKK 317
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 101/242 (41%), Gaps = 26/242 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ E+ G+ S W PYI L + PL + + + L S
Sbjct: 105 ISPLLALCCFLISERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQ 157
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV- 135
K + ++ E + ++ T + LF Q PTE F+ + + A+ +V + V
Sbjct: 158 K-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVY 211
Query: 136 --HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
H Q L+R + L P LL + + + +A ++ K + +
Sbjct: 212 MEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFI 271
Query: 190 WCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
GP N +LL+ YGFV NP+ D ++ L+ +D Q ++K + + N L
Sbjct: 272 NYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNL 331
Query: 245 VF 246
F
Sbjct: 332 TF 333
>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
Length = 424
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 42/208 (20%), Positives = 89/208 (42%), Gaps = 24/208 (11%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
LS ++L+L E +GK+S+W P+I+ L + SP LW + G
Sbjct: 114 LSSFQIMSLFLELESSRGKESWWDPFIQMLPTIND----FLTSPFLWQ------IQG--- 160
Query: 78 KAEILERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
K E++E+ ++ +N ++ + +L + ++ + + F ++ + S
Sbjct: 161 KYELIEKLPKSTQKHSLKMFNRFESDFKAVKTLLE--THNASKDIINHDKFVLYWMCINS 218
Query: 133 CVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
+++ QK + + F + P + +S+ + L ++ YK + + +
Sbjct: 219 RCLYMEIPQKKTTSDNFTMAPY-VDFINHSTNDQCKLKIDRTGFHVITTSNYKENDELYL 277
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N LL YGF +N ++ L +
Sbjct: 278 SYGPHSNEFLLCEYGFHLSNNEWNDLDI 305
>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
Length = 689
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 12/90 (13%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETE 68
EL + ++ CLA++LM E QG++SFW PYI ++ ES LL W + E
Sbjct: 115 ELFMKHADADQLCLAVFLMNEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEE 164
Query: 69 LAYLTGSPTKAEILERAEGIKREYNELDTV 98
+ L + E + I+ E+N+L +
Sbjct: 165 IKLLNDFEIYQQAKEYRDDIEDEWNKLSKI 194
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 22/130 (16%)
Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
+ P +L Y K V +++ + RP KAGE + G P S L+ YGF+ DN
Sbjct: 302 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 356
Query: 211 PYDR--LVVEAALNTEDP----------QYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
PYD L ++ +++ ED + +RM+ R + +Q + ++ +
Sbjct: 357 PYDVIPLDLDTSVDEEDSSSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVH 411
Query: 259 DMLPYLRLGY 268
YLRLG+
Sbjct: 412 CYFVYLRLGH 421
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 41/223 (18%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
+L+ L+LYL+ EK +GK SFW PYI+ L + +P +S E+ L
Sbjct: 76 RLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTSYTTPSYFSTAEVDALPALV 128
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
+A + R + ++ Y L T LF + FT + ++ A+ V + V+
Sbjct: 129 REATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTVFTLKSYRWAWATVYTRSVY 182
Query: 137 LQ-------KVSLARRFALVPL-----GPPLLA------YSSKCKAMLAAVDDAVQLVVD 178
+ S +AL P PL+ SSKC ++ +
Sbjct: 183 KRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVSSKC----------YEVKTE 232
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+ + + P N +LL+ YGFV NP+ + AA+
Sbjct: 233 GACRKYRQVFINYDPYDNGRLLMEYGFVMPRNPHSVVTFTAAV 275
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 88/204 (43%), Gaps = 23/204 (11%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 83
A+ + E +SFW PY+ EL AV + W++ ELA + + E++E
Sbjct: 49 AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
+ + I +Y + + + Q + IP+E E +++A V S + L
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153
Query: 144 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
+P L +S + C D +L+ Y GE + + G + +S
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSS 212
Query: 198 KLLINYGFVDEDNPYDRLVVEAAL 221
+LL +YGF+ E+N D + + A+L
Sbjct: 213 QLLASYGFMYENNAEDTVDITASL 236
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 140 EWDGSKYD 147
>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
Length = 689
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 49/225 (21%), Positives = 90/225 (40%), Gaps = 38/225 (16%)
Query: 6 FYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 65
F+Y+ L + ++ + LA ++++E+ +G+ S W PY+ L R + S L +
Sbjct: 88 FHYSDIL--SQRVRKSIILAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQ 140
Query: 66 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
+ +L +L G+ +K EY+ ++ G L E++ ++IF
Sbjct: 141 DNDLEWLQGTNLYQTHQAYRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCW 192
Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA------------- 172
A+ + S + +R P L + + ML VD +
Sbjct: 193 AYTLIAS------RAFTSRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAE 246
Query: 173 ---VQLVVDRPYKAGESIVVWCGPQPNSK-LLINYGFVDEDNPYD 213
+ L V P +GE + G N + ++ YGF DNP D
Sbjct: 247 ATEIGLKVIEPTFSGEEVHNNYGSLNNQQSVMTTYGFCIVDNPCD 291
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)
Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 213
LL + + K +D V V K GE + G + N LL++YGFV + NPYD
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288
Query: 214 --------RLVVEAALNTE 224
+ +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307
>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
Length = 416
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 81 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 132
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 133 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 188
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 189 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 248
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 249 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308
>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
troglodytes]
gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
Length = 307
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 84/211 (39%), Gaps = 19/211 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
GP N +L + YGFV NP+ + V N
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSRGWN 303
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 78/369 (21%), Positives = 145/369 (39%), Gaps = 53/369 (14%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
K+S A+ ++Y + S+W Y+ L +Q + + ++ E+ L SP
Sbjct: 282 KISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ-------FTTTVYFTREEMTQLQASP 333
Query: 77 TKAEILERAEGIKREYN-------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 123
R G+ R Y+ E D+ S + + +T + F
Sbjct: 334 VHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDSTKTKTKSQLDAMK-EFKDDRYTLDQF 392
Query: 124 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS----SKCKAMLAAVDDAVQLVVDR 179
K A V S L + +VPL A + SK ++A ++ +
Sbjct: 393 KWALGCVWSRAFSLSE----EDGGMVPLADMFNADTVISRSKVHPKISASSPSLVYTASQ 448
Query: 180 PYKAGESIVVWCG---PQPNSKLLINYGFVDEDNPY--DRLVVEAALNTEDPQYQDKRMV 234
+AGE I G + ++L++YGF+ ED +V A + +P Y KR +
Sbjct: 449 DIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRHL 508
Query: 235 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCM 292
Q NG + + F + + + ++ + R+ ++ ++ + + S ++P
Sbjct: 509 MQSNG-IESEEFTI---TKNKLAKELFLFARIKSINKKESDQASAHFMSTQRHSMLNPRN 564
Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAML-------TDYNLHPKKRVAT----QLVRME 341
E+A L L++ L Y T+ +D +L T+ N H T +L ME
Sbjct: 565 EKAALRLLSNLISRHLDAYQTTIDQDNQILKEIEKDKTNTN-HSSVTFNTINAIKLRLME 623
Query: 342 KKMLNACLQ 350
K +LN+ L+
Sbjct: 624 KNILNSFLK 632
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 15/99 (15%)
Query: 24 LALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L LY+++EK G S W P+ L SPLL+ TEL L G+ E
Sbjct: 469 LLLYVLHEKLVAGSASRWAPFFETL-------PATYNSPLLFHVTELLELEGTRLIDETF 521
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 121
E +G++ + L G L + YP PT+AFT+E
Sbjct: 522 EIKDGLRVLHESL-------GPLAEAYPALFPTDAFTYE 553
>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
Length = 307
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 84/211 (39%), Gaps = 19/211 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
GP N +L + YGFV NP+ + V N
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSRGWN 303
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 46/212 (21%), Positives = 86/212 (40%), Gaps = 34/212 (16%)
Query: 22 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
A L YL++++ + +FW PY+R L + R + ++ +L G+ +
Sbjct: 84 AILCAYLIHQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAM 131
Query: 82 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVV 135
E+ + I+ ++ + +L +P P + FT++ F F A Q+ V
Sbjct: 132 QEKQQQIRESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQ 185
Query: 136 HLQKVSLARR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGE 185
+ A + L+ + LL Y K L ++ V + + G
Sbjct: 186 QPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGT 245
Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP+ N +LL+ YGF EDN D +++
Sbjct: 246 EAFNNYGPKGNEELLMGYGFCIEDNEQDYVMI 277
>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 518
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 91/211 (43%), Gaps = 25/211 (11%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
D+ + +P K G+ I G + N LL+ YGF N YD + +N Q
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309
Query: 230 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 270
+ +++V Q + K+++ + +K I+ D++ YLRL +
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369
Query: 271 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 328
+++ V+ SL PVSP E VL L+ + T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425
Query: 329 PKKRVATQLVRMEKKMLNACLQVTADMIMLL 359
K R A +K++L + + ++LL
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILL 456
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 81/181 (44%), Gaps = 17/181 (9%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S +L++E +L G+ + + I+ ++ L LF Q+P P + FT
Sbjct: 115 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 168
Query: 120 FE----IFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDD 171
E +K A V S + LA ++ L P +L ++S+ K +
Sbjct: 169 VEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSG 225
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
+ + + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K
Sbjct: 226 TLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQK 285
Query: 232 R 232
+
Sbjct: 286 Q 286
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
Length = 440
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 90/205 (43%), Gaps = 30/205 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L ++L+ E+ +G SFW PY+ L + L W+ E+ L TK +
Sbjct: 1 LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKV 140
+ +N L + L +Q P AFT+++FK A+ +V + V++ Q
Sbjct: 53 LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNS 108
Query: 141 SLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ------LVVDRPYKAGESIVVW 190
L+ + AL P LL ++ + A DD+ + L +PY + + +
Sbjct: 109 VLSPDEEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFIN 163
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
GP N KLL+ YGF NP++ +
Sbjct: 164 YGPHSNEKLLLEYGFTLPCNPHNNI 188
>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
Length = 254
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 68/162 (41%), Gaps = 25/162 (15%)
Query: 163 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
K+ L V+ VQL+ P KAGE I ++ G + L +GF D DNP D + E L
Sbjct: 74 KSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELDL- 128
Query: 223 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 282
+E + Q K M +V+ ++ D LP RL + +
Sbjct: 129 SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAMLRI 168
Query: 283 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
+ +S E+ V + + A AGYP L ED + L +
Sbjct: 169 LHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEEDISRLEE 210
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 158 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
+S A + ++ V+LV R KAG+ I + G N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303
>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
troglodytes]
gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
Length = 440
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 10/117 (8%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
AE L +L L + +M+E+ G+ S W Y L RG+ + P+ W+ +L
Sbjct: 40 AETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSAQL 94
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
+L G+ + E AE ++ ++NE + L +P P T E + +A
Sbjct: 95 EHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146
>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
Length = 440
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
Length = 440
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 79/195 (40%), Gaps = 25/195 (12%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
E +AL +++E+ QG+ S P+I L ++ G+L E PL W E +L S T+
Sbjct: 87 GETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLFWGEADLTLADASSTR 141
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
G + +E D W ++ + F ++P P + F F+ A S +
Sbjct: 142 G-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDFRWAVGVALSRSFFVD 194
Query: 139 K-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
V A +L + P + + + AV L + Y+ GE V
Sbjct: 195 GELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLRAGKNYEEGEEFFVSY 248
Query: 192 GPQPNSKLLINYGFV 206
GP+ + L GFV
Sbjct: 249 GPKGAAGYLEENGFV 263
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 236
G I + GP N +LL YGFV E N +DRL EAA+ E + +R +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
L+ + H GR A ++ LR+ ++++ E ++ + GP + E V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTD 324
+AD + L + +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 10/120 (8%)
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
++FK + V S H + F L P + S + ++ +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 230
Y AGE ++ G PN KLL++YGF+ D+D D +++ NT Q QD
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSNTTRDQLQD 283
>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 493
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 25/203 (12%)
Query: 22 ACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
ACL L L++E+ G+ S + Y+ L + PL W+E E+ L G T AE
Sbjct: 133 ACLTVLRLLHERGLGESSPFHSYLSVLPQDH-------RLPLEWTEAEVGLLQG--TSAE 183
Query: 81 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 140
L A + ++ +V Q+P T F + V+S +
Sbjct: 184 PLVGAGSLDSQFEAFQSV-------VAQHPTVWEPSVCTKAAFAKGVNWVRS-----RGF 231
Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSK 198
++ ++P G + + +++ D V+ +P KAGE + G N++
Sbjct: 232 TVMGDPHMIP-GADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQ 290
Query: 199 LLINYGFVDEDNPYDRLVVEAAL 221
LL +YGFV N +D +++ L
Sbjct: 291 LLNSYGFVLPGNSFDTVLIPTQL 313
>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 66/166 (39%), Gaps = 27/166 (16%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 233
+ Y+ GE + + G N +LL NYGF +NPYD + + + N DP + KR
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306
Query: 234 VAQR-NGKLSVQVFHVHAGREKE----------------AISDMLPYLRLGYVSDTSEMQ 276
+ Q G S + + + R A + L + S +
Sbjct: 307 LLQSATGSHSTDIPALRSLRFNHDGQLAPNAEHWLEILLATPEELSEIITQAASQSGAAD 366
Query: 277 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
S IS P+ ++ V ++ ARL + +TL ED+A L
Sbjct: 367 STISLALPMS-----LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 92/224 (41%), Gaps = 21/224 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
LS L L +L+ EK G +S W PY+ L + A P+ E E+ L P
Sbjct: 104 LSPLLALCTFLVAEKHAGDRSVWKPYLEVLPK-------AYTCPVC-LEPEVVDLLPKPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
KA+ E+ ++ + + LF + I F++ F A+ V + V++
Sbjct: 156 KAKAREQRTRLQAFFTSSRDFFSSLRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYM 211
Query: 138 QKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
K R F+ P L P LL +S + +A ++ + E + +
Sbjct: 212 -KPRRRRCFSAEPDTYALAPYLDLLNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFI 270
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 233
GP N +LL+ YGFV NP+ + V + + DK+M
Sbjct: 271 CYGPHDNQRLLLEYGFVSIHNPHACVYVSKDILVKYLPSTDKQM 314
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 37/182 (20%), Positives = 83/182 (45%), Gaps = 12/182 (6%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
R YK E + ++ G + N++ +++ GFV ++N +D L ++ L+ D ++ KR + ++
Sbjct: 96 REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155
Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--------SDTSEMQSVISSLGPICPVSP 290
+ VF + + + + ++P + L V SD + + + + P P
Sbjct: 156 KIPTSDVFELKKAPDGDGV--LVPKVLLHLVHILQWKAPSDGTTSGTDVGA-DPSDATDP 212
Query: 291 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML-NACL 349
+ L + + P ++ E +L D + ++A + E++ML NAC
Sbjct: 213 VRTKKAKTFLHVRCQLLMKALPRSVEELTEILNDPTTSLESKLAIRYRLSEQRMLTNACN 272
Query: 350 QV 351
++
Sbjct: 273 KI 274
>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 514
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 154/373 (41%), Gaps = 67/373 (17%)
Query: 22 ACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
A LA +++ E G KS + P++R L R V+S + W+E EL+ L+GS
Sbjct: 130 AILAAHVLREAFDAGSKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVV 180
Query: 81 ILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVV 135
+ RA + EY+ LD +L +++P E +TF+ F A + S +
Sbjct: 181 VFTRAIKAQWRMEYDALDVP-----TLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAI 235
Query: 136 HLQKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
L S + + PLL A K + A +AV++ ++ +
Sbjct: 236 DLSTESA--EAPTIRVLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFN 293
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
+P+ L+ YGF+ E NP + VEA + D + R+ K + H
Sbjct: 294 YDTKPSQYFLLQYGFIPETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLD 344
Query: 251 GREKE------AIS-DMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-L 297
+++ +I D+L R+ D +EM S S + + +AV L
Sbjct: 345 PKKRNFEWKPRSIDYDLLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLL 403
Query: 298 DQLADYFKARLAGYPATLSEDEAMLT----DYNLHP----KKRVATQLVRMEKKMLNACL 349
LA + L Y TL+ED + + N P +KR A L+RM +K + L
Sbjct: 404 KSLASF----LESYTTTLAEDNEYVARVDDESNDEPLPGKRKRFAV-LLRMREKQI---L 455
Query: 350 QVTADMIML-LPD 361
+AD + LPD
Sbjct: 456 LASADALFKELPD 468
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 23/215 (10%)
Query: 60 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
S + +++ EL GS + + I+ +Y L LF QY FT
Sbjct: 107 SSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL------VERLFGQYLDIFSLGKFT 160
Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 176
E +K A V S + + L P +L +SS KC + D + ++
Sbjct: 161 IEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-MLNHSSDVKKCHVYDTSSGD-LSIL 218
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
+ Y+ G+ + + G PN++LL YGFV +NP D + E P ++ K+ +
Sbjct: 219 AGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWV 278
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 266
G SV + +++D LP YLR+
Sbjct: 279 SAGLDSVSTISL-------SLNDPLPKSVLQYLRI 306
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 42.7 bits (99), Expect = 0.28, Method: Composition-based stats.
Identities = 48/200 (24%), Positives = 84/200 (42%), Gaps = 17/200 (8%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
L++ L L+++YEK SFW P+ L + + +S TEL L G+
Sbjct: 828 LNDDTILFLFVIYEKGNAN-SFWRPFYDTLPSY-------FTTSIHYSATELLELEGT-- 877
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ E K++ N F L +QYP P F++E F A + S + L
Sbjct: 878 --NLFEETLHTKQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL 933
Query: 138 QKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
K+ + + LVP+ + +++ + + +++ A I + G
Sbjct: 934 -KIDGSIKSCLVPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQ 992
Query: 196 NSKLLINYGFVDEDNPYDRL 215
N +L + YGF+ +N YD L
Sbjct: 993 NWELALYYGFIIPNNIYDSL 1012
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 13/117 (11%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
A++L +++ L + +MYE GK+SFW Y+ EL ++ PL W+E E
Sbjct: 33 ADILEQHRIRGGLGLIIAIMYELSIGKESFWHGYLEELHKRE-------YLPLFWAEQER 85
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
+ L G+ + E E + ++ L +Q+ + ++FT E F+ A
Sbjct: 86 SLLQGTEAEHRPQEDEELTQEDFET------HVPPLVEQHADRLRADSFTLESFRVA 136
>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
Length = 553
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
D+ V +++ KAGE I G NS LL YGF EDNP+D + + L E Q
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHDVVDLSEEL-VEYGQK 377
Query: 229 QDKRM 233
KR+
Sbjct: 378 NSKRI 382
>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 467
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 78/192 (40%), Gaps = 13/192 (6%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L+L+ E + KSFW PYIR L + Q + Q A+ W + E L G+ + I +
Sbjct: 95 LFLIKEYLKRDKSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDK 152
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE--IFKQAFVAVQSCVVHLQKVS 141
++R+ E + + G + + TE + + IF + Q+ S
Sbjct: 153 IRNDVRRDLQEAQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRS 212
Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVVWCGPQ 194
L R + L + M + DD Q L V R + G+ + +
Sbjct: 213 LPRGVTMDDFSVLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMK 272
Query: 195 PNSKLLINYGFV 206
N++LL+ YGF+
Sbjct: 273 TNAELLLGYGFM 284
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
R A++P+ C+A A+ + + DR Y+AGE + + G LL YG
Sbjct: 179 RLAILPVADLFNHADVGCEAQFAS--ENYSFIADRTYRAGEELYISYGTHSTDFLLAEYG 236
Query: 205 FVDEDNPYDRLVVEAAL 221
FV +N +D + ++ A+
Sbjct: 237 FVPAENRWDVVCLDEAI 253
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 15/120 (12%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L ++L+ E ++ ++SFW PYIR L + + ++ L ++ E L GSP +LE
Sbjct: 146 LVIHLIQETQKQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE 194
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
E I N L F+ F + P + T FT+E F+Q +A +V L K LA
Sbjct: 195 --ESINTYRNTLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 27/192 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
LAL ++ E+ +G+ + W PYI L + +++ W +TEL+YL SP + E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKVSL 142
R E I E+ ++ F L Q +++ Q F V + H+
Sbjct: 221 RLEMITTEFGQVQND-FCTCVLEQ-----------ALDVWPQLFGKVSLEDLKHVYATVF 268
Query: 143 ARRFALVPLGP---PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
+R A+ P+L + +S K + + + DR Y + I + G
Sbjct: 269 SRSLAIGEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 328
Query: 195 PNSKLLINYGFV 206
N++L ++YGF
Sbjct: 329 SNAELALDYGFT 340
>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
Length = 440
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
A+ E+ ++ + + LF + I F++ A+ V + V+L+
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 139 ---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRHRECLSAELDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
Length = 440
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ + + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
Length = 195
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTED 225
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 14 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 96/240 (40%), Gaps = 24/240 (10%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N ++ L L+ M E+ F PY L G L + LA L G+
Sbjct: 190 NNITSETMLLLWSMRERYNLGSKF-KPYFDTLPANFNTG-------LSFGIDALAALEGT 241
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
EI++ + ++++Y+EL + L +P + T++ F A S +
Sbjct: 242 LLFDEIIQARQHLRQQYDELFPL------LCTNFPEIFRKDVCTWDDFLWACELWYSNSM 295
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCG 192
+ S LVP+ L S VD+A + + + RP AGE + G
Sbjct: 296 MIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYG 355
Query: 193 PQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
P S LL YGF+ DNPYD + ++ D D+ + AQ + S Q H+ G
Sbjct: 356 KHPGSHLLTFYGFLPRGDNPYDVIPLDL-----DTSADDEDITAQSSATTS-QTTHMVRG 409
>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
boliviensis]
Length = 440
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 94/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q+ L+ L P LL +S + KA ++ ++ E + +
Sbjct: 213 PRQQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
Length = 256
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 94/230 (40%), Gaps = 23/230 (10%)
Query: 21 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
L L +L+ EK G +S W PY+ L + A P+ E ++ L P KA+
Sbjct: 10 LCTLCTFLVSEKHAGDQSLWKPYLDILPK-------AYTCPVC-LEPKVVNLFPEPLKAK 61
Query: 81 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HL 137
E+ ++ ++ + LF + +I F++ A+ V + V H
Sbjct: 62 AEEQRARVQGFFSSSRDFFSSLQPLFSEAVENI----FSYSALLWAWCTVNTRAVYMKHG 117
Query: 138 QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
Q+ + L P LL +S + KA ++ + E + + GP
Sbjct: 118 QRKCFSPEPDTYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFICYGP 177
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
N +LL+ YGFV NP+ + V A L + DK+M N K+S+
Sbjct: 178 HDNQRLLLEYGFVSIQNPHACVYVSADLLVKYLPSTDKQM----NKKISI 223
>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 356
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 69/168 (41%), Gaps = 22/168 (13%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
P W L G T +LE+ + R++ L + YPY +P+E +
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150
Query: 120 FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 176
F V+ ++ Q+ L R A++P+ S CK A ++ +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
DR Y G+ + G N LL YGF+ ++N DR + +++E
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253
>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 421
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 85/193 (44%), Gaps = 20/193 (10%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+VPL L A + + A L D A+ + +P K G+ I G P S LL YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSDLLRRYGYV 289
Query: 207 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
DN VVE L N E +Y +++ + ++ + + +++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILEDGYCILRPSAEDS 346
Query: 257 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
++D+LP L + E+Q + S P P+ E + L D K++L+ Y
Sbjct: 347 LTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI---LLDSVKSKLSQYG 403
Query: 313 ATLSEDEAMLTDY 325
T+ +D+A+L +
Sbjct: 404 TTVEQDKAILQQF 416
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 22/171 (12%)
Query: 56 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 115
LA PL WS L P +A L RA+ K + W + F
Sbjct: 104 LATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP-------- 148
Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVD 170
A + F+ A++ + + + AR R L P+ L +++ +A
Sbjct: 149 -ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAFTP 206
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+ + DR Y GE +++ G N LL+ YGFV E N +D + ++ A+
Sbjct: 207 ASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 52/230 (22%), Positives = 97/230 (42%), Gaps = 36/230 (15%)
Query: 25 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 83
A++L+ + G++SFW PYI+ L + A+ PLLW E++L +L G+ + + +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF---------------- 127
+ + +KR W A Q+Y +D P++ FT E+ A+
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPD 218
Query: 128 -VAVQSCVVHLQKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
++ HL K + L P L L ++ + D + + + K G
Sbjct: 219 VANIKPEFQHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKNIELKPG 278
Query: 185 ESIVVWCGPQP----NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 230
+ + + N+ LL ++GF+ DN LV+ + + P + D
Sbjct: 279 DPFYIAYDKETQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHLD 328
>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
Length = 319
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
+ P +L Y K V +++ + RP KAGE + G P S L+ YGF+ DN
Sbjct: 166 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 220
Query: 211 PYDR--LVVEAALNTED 225
PYD L ++ +++ ED
Sbjct: 221 PYDVIPLDLDTSVDEED 237
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 147 ALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
A+VPL +L +S S+C A+ + + +++V RP + GE I + G N L I YG
Sbjct: 186 AIVPL-IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYG 244
Query: 205 FVDEDNPYDRLVVEAALNT 223
F +DN D+ VE +L +
Sbjct: 245 FYLKDNICDK--VEISLGS 261
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 28/46 (60%)
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
LV D P AG+ + GP+ N + L+ YGF+ +NP D +V++ A
Sbjct: 225 LVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270
>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
rotundata]
Length = 277
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTE-DPQYQDKRMVA 235
R +K G+ I + GP+PNS ++ GFV D+ +D + V + L + DP ++R +
Sbjct: 80 RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139
Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 266
++ F V++GRE S +L Y+R+
Sbjct: 140 KKLHLQPWSEFVVNSGREPIPGS-VLAYMRV 169
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 105/281 (37%), Gaps = 46/281 (16%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L + ++YE +G+ S W PY + L + + W++ ELA L S +I
Sbjct: 98 LIVTIIYEYLRGESSPWHPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGR 150
Query: 84 R-AEG------------------IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 124
R AE + +L + MAGSL Y +DI + +
Sbjct: 151 RQAEEEWQNTIIPTMADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDN 210
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
A + + +VP L A + K A L D + + +P AG
Sbjct: 211 DKDGADSADDEFEEDDEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAG 270
Query: 185 ESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
E I GP P S LL YG+V D YD + L E VA ++ K
Sbjct: 271 EQIFNDYGPLPRSDLLRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKD 321
Query: 244 QVFHVHAGREKEAISDMLPYLRLGYV-----SDTSEMQSVI 279
QV+ RE+E D L L GY DT +Q V+
Sbjct: 322 QVW-----REREQQLDELGVLDDGYAITRPEYDTQGLQDVL 357
>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
Y34]
Length = 464
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 3/129 (2%)
Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 164
L Q+ P E FT E +K A V S + L P +L +S K
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204
Query: 165 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
A + + ++ + Y+AG+ + ++ GP NS+LL YGFV N D + A +
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSNDNYDLVLATH 264
Query: 223 TEDPQYQDK 231
E P + K
Sbjct: 265 PEAPFFARK 273
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 83/207 (40%), Gaps = 19/207 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ E G KS W PY+ L + + L+ E ++ L P
Sbjct: 104 ISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TCLVCLEPQVVRLLPKPL 155
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
K + E+ ++ + + SLF + I F + F A+ + + V
Sbjct: 156 KIKAQEQKTQVQELFVSSRGFFSSLQSLFTEDVKHI----FHYHAFLWAWCTINTRTVYM 211
Query: 136 -HLQKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
H QK L+ + L P LL +S + A ++ K E + +
Sbjct: 212 KHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYEIRTTSSCKKYEELFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N +LL+ YGFV +NP+ + V
Sbjct: 272 YGPHDNHRLLLEYGFVAINNPHSAVYV 298
>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
Length = 437
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 225
+++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 79/168 (47%), Gaps = 15/168 (8%)
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
G + +++ L+ GFV NP++ + + ++ D + + + S + H+
Sbjct: 63 GKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLILHITG- 121
Query: 252 REKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVLDQLAD 302
+ ++SD + + R+ +V D+ ++ + +S P+CP P ++A+ L
Sbjct: 122 -DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA-FLIM 178
Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
F+ ++ Y +SEDE NL P +R +L E ++L +C++
Sbjct: 179 RFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 48/209 (22%), Positives = 84/209 (40%), Gaps = 24/209 (11%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
++L+ LA++LM EK + +KSFW PYI L + P ++E + L S
Sbjct: 99 SQLTPSQALAVFLMCEKCRREKSFWRPYIDILPEE-------YTCPAFFTEEDFRLLPNS 151
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--- 132
+ + + +E+ EL + M LF +AF F+ FK A+ A+++
Sbjct: 152 -LRGKAKAKKYECHKEFMELAPFFKMLADLFPDQ-----EDAFNFKDFKWAWSAIKTRAF 205
Query: 133 -------CVVHLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
L+ + PL + A +K + ++ + Y+
Sbjct: 206 DVPLGGETCYRLRDSEDTSNPTMFPLVDSINHAAQAKIRHRYNEKRRCLESRTETVYRRH 265
Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
++ G N LL+ +GFV NP D
Sbjct: 266 AEVMNSYGRADNDNLLLEFGFVVPGNPAD 294
>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
Length = 510
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 34/66 (51%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
++VPL L A + KC A L ++++ R +GE + G NS+LL YG+V
Sbjct: 238 SMVPLADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYV 297
Query: 207 DEDNPY 212
+ D Y
Sbjct: 298 EWDGSY 303
>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
Length = 504
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 11/121 (9%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
++E +AL L+ E+ G SFW PYI L V L+W E +LA L SP
Sbjct: 192 INEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----EVNPTLVWPEGDLALLEASPL 246
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
A + E+ L+ + A S +D FTFE + AF+ + S + +
Sbjct: 247 VAATRSLKRKLAAEFALLEEQYMRARSDV----FD--PSVFTFEAYLWAFINIFSRAIRV 300
Query: 138 Q 138
+
Sbjct: 301 K 301
>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
Length = 1709
Score = 42.0 bits (97), Expect = 0.54, Method: Composition-based stats.
Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 19/88 (21%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
++ LALYLMYEK +G++SFW PY EL+++ + L WS ELA S
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLF 106
I +E N+ + ++F+ S+
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 83/199 (41%), Gaps = 7/199 (3%)
Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
+ ++ + Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298
Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSP 290
+ G S + A + ++L YLR + D +++ + L G V+
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVND 355
Query: 291 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 348
E VL L D + L G+ L + EA L Y A Q+ E+ +L
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415
Query: 349 LQVTADMIMLLPDVTVSPC 367
+ DM+ + + C
Sbjct: 416 KKTAEDMLAAVTRPLATQC 434
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 57/263 (21%), Positives = 108/263 (41%), Gaps = 26/263 (9%)
Query: 59 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 118
S + +++ EL GS A I+ +Y +L LF ++ P++ F
Sbjct: 106 SSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARLFGRHRDVFPSDKF 159
Query: 119 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQL 175
T + +K A V S + K+ L+ +L +S +C + + +
Sbjct: 160 TIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQCHVYDPQSGN-LSI 217
Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 235
+ + Y+ G+ + + GP PN++L YGFV NP D + + + P ++ K +
Sbjct: 218 LAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPNDSYDLVLSTHPMAPFFEQKHKLW 277
Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPIC--PV 288
G S + ++D LP YLR+ +++T ++ +V + + +
Sbjct: 278 IAAGLDSTSTVSL-------TLTDPLPRSVLRYLRIQRLNET-DLAAVGTRQSDVAFEKI 329
Query: 289 SPCMERAVLDQLADYFKARLAGY 311
S E VL L + A L G+
Sbjct: 330 SDSNETEVLTFLVESISALLDGF 352
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 88/234 (37%), Gaps = 49/234 (20%)
Query: 23 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI- 81
L L ++YE G S W PY L + + + WSE ELA L S +I
Sbjct: 85 SLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWSEDELAELQASAVLNKIG 137
Query: 82 ------------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPY 111
+RAE ++ E N L + GSL Y +
Sbjct: 138 KEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNVL--LMHKMGSLIMAYAF 195
Query: 112 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 171
D+ A + + + A + L K ++PL L A + A L +
Sbjct: 196 DVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADMLNADADCNNARLFYEEK 247
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTE 224
+++ +P +AGE + GP P S LL YG+V D YD + + L TE
Sbjct: 248 YLEMKALKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYDVVEINMDLVTE 301
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
Length = 494
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A +SKC A L +++++ R + E + G PNS++L YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279
Query: 207 DED-NPYD 213
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
Length = 573
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%)
Query: 160 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
SK + L DD V +V R GE I + GP PN+ LL GF DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388
>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
Length = 533
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 53/229 (23%), Positives = 97/229 (42%), Gaps = 36/229 (15%)
Query: 20 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
E A L+L +YE SFW P+ + R+ ++ W + L L + +
Sbjct: 146 EEAKLSLLFLYEYFD-PDSFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRD 197
Query: 80 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
I + I+ EY++LD +L ++P + F+++ FK AF+ + S + +
Sbjct: 198 AIRQLEALIEYEYDQLDL-----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-S 251
Query: 140 VSLARRFALVPL-------GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRP 180
V+ A ++P G +A+S +A A+ DD+V+ + +
Sbjct: 252 VNNAPCTVMIPFVDFFNHNGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNET 311
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN---TEDP 226
+ GE + + N LL++YGF N +D +V + T DP
Sbjct: 312 FLPGEQMFLNYKAHSNEVLLLHYGFALPHNEHDTFLVRLHFDREKTNDP 360
>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3-like [Glycine max]
Length = 449
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 221
D +++V + K + +++ G N L++YGFV NPYD L + AL
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291
Query: 222 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 274
N P + +++Q N V G ++ +L LR+ ++
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351
Query: 275 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
MQ S++ SL P+ E AV L L +P + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 94/236 (39%), Gaps = 30/236 (12%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
A+VP+ L A A L + A+++ R + G+ I G PNS LL YG V
Sbjct: 261 AMVPMADLLNARYGSSNAKLFYEEHALEMRTTRFIRRGDQIWNTYGDPPNSDLLRRYGHV 320
Query: 207 DE--------DNPYDRLVVEAALNTEDPQYQDKRM-VAQR------NGKLSVQVFHVHAG 251
D NP D + V A L T+ + VA+R G V V
Sbjct: 321 DLVPLAQGGLGNPADVVEVRADLVTDVVSASGSSIPVAERIDWYLEMGGDDVFVLETDLD 380
Query: 252 REKEAIS--DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
+ I +L L + T E + P P M+ AV+ L + RL
Sbjct: 381 IPEPLIVLVRLLQLLEPDW-EKTREKEK---------PPKPKMDGAVIGVLVEVLHRRLK 430
Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
YP T+ EDEA+L + + +VRM +K++ LQ T + + L +V
Sbjct: 431 EYPTTIDEDEALLYKEDTLSINKKNAIIVRMGEKII---LQRTLESLQRLASQSVG 483
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 83/194 (42%), Gaps = 31/194 (15%)
Query: 34 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 93
+ K+S+W PY L + S LL ++L S K E +++ + I +Y
Sbjct: 51 RDKQSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY- 102
Query: 94 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 153
V + +Q P DI T E FK A++ V + +H+ + + L P
Sbjct: 103 ----VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAP 153
Query: 154 PL--LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYG 204
L L ++++ K ++ + R YK GE + + GP N +L YG
Sbjct: 154 MLDFLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYG 208
Query: 205 FVDEDNPYDRLVVE 218
FV +N Y+ ++++
Sbjct: 209 FVLNENIYNFVLLD 222
>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
Length = 415
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 23/236 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L P K
Sbjct: 81 SPLLALCTFLVSEKHAGDQSLWKPYLEILPK-------AYTCPVC-LEPEVVNLFPKPLK 132
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
A+ E+ ++ ++ + LF + I F++ A+ V + V
Sbjct: 133 AKAEEQRARVQGFFSSSRDFFSSLQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVK 188
Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
H Q+ + L P LL +S + KA ++ + E + +
Sbjct: 189 HRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY 248
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+ H
Sbjct: 249 GPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISILKDH 300
>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
Length = 559
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)
Query: 57 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
A + L + + L G+ E+++ E ++++YNEL F A L +P P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 168
+++E F A S + + R LVP L P +L Y + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367
Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 225
D+++ + RP +AGE + G S L+ YGF+ E DN D + ++ +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 80/184 (43%), Gaps = 31/184 (16%)
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 236
G I + GP N +LL YGFV E N +DRL E + E + +R +VA
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
L+ + + H GR A ++ LR+ ++++ E + + GP + E V
Sbjct: 419 YGLHLAGRCWICHDGRPPPA---LVALLRV-HLAEAEEFDT-MERKGPFASLGAGTEARV 473
Query: 297 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 341
+ +AD + L + +L EDE +L D N P KR+A L+RM
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532
Query: 342 KKML 345
K +
Sbjct: 533 MKRI 536
>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Galdieria sulphuraria]
Length = 546
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 70/307 (22%), Positives = 118/307 (38%), Gaps = 64/307 (20%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
+ LS+ +A L+ E + + SFW P+I L L+WS E YL S
Sbjct: 137 SDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDELHHF-----LVWSTAETQYLESS 191
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQ--YP-YDIPTEAFTFEIFKQAFVAVQS 132
T +IL E + EL+T LF + YP YD+ + FT F A VQS
Sbjct: 192 STFEDILSLRETASLVFEELNT------ELFPKFLYPQYDV--KYFTLPYFTWALSIVQS 243
Query: 133 CVVH-----------------------LQKVSLARRF------ALVPLGPPLLAYSSKCK 163
++ + + S R++ +L+ +GP ++ + +
Sbjct: 244 FGLYDIMDSCPLVIVPGLEWLTYKYSLITEESFFRQYFHISNVSLIRVGP---FFTQERR 300
Query: 164 AMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--PYDRLV 216
+ A +D V LV + ++ W ++ G +DE+
Sbjct: 301 LKITASEDLKVGEPVSLVYEGNVSLIDTFCRWGWK-------LDLGALDEEQLLKMGSYE 353
Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-- 274
+ A+ T D + DK + +Q F + KE + +LP+LRL + D
Sbjct: 354 ISFAVTTTDQFFDDKEDILDAQRLELLQTFELRYDMSKELLQRILPFLRLICLKDKDSFI 413
Query: 275 MQSVISS 281
++SV S
Sbjct: 414 LESVFRS 420
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 35/60 (58%)
Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
+ ++ + Y+A + + ++ GP PNS+LL YGFV DNP D + + + P Y+ K+
Sbjct: 216 LSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHPLAPFYEQKQ 275
>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
Length = 559
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)
Query: 57 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
A + L + + L G+ E+++ E ++++YNEL F A L +P P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 168
+++E F A S + + R LVP L P +L Y + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367
Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 225
D+++ + RP +AGE + G S L+ YGF+ E DN D + ++ +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425
>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
Length = 712
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
RP +AG+ + + GP PN KLL YGFV NP+D +V L + + +++ A
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501
Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 279
L ++ H+ ++ +L LRL V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537
>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
Length = 370
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 95/226 (42%), Gaps = 35/226 (15%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
++++L +E +Q SFW PY++ L L + LL +Y+T +A+I++
Sbjct: 86 VSIFLFFESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQ 134
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVH 136
E ++R + FQ Y P+ D F+ F + + AV S + +
Sbjct: 135 MVESVRR----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICY 184
Query: 137 LQKVSLARRFALVPLGPPLL--AYSSKCKAMLAAVDDAVQLV-VDRPYKAGESIVVWCGP 193
L ALVPLG A + + A + + R + G + V G
Sbjct: 185 LPDDIAG---ALVPLGDIFNHEAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYGA 241
Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
N++L++ YGF DNP+D L E ++ + R+V R G
Sbjct: 242 LSNTELMMYYGFTLNDNPWDTLSFYPHELDESIKFYE-RVVLDREG 286
>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
melanoleuca]
Length = 440
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 23/236 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSLWKPYLEILPK-------AYTCPVC-LEPEVVNLFPKPLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
A+ E+ ++ ++ + LF + I F++ A+ V + V
Sbjct: 157 AKAEEQRARVQGFFSSSRDFFSSLQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVK 212
Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
H Q+ + L P LL +S + KA ++ + E + +
Sbjct: 213 HRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+ H
Sbjct: 273 GPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISILKDH 324
>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
6054]
gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 453
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 54/238 (22%), Positives = 92/238 (38%), Gaps = 42/238 (17%)
Query: 7 YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS- 65
Y + EL L+ L+LYL +E+++ SFW P++ L +PL+W
Sbjct: 111 YKSLELQDLLALTSFQLLSLYLCFERQRIHSSFWKPFLEMLPDISDFSL----NPLIWQV 166
Query: 66 ------ETELAYLTGSPTKAEILERAEGIKREYNE------------LDTVWFMAGSLFQ 107
E + +L S + RAE + + E LD + S +
Sbjct: 167 LQVDQWEELIQFLPESAKR-----RAEDVYERFLEDYVVVRALVSRILDDLKLSESSADE 221
Query: 108 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKA 164
P D +F A++ + S +++ Q + A F + P L +S +
Sbjct: 222 YIPVD---------LFLWAWMCINSRCLYMTIPQGKTNADNFTMAPY-VDFLNHSCNDEC 271
Query: 165 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 221
+ + PY G+ + + GP N LL YGFV DN ++ L + A +
Sbjct: 272 SILIDTTGFHVRTTTPYMPGDQLFLSYGPHCNEFLLCEYGFVIPHDNKWNDLDISAYI 329
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 90/214 (42%), Gaps = 33/214 (15%)
Query: 20 ELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
ELA L L ++YEK QG SFW P I L G + WSE EL L K
Sbjct: 91 ELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDESLK 142
Query: 79 AEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
AE + +++ Y L + G +F + +T+E F+ A + V+S
Sbjct: 143 AEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRTFG- 194
Query: 138 QKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGESIVV 189
RF ++VP L + + + + D ++ GE +
Sbjct: 195 -------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEEAFM 247
Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 223
GP+ N++LL++YGF + N Y+ + + +NT
Sbjct: 248 SYGPRSNAELLLHYGFALQSNRYEAVELNFRINT 281
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 41.2 bits (95), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 82/390 (21%), Positives = 143/390 (36%), Gaps = 61/390 (15%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
+ L + CL L ++YE QG S W PY + L ++ + W++ EL L
Sbjct: 84 INERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTD-------FDTLMFWTDAELLEL 136
Query: 73 TGSPTKAEILERA----------------------EGIKREYNELD------TVWFMAGS 104
GS I + A G YN D ++ GS
Sbjct: 137 KGSAVLGRIGKSAAEEVFLRDLLPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGS 196
Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 164
L Y +D+ + + V ++ L + ++PL L A + + A
Sbjct: 197 LIMSYAFDVENDE------AEEVEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNA 248
Query: 165 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE----- 218
L D + + + + GE I G P ++LL YG+V D YD V
Sbjct: 249 RLFQEDGYLAMKSIKSIRKGEEIFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTIC 308
Query: 219 --AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
A L + P + R+ + ++ + + +++ LP L ++
Sbjct: 309 RVAGLKSSTPGPDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPL 368
Query: 277 SVISSLGPICPV-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHP--- 329
+ L V P + A L + + L YP T+++D+ +L +Y
Sbjct: 369 EQFNQLKQKSKVPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPI 428
Query: 330 ---KKRVATQLVRMEKKMLNACLQVTADMI 356
+ ++A Q+ + EK++LNA L D I
Sbjct: 429 SAGRLKMALQVRKGEKEILNAVLSELEDFI 458
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 66/137 (48%), Gaps = 9/137 (6%)
Query: 192 GPQPNSKLLINYGFVDEDNPYDRL--VVEAALNTEDPQYQDKR--MVAQRNGKLSVQVFH 247
GP N +LL YGFV E+N +DRL + ++ +R +VA L+ + +
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429
Query: 248 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
H GR A ++ LR+ ++++ E ++ + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484
Query: 308 LAGYPATLSEDEAMLTD 324
L + +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501
>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
Length = 537
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 51/229 (22%), Positives = 91/229 (39%), Gaps = 28/229 (12%)
Query: 17 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
K+ + C + E+ +G+KS W YI L + ++PL +++ EL L +
Sbjct: 74 KMHTIVCT--FFALERLKGEKSQWAKYIEYLPK-------TFDTPLYFTDDELKSLEHTN 124
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
+R K E+ A L D P + F++ ++ A S
Sbjct: 125 IFYGCNDRKRIWKEEHA-------TAAKLL-----DNP-DNFSWNMYLWAATVFSSRCFS 171
Query: 137 LQKVSLARRFALVPLGPPL---LAYSSKCKAMLAAV---DDAVQLVVDRPYKAGESIVVW 190
+ P+ PL L + +C + V AVQLV +P +G +
Sbjct: 172 SALLGEEDTDDAAPILIPLVDSLNHKPRCPIIWNKVTKESHAVQLVSVKPISSGGQVYNN 231
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
GP+ N +LL+ YGF +N ++ + +L+ + KR + +G
Sbjct: 232 YGPKGNEELLMGYGFCLPNNEFETFALRLSLDKAVYNSEKKRSILASHG 280
>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
Length = 439
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 85/201 (42%), Gaps = 19/201 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L +P K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
+ E+ ++ ++ + LF + I F++ + A+ V + V+++
Sbjct: 157 TKAWEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMK 212
Query: 139 K---VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWC 191
+ + L+ L P LL +S + A ++ + + G+ + +
Sbjct: 213 RPPQLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY 212
GP N +LL+ YGFV NP+
Sbjct: 273 GPHDNHRLLLEYGFVSVSNPH 293
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L + + C A L + +++ +P +AGE + G PNS++L YG+V
Sbjct: 221 SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEHPNSEILRRYGYV 280
Query: 207 D 207
+
Sbjct: 281 E 281
>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 498
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 91/205 (44%), Gaps = 18/205 (8%)
Query: 35 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 94
G + W YIR + ++ P ++E EL L G+ + + + +++E+
Sbjct: 129 GISNPWTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFER 181
Query: 95 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---L 151
L Q+Y +D T TF+ +K +S VV L + A+VP +
Sbjct: 182 LRQS-TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDM 236
Query: 152 GPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFV-DE 208
S K + ++ +DA+ QL R +GE + + G + P S+++ +YGFV +E
Sbjct: 237 ANHASEDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENE 296
Query: 209 DNPYDRLVVEAALNTEDPQYQDKRM 233
++ ++ + +DP K+M
Sbjct: 297 RTDAKQIFLDLEIPDDDPLRMAKQM 321
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 83/390 (21%), Positives = 144/390 (36%), Gaps = 61/390 (15%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
+ L + CL L ++YE QG S W PY + L ++ + W++ EL L
Sbjct: 84 INERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTD-------FDTLMFWTDAELLEL 136
Query: 73 TGS--------PTKAEILERA--------------EGIKREYNELD------TVWFMAGS 104
GS T E+ R G YN D ++ GS
Sbjct: 137 KGSAVLGRIGKSTAEEVFLRDLLPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGS 196
Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 164
L Y +D+ + + V ++ L + ++PL L A + + A
Sbjct: 197 LIMSYAFDVENDE------AEEVEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNA 248
Query: 165 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE----- 218
L D + + + + GE I G P ++LL YG+V D YD V
Sbjct: 249 RLFQEDGYLSMKSIKSIRKGEEIFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTIC 308
Query: 219 --AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
A L + P + R+ + ++ + + +++ LP L ++
Sbjct: 309 RVAGLKSSTPGPDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPL 368
Query: 277 SVISSLGPICPV-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHP--- 329
+ L V P + A L + + L YP T+++D+ +L +Y
Sbjct: 369 EQFNQLKQKSKVPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPI 428
Query: 330 ---KKRVATQLVRMEKKMLNACLQVTADMI 356
+ ++A Q+ + EK++LNA L D I
Sbjct: 429 SAGRLKMALQVRKGEKEILNAVLSELEDFI 458
>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
UAMH 10762]
Length = 633
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 92/230 (40%), Gaps = 27/230 (11%)
Query: 26 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILER 84
YLM + ++SFW PY+ L +PL + + +LA+L G+ +L R
Sbjct: 87 FYLMTQYLNKEQSFWKPYLDVLPSPS-----EFSTPLWFDAPADLAWLDGTDVLHTMLAR 141
Query: 85 AEGIKREYNELDTVWFMAGSLFQQYPYDI---PTEAFTFEIFKQAFVAVQS---CVVHLQ 138
E + Y V +G Y +D+ FT F + Q+ VH
Sbjct: 142 REVYAQYYQSGLKVLSESGIDVTLYTWDLFRWAITTFTSRSFTSRVLLPQNRKYWPVHRT 201
Query: 139 KVSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKA 183
+ R+ L+ + P L + + A + DA Q + +P +A
Sbjct: 202 STNGRRQTVLLDMSHSPAEDLDFSVLFPGLDSGNHDPNAQVDWSFDANQFSIALVQPIEA 261
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 233
G + GP+ N +LL+ YGF +NP D +++ E Q + KR+
Sbjct: 262 GAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLLTLKAPPEALQVELKRI 311
>gi|145485580|ref|XP_001428798.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395886|emb|CAK61400.1| unnamed protein product [Paramecium tetraurelia]
Length = 331
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 1/69 (1%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
LVP G +L + KC+ + + + + + E I + G NS LL YGF
Sbjct: 186 CLVPFGD-MLNHHDKCQTKQKIIGTDLVFITTKQIQENEEIYNFFGEHGNSFLLCWYGFT 244
Query: 207 DEDNPYDRL 215
++N YD+L
Sbjct: 245 YDNNIYDKL 253
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 25/111 (22%), Positives = 50/111 (45%), Gaps = 12/111 (10%)
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
++++ + AL P+ L C+ + + DR YK GE + + G N
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251
Query: 198 KLLINYGFVDEDNPYDRLVVEAA----LNTEDPQYQDKR------MVAQRN 238
L++ YGF E+N +D + ++ ++T ++ D R ++ +RN
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEEMSTARKKWLDGRDFLGKYLIDERN 302
>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
DL-1]
Length = 499
Score = 40.8 bits (94), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 87/214 (40%), Gaps = 28/214 (13%)
Query: 11 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
E+L T L++ L L L YE G++S W Y+ L + S + WS EL
Sbjct: 77 EVLET--LNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFWSSEELE 127
Query: 71 YLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQQYPYDI 113
L S I E+AE G K+ EY +D + + S+ Y +D+
Sbjct: 128 KLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIMSYSFDV 186
Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
E + K + ++VPL L + ++ A L+ +DA+
Sbjct: 187 DDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLSYENDAL 245
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 207
+ + K GE I G PNS++L YG+V+
Sbjct: 246 VMTATKDIKKGEQIYNIYGELPNSEILRKYGYVE 279
>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
Length = 440
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 96/236 (40%), Gaps = 23/236 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
A+ E+ ++ ++ + LF + I F++ A+ V + V
Sbjct: 157 AKAEEQRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVK 212
Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWC 191
H Q+ + L P LL +S + + A ++ ++ + E + +
Sbjct: 213 HRQRQCFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+ H
Sbjct: 273 GPHDNQRLLLEYGFVSIHNPHACVYVSEDILVKYLPTTDKQM----NKKISILKDH 324
>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 377
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 2/76 (2%)
Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
A R ALVP+ + C A D + DR Y G+ I + G N LL
Sbjct: 177 ADRLALVPIADLFNHADTGCGASFTP--DGFVVSTDRKYHVGQEIYISYGTHTNDLLLAE 234
Query: 203 YGFVDEDNPYDRLVVE 218
YGFV N +D+ ++
Sbjct: 235 YGFVPMANRWDKTCLD 250
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 96/240 (40%), Gaps = 24/240 (10%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N ++ L L+ M E+ F PY L G L + LA L G+
Sbjct: 221 NNITSETMLLLWSMRERYNLGSKF-KPYFDTLPANFNTG-------LSFGIDALAALEGT 272
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
EI++ + ++++Y+EL + L +P + T++ F A S +
Sbjct: 273 LLFDEIIQARQHLRQQYDELFPL------LCTNFPEMFRKDVCTWDDFLWACELWYSNSM 326
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCG 192
+ S LVP+ L S VD+A + + + RP AGE + G
Sbjct: 327 MIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYG 386
Query: 193 PQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
P S L+ YGF+ DNPYD + ++ D D+ + AQ + S Q H+ G
Sbjct: 387 KHPGSHLVTFYGFLPRGDNPYDVIPLDL-----DTSVDDEDIAAQSSATTS-QTTHMVRG 440
>gi|145354661|ref|XP_001421597.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581835|gb|ABO99890.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 341
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 126/320 (39%), Gaps = 30/320 (9%)
Query: 49 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG-----IKREYN-ELDTVWFMA 102
R+R +G ++ +P + S E T R EG REY + + W A
Sbjct: 2 RERAKGGVSAYAPFVESLYEHTPARAVETSRAARARLEGHAAAETMREYERDAEDGWRAA 61
Query: 103 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 162
F+ +P FT F++A V++ + R ALVP+ LL +
Sbjct: 62 RRTFETFPSIFSVHEFTRAAFEEALAIVRANSFEARSEDGTRARALVPMAHLLLHDTGSE 121
Query: 163 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG----FVDEDNPYDRLVVE 218
+ VD + VD ++ G+ + G +++ +G + E N ++ ++
Sbjct: 122 VPCVKIVDGVFVINVD-EHEEGDELSCSHGDYSDAETFARFGVSAFYSAEKNARNK--IK 178
Query: 219 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 278
A +E Y K + R G + F AG A + + LRL ++T E ++
Sbjct: 179 FAFPSE--IYSMKSL--DRCGSVENIAF-TDAG----ATEEFMCALRLASANET-EWAAI 228
Query: 279 ISSLGPI-----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKK 331
S + P+S E AV + L L YP++ +EDE +L L P +
Sbjct: 229 SKSKASVRALRKKPLSEESEIAVYEALFATLTELLNSYPSSDNEDERLLQSRTLQSAPDE 288
Query: 332 RVATQLVRMEKKMLNACLQV 351
A + EK++ + L
Sbjct: 289 ERAVTIRLREKRLALSSLNA 308
>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
Length = 486
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 170 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)
Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 227
L D Y+AGE I + G N+KLL+ YGF NP D VE + T DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333
Query: 228 YQDKRMVAQR-----NGKLSVQVFHV 248
+ R+ ++ + +L Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359
>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
Length = 440
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 94/240 (39%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q L+ L P LL +S + KA + ++ ++ E + +
Sbjct: 213 PRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 85/196 (43%), Gaps = 21/196 (10%)
Query: 24 LALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
L L L+YE K + +KS W Y L A+ + W+E EL +L S I
Sbjct: 89 LILCLLYEWKGKKEKSKWWKYFNVLPSSN-----AMNGLMYWNEQELEHLRPSLVLDRIG 143
Query: 83 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE---IFKQAFVAVQSCVVHLQK 139
+++ K Y+++ T+ + S F + ++ E F + I +F L +
Sbjct: 144 KKS--AKNMYHKVLTL--VKESKFPEVLCNVEWEDFVYAASVIMAYSFDVENGESQTLNE 199
Query: 140 VSLARRF--------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
+ +++PL L + + +C A L D +++ +P K GE +
Sbjct: 200 EDDDQDEEENTGYIKSMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIY 259
Query: 192 GPQPNSKLLINYGFVD 207
G PN+++L YG+V+
Sbjct: 260 GNHPNAEILRRYGYVE 275
>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
Length = 494
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 95/216 (43%), Gaps = 32/216 (14%)
Query: 17 KLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-- 73
+L L L ++YE K G++SFW Y++ L + + L + WS +LA L
Sbjct: 84 ELGHWEGLILCMVYEIKVMGQQSFWWNYLQVLPKSQDLNTL-----VYWSADQLAALEPS 138
Query: 74 ------GSPTKAE----ILERAEGIKREY-NELDTVWF----MAGSLFQQYPYDIPTEAF 118
G+ E IL+ + E+ +++ + F S+ Y +D+ +
Sbjct: 139 LVVGRLGADESQEMYRQILKYIQNFGPEFQSKIGQLTFEEFVHVASVIMSYSFDVDLKGE 198
Query: 119 TFEIFKQAFVAVQ----SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
E + + S V H + + ++VPL L A + + A L ++++
Sbjct: 199 DDEDDEDEDEGEEEEGESNVAHDKYMK-----SMVPLADTLNADTKQFNAHLVYDKESLK 253
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
+V +P K G+ + + G PN+++L YG+V+ D
Sbjct: 254 MVSVKPIKMGQQVYNFYGEHPNAEILRRYGYVEWDG 289
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 73/175 (41%), Gaps = 22/175 (12%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L N++ +A LA+ ++ E K G S W PYI L + + + + W E+EL +
Sbjct: 768 LLGNEIGNVAKLAVVVLLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMI 822
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
S E L + IKRE+ + + +P I + + + F A+ V S
Sbjct: 823 RKSSLYEESLNQRSQIKREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS 873
Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 185
+ +L+P L + +AML DD ++V DR + GE
Sbjct: 874 -----RAWRSTEGVSLIPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 123/310 (39%), Gaps = 57/310 (18%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTK 78
LA+YL++ + +RE + R +A S + ++E EL G+
Sbjct: 77 LAVYLLFVR-----------LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLY 125
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
+ E I+ +Y + +F Q+P +P + + +K A V S +
Sbjct: 126 TITKQLEERIEDDYR------VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDF- 178
Query: 139 KVSLARRFALVPLGPPL---------LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESI 187
++P G PL + +S + K A + L V + Y+ G+ I
Sbjct: 179 ---------VLPNGKPLRVLAPFADMINHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQI 229
Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
+ G PN++LL YGFV +NP D + + + P Y+ K+ + G S
Sbjct: 230 YISYGSIPNNRLLRLYGFVIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIP 289
Query: 248 VHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLA 301
+ + D LP YLR+ + D S++ ++ + L +S E +L L
Sbjct: 290 L-------TLIDPLPKSVLRYLRIQRL-DASDLAAIALQKLDTNEKISNSKEVEILQFLV 341
Query: 302 DYFKARLAGY 311
+ A L G+
Sbjct: 342 ESISALLDGF 351
>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
Length = 397
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G +S W Y+ L + + P+ E E+ L SP
Sbjct: 61 VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 112
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
KA+ E+ ++ + + LF + P D F++ F A+ V + V+L
Sbjct: 113 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 168
Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ + L+ L P LL +S + KA ++ + + + +
Sbjct: 169 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 228
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
GP N +LL+ YGFV NP+ + V A L D Q K + + +G
Sbjct: 229 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 282
>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
Length = 378
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G +S W Y+ L + + P+ E E+ L SP
Sbjct: 42 VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 93
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
KA+ E+ ++ + + LF + P D F++ F A+ V + V+L
Sbjct: 94 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 149
Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ + L+ L P LL +S + KA ++ + + + +
Sbjct: 150 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 209
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
GP N +LL+ YGFV NP+ + V A L D Q K + + +G
Sbjct: 210 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 263
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 8 YAAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 50
+A LL TNK + LA +L++E +G++SFW PY+ L RQ
Sbjct: 64 FAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108
>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
[Brachypodium distachyon]
Length = 495
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
D+ +++++ R AGE + G N+ LL YGF + DNPYD + ++ L T+
Sbjct: 251 DEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFTELDNPYDIVNIDLTLVTK 305
>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
strain 10D]
Length = 576
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 155 LLAYSSKCKAMLAA--VDDAVQLVV-DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
L +SS+ ++ +A DA L + +R AG+ + + G N +LL YGFV+EDNP
Sbjct: 411 LFNHSSRVQSKVAYEYFYDAFSLSISNRDTHAGDQVFISYGTLTNDELLALYGFVEEDNP 470
Query: 212 YD 213
+D
Sbjct: 471 HD 472
>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
Length = 447
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 228
+A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272
Query: 229 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 288
+ ++GKLS ++ LRL + + +SV+ + +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317
Query: 289 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
S E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371
>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
Length = 177
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 214
L S CK + AV +++++ R KAG + G PN LL YGFV E+NP+D
Sbjct: 54 FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111
Query: 215 LVVE 218
+ VE
Sbjct: 112 IPVE 115
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
R A++P+ C+A A+ + + DR Y+ GE + + G LL YG
Sbjct: 179 RLAILPVADLFNHADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYG 236
Query: 205 FVDEDNPYDRLVVEAAL 221
FV +N +D + ++ A+
Sbjct: 237 FVPTENCWDVVCLDEAI 253
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 70/158 (44%), Gaps = 14/158 (8%)
Query: 65 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 124
S+ ++A L G+P + + RA E E + F SL YP E F++E +
Sbjct: 276 SQEDVALLEGTPLHGDAV-RARQHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYL 332
Query: 125 QAFVAVQSCVVHLQKVSLARRFALVP-LG-----P-PLLAYSSKCKAMLAAVDDAVQLVV 177
A S + +Q S R L P LG P P + + SK + ++
Sbjct: 333 WAAELWYSYGIQVQFASGDIRTCLAPYLGLMNHHPLPHVVHFSKVDPETGCL----RVRA 388
Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
RP +AG + + GP N+KLL+ YGF DNP D +
Sbjct: 389 FRPCEAGNQLFLSYGPYSNAKLLLFYGFAVRDNPADEV 426
>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
Length = 539
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 63/312 (20%), Positives = 119/312 (38%), Gaps = 36/312 (11%)
Query: 28 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 87
L+ EK +G S W P+I L + + L ++ ++ L G+ + +
Sbjct: 195 LIIEKVRGAASNWQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRV 247
Query: 88 IKREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVH 136
I R Y + D+V +LF +Y Y++ A + +Q V Q
Sbjct: 248 IARIYASMYKCAYMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATD 306
Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
V + AL+P K + ++ + +KAGE ++ G + N
Sbjct: 307 SDGVRNSPMSALIPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCN 366
Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAG 251
+ L+++GF+D +N D + + L+ D + + ++ +R +L V H
Sbjct: 367 ADRLVHHGFLDMNNLKDYVHIRLGLSPTDALAEQRALLLSELNIERKAELRVLPAPEHIS 426
Query: 252 REKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
E +L ++R+ +S S+++ + L C + +E L K
Sbjct: 427 GE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLK 480
Query: 306 ARLAGYPATLSE 317
L ATL E
Sbjct: 481 LLLGVLDATLKE 492
>gi|71656153|ref|XP_816628.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70881769|gb|EAN94777.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 82/187 (43%), Gaps = 31/187 (16%)
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 236
G I + GP N +LL YGFV E N ++RL E A+ E + +R +VA
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVVEGNEHERLPFPFDFPEGAVGDE---WDGRRAALVAT 418
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
L+ + H GR A ++ LR+ ++++ E ++ + GP + E V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473
Query: 297 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 341
+ +AD + L + +L EDE +L D N P KR+A L+R+
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHAGDDGNTQPLSCNKRLAI-LLRVG 532
Query: 342 KKMLNAC 348
K + C
Sbjct: 533 MKRIAHC 539
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+ YK GE +++ G PN +LL+ YGFVD+ N D E L
Sbjct: 350 KSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLL 392
>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
Length = 685
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
++ P+ L A A L +++ +P AGE I PNS LL YG
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427
Query: 206 VDEDNPYD------RLVVEAALN 222
VDE N D +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450
>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 661
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 70
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 105
Y + L+RA I E L VW+ G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552
>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
Length = 471
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)
Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 216
A + V++V + K + +++ G N L++YGFV E NPYD +
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311
Query: 217 VEAALNTEDPQYQD-----KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 271
A P++ ++++Q N + V G + +L +R+ +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371
Query: 272 TSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
E++ + SL I P+ E AV + L+ +P + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426
>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
Length = 699
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 170 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 50/238 (21%), Positives = 94/238 (39%), Gaps = 38/238 (15%)
Query: 28 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 87
L+ EK +G S W P+I L + + L ++ ++ L G+ + +
Sbjct: 203 LIIEKVRGAASAWQPFIDTLPTR-------YSTVLYFTVDQMQRLRGTSACSAAMRHCLV 255
Query: 88 IKREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQ-SCVV 135
I R Y + D V +LF +Y Y++ A + +Q V + S V
Sbjct: 256 IARLYASMYKCAYIQPGDNVMAAKANLFTEYGLCYELYRWAVSTVTTRQNLVPRELSTVG 315
Query: 136 HLQKV----------------SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LV 176
+ +V + AR + L P + +C + + D A Q
Sbjct: 316 EVDQVCQLGGFEGTEIKRDAETGARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECT 375
Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
+KAGE ++ G + N+ L+++GF+D N D + + L+ DP + + ++
Sbjct: 376 AQEAFKAGEQFFIYYGDRSNADRLVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
K++ A +A+ P S C+ A ++ DR Y+AGE + V GP N
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247
Query: 199 LLINYGFVDEDNPYD 213
LL+ YGF+ + N D
Sbjct: 248 LLVEYGFLLDANSND 262
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 50/117 (42%), Gaps = 10/117 (8%)
Query: 10 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
A+ L +L L + +M E+ G +S W Y L RG L P+ W+E L
Sbjct: 87 AKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEARL 141
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
L G+ + E AE ++ +Y+E + L +P E TFE + +A
Sbjct: 142 EALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
DDA+ + V + KAGE + G N+KLL +YGF DNP D++ +
Sbjct: 232 DDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNPADKVTI 279
>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
Length = 263
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 33/63 (52%)
Query: 162 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
C + D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL
Sbjct: 15 CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74
Query: 222 NTE 224
T+
Sbjct: 75 VTK 77
>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 514
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 95/230 (41%), Gaps = 20/230 (8%)
Query: 16 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
N + + LAL LMYE+ + S W ++R L +ES L W++ EL +
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183
Query: 76 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
K ER ++ Y + T Y D+ FT E F A V Q+ +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
H S +L+P+ ++ + + A++ A + + KAGE I +
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITI-DYEMS 292
Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 245
+ LL+ YGFV+ D + + ++D Y + + + G LS QV
Sbjct: 293 SHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYPRRLRLLEGRG-LSRQV 341
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 90/242 (37%), Gaps = 46/242 (19%)
Query: 13 LTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
L + L EL L L ++YE ++S W PY R ++ + WS EL
Sbjct: 81 LLSQDLGELGPWLSLMLVMIYEYLLREQSAWAPYYRIFPEN-------FDTLMFWSPAEL 133
Query: 70 AYLTGSPTKAEILERA-----------------------EGIKREYNELDTVWFMA---- 102
L GS +I + +G+ E T +
Sbjct: 134 QELQGSAIVDKIGRQGAEESILQMIAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHV 193
Query: 103 -GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 161
GSL Y +DI + + + Q +VPL L A + +
Sbjct: 194 MGSLIMAYAFDIEKVNDEDDEDNEGEDGYMTDEEEDQSSK-----GMVPLADILNADADR 248
Query: 162 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAA 220
A L +D++ + +P AG+ I G P S LL YG+V ++ PYD V+EA+
Sbjct: 249 NNARLFQEEDSLVMKAIKPIAAGDEIFNDYGELPRSDLLRRYGYVTDNYAPYD--VIEAS 306
Query: 221 LN 222
L+
Sbjct: 307 LD 308
>gi|156054286|ref|XP_001593069.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980]
gi|154703771|gb|EDO03510.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 418
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 51/231 (22%), Positives = 95/231 (41%), Gaps = 24/231 (10%)
Query: 8 YAAELLTTNKLSELACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
+ E L T K + + +LM + + +KS W YIR L + L + P+ W E
Sbjct: 74 FPIEFLETLKQDDPNIIGHFFLMQQYLKCEKSPWWQYIRLLPQPGDPKSLGI--PIWWPE 131
Query: 67 TELAYLTGSPT----------------KAEILERAEGIKREYNELDTVWFMAGSLFQQYP 110
+ +L G+ K +L R +EY+ + W A ++F
Sbjct: 132 EDQKFLAGTNAGPPLQKREQMWRDQWKKGVVLLRELPNHKEYSYILYQW--AATIFDSRS 189
Query: 111 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 170
+ P+ E ++ + + H++ + + LV +G + K L++
Sbjct: 190 FR-PSLTICPEALSESSKEMDLNLDHVRNDRFSILYPLVDIGNHNGINQVEWKKDLSS-- 246
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
++ LV G+ I + G + NS+LL+ YGF+ ++ +R VV L
Sbjct: 247 NSFDLVHSAGVSEGDQIYNYYGNKSNSELLLGYGFILPNDIVNRNVVNLKL 297
>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 82/206 (39%), Gaps = 25/206 (12%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------P 76
L + +M E GK S W Y+ + G S L W + ++ L G+
Sbjct: 110 LVIAVMCEMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYE 165
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVV 135
E L+ + + + T++F LF + + EA A VA S +
Sbjct: 166 NDDETLDLPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTL 222
Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWC 191
++ A+VP +L ++ C+A + D +Q++ R K GE +
Sbjct: 223 GDDEIQ-----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTY 276
Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
GP N++LL YGFV NP+ V
Sbjct: 277 GPLRNAELLRRYGFVLPRNPHGGTTV 302
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
+ LV+ P G ++ GP+PN++L++ YGF +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295
>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
Length = 439
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G +S W Y+ L + + P+ E E+ L SP
Sbjct: 103 VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 154
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
KA+ E+ ++ + + LF + P D F++ F A+ V + V+L
Sbjct: 155 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 210
Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ + L+ L P LL +S + KA ++ + + + +
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
GP N +LL+ YGFV NP+ + V A L D Q K + + +G
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 324
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 48/246 (19%)
Query: 35 GKKSF----WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 90
G ++F W YIR L R + P +W+ E L G+ +A + + +
Sbjct: 111 GTRAFASTPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSS 163
Query: 91 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 150
EY++L A +L + + +E+ T E + A +S + L + A+VP
Sbjct: 164 EYDKLCE---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP 214
Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFV 206
G + +S A D +++ RP AG I + G +P +++L +YGF+
Sbjct: 215 -GLDMANHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFI 273
Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 266
D+D+ L + +DP L FH++ G P +RL
Sbjct: 274 DKDSTVKELTLHLEALPDDP--------------LGRAKFHIYKGP---------PTVRL 310
Query: 267 GYVSDT 272
++D
Sbjct: 311 SIINDN 316
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 229 ALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 288
Query: 207 DEDNPYDRLVVEAALNTED 225
D++N D + + L+ D
Sbjct: 289 DDNNLKDYVNIRVGLSLTD 307
>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
Length = 439
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
+S L L +L+ EK G +S W Y+ L + + P+ E E+ L SP
Sbjct: 103 VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 154
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
KA+ E+ ++ + + LF + P D F++ F A+ V + V+L
Sbjct: 155 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 210
Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
+ + L+ L P LL +S + KA ++ + + + +
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
GP N +LL+ YGFV NP+ + V A L D Q K + + +G
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 324
>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
Length = 440
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 85/201 (42%), Gaps = 19/201 (9%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L +P K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
+ E+ + ++ + LF + I F++ + A+ AV + V+++
Sbjct: 157 TKAWEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMK 212
Query: 139 K---VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWC 191
+ + L+ L P LL +S + A ++ + + G+ + +
Sbjct: 213 RPPLLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY 212
GP N +LL+ YGFV NP+
Sbjct: 273 GPHDNHRLLLEYGFVCVSNPH 293
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 71/164 (43%), Gaps = 25/164 (15%)
Query: 61 PLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
P+LW EL L P ++++ LER E E W F++ D+P + +T
Sbjct: 107 PMLWPR-ELKQLL--PLESQVTLERRE------KEFQDNW----DDFKEAFPDVPRDDYT 153
Query: 120 FEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
+ A++ V + + + K R AL+P+ + C+ + +
Sbjct: 154 Y-----AWLVVNTRTFYHETPETLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYH 207
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
+V DR YK GE + + N L+ YGF+ ++N D + ++
Sbjct: 208 VVADRAYKKGEELFISYSSHSNDYNLLEYGFIPDENSLDDVYID 251
>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
Length = 577
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 87/222 (39%), Gaps = 58/222 (26%)
Query: 42 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 100
P+I L R G SP W+E E + + T A++ I N+L W+
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157
Query: 101 MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 144
+ L FQ Y Y + F FE F V+ S +L ++ +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217
Query: 145 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 183
F P L++ + +C+ ML + D + ++ K
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
G+ I GP+ N +LL YGFV ++N YD+ V+ AL +D
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL--ALRLQD 311
>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
Length = 524
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 98/228 (42%), Gaps = 32/228 (14%)
Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
A+VP+ L A A L ++ ++++ RP K GE I G PN++LL YG
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335
Query: 206 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 246
VD + NP D L+V A L+T+D + + + + F
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGDDVFDLYF 395
Query: 247 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
+ E ++ +RL + D E + + P P M+ L L + +
Sbjct: 396 DL------EIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445
Query: 307 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
RL YP ++ +DE +L T +L+ + + +L EKK+L+ L TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
V+L+ R +GE I + G N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
++ L+ G+ + GP+PNS+L+++YGF +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255
>gi|453088140|gb|EMF16181.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 307
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 70/185 (37%), Gaps = 24/185 (12%)
Query: 61 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
PL WS +L S + LER RE E D + ++ + + +
Sbjct: 112 PLFWSPELCHHLPES--VKQPLERM----REDYEKDLTYMLS--------LNCDDQTWKE 157
Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
E FK + V S H + L F L P + + ++ ++V DR
Sbjct: 158 EDFKYYWAIVNSRCFHFKPPGLKPGFMVLCPFIDYMNHGPTGTGVKVSQSPKGYEVVADR 217
Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 230
Y+ I+ G PN KLL++YGF D+D D +++ A Q QD
Sbjct: 218 DYEPNTEILATYGSHPNDKLLVHYGFCLSYKPNEPSDDDIRLDHILLPAMSANTKSQLQD 277
Query: 231 KRMVA 235
M+
Sbjct: 278 VGMLG 282
>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
Length = 430
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 46/202 (22%), Positives = 85/202 (42%), Gaps = 22/202 (10%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
LS LA+YL+ EK++G SFW P+I D +L++ +P++W ++ P
Sbjct: 117 LSSFQLLAIYLVLEKERGAASFWKPFI---DMLPSIEELSL-APVVWKVLQV------PH 166
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DIPT-EAFTFEIFKQAFVAVQSCV 134
++ R++ E + + Y D+P+ AF F A++ + S
Sbjct: 167 CDDLWRMLSRSARKHAE-----SVVARFEKDYAVVCDLPSVPAFERSSFLWAWMCINSRC 221
Query: 135 VHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
+++ Q + F + P L +S++ + + ++ YK E +
Sbjct: 222 LYMSMPQAKDTSDNFTMAPY-VDFLNHSNEDQCGIKIDPHGFHVLTSSAYKPQEELYFSY 280
Query: 192 GPQPNSKLLINYGFVDEDNPYD 213
GP N LL YGF N ++
Sbjct: 281 GPHSNEFLLCEYGFTLPHNKWN 302
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
A L GSP AE + + + + SL + YP F++E + A
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAFE---SLLKAYPDYFQPHWFSWESYLWAAEL 220
Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 186
S + +Q + R LVP + + + VD A +++ RP G
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280
Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
+ + GP PNSKLL+ YGF DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309
>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
Length = 440
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 98/237 (41%), Gaps = 25/237 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L P +
Sbjct: 105 SPLLALCTFLVSEKHAGDQSVWKPYLEILPK-------AYTCPVC-LEPEVVNLFPKPLR 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
A+ E+ ++ ++ + LF + I F++ A+ V + V++
Sbjct: 157 AKAEEQRARVREFFSSSRGFFSSLQPLFSEAVGSI----FSYRALLWAWCTVNTRAVYV- 211
Query: 139 KVSLARRFALVP----LGP--PLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVW 190
K R F+ P L P LL +S + A ++ ++ + E + +
Sbjct: 212 KPRRRRCFSAEPDTCALAPYLDLLNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFIC 271
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+ H
Sbjct: 272 YGPHDNQRLLLEYGFVSIHNPHACVYVSEDILVKYLPSTDKQM----NKKISILKDH 324
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)
Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
R +P CK +A+ +VQ DR Y GE + V GP N LL YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235
Query: 205 FVDEDNPYDRLVVEAAL 221
F+ + N +D + ++ +
Sbjct: 236 FILDTNRWDEVYLDEVI 252
>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
Length = 440
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 55/240 (22%), Positives = 93/240 (38%), Gaps = 24/240 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPISLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
A+ E+ ++ + + LF + I F++ A+ V + V+L
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
Q L+ L P LL +S + KA ++ ++ E + +
Sbjct: 213 PRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
GP N +L + YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 273 GPHDNHRLFLEYGFVSGHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
Length = 1916
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 216
DA + R Y AG+ + G + N++L+ NYGF++ NP+D V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339
>gi|392587357|gb|EIW76691.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 430
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 30/51 (58%)
Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
+ + + +V + G+ I GP+PNS+L++ YGF +NP D +V++
Sbjct: 230 STETTISIVSHDAFNEGQEIYNNYGPKPNSELILGYGFSLPNNPDDTIVLQ 280
>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
Y34]
gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
P131]
Length = 390
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
+++ R L P+ + C +A D+ + DR Y AGE +++ G N
Sbjct: 173 ERLGKEDRMVLQPVADLFNHADAGC--AVAFNDEDFTIRADRDYDAGEEVLICYGNHSND 230
Query: 198 KLLINYGFVDEDNPYDRLVVEAAL 221
LL YGFV N +D + ++ A+
Sbjct: 231 FLLAEYGFVLAANRWDEVCIDDAI 254
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 79/180 (43%), Gaps = 23/180 (12%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQ 236
+ YK GE +++ G N +LL +YGF+ ++NP D+ + E + +D + Q
Sbjct: 257 KNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQ 316
Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
NGK S +L LRL + + S+ +SV + +S E V
Sbjct: 317 -NGKPSFA---------------LLSALRL-WATPASQRRSVGHLVYSGTQLSSENEIFV 359
Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
++ +A L P ++ ED +L+ + ++A Q K++L C+ ++I
Sbjct: 360 MEWIAKSCHVVLENLPTSVEEDSLLLS----MERWKLAVQWRLRHKRILVDCISRCTEII 415
>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
Length = 761
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 17/101 (16%)
Query: 11 ELLTTNKLS--ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
EL N+ S E L YL EKK+G+KSF+ PY++ + + + + WS+ E
Sbjct: 86 ELFDENETSDAEFNILIFYLFNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEE 137
Query: 69 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 109
L Y+ E I RE +L +W A +F ++
Sbjct: 138 LNYIEDPIILDEF-----AIVRE--DLKDLWNQAKEIFNEF 171
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P K + AAV ++ +AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 207 DEDNPYDRLVVEAALNTED 225
D +N D + + L+ D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397
>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
Length = 634
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 49/226 (21%), Positives = 97/226 (42%), Gaps = 23/226 (10%)
Query: 3 HLLFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 62
H+LF + K+ + L L+ ++ +GK+S W YI L G ++ +PL
Sbjct: 82 HVLFVDSRLQPCQGKIPDHILTYLLLIEQRNKGKESPWHAYIACLP-----GAESMTTPL 136
Query: 63 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
+ + ++A+L G+ ER +++ + + AG D + + I
Sbjct: 137 WFDDEDMAFLAGTSLAPAAKERKSLYYQQWEQALGIMKDAGVALAD-EVDFESLLWAATI 195
Query: 123 F-KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDD-AVQL 175
F +AF++ H VPL P++ + S+K + + +++L
Sbjct: 196 FTSRAFISTHILPDH----------ETVPLLFPIVDILNHSVSAKVEWEFQPLASFSLKL 245
Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+ + AG+ + P+ N +LL+ YGF E NP ++ ++ A
Sbjct: 246 LEGDTFTAGQELFNNYAPKQNDELLLGYGFCLEHNPIEQFPLKLAF 291
>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
CCMP526]
Length = 265
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 239
YK GE + G + N++LL+ YGF DN ++ + + P +Q G
Sbjct: 18 YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77
Query: 240 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 289
+ GR + D+L L + T SE+ + +L C ++
Sbjct: 78 SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129
Query: 290 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 322
PC ER L + A LA +P ++ EDE L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172
>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
gi|194701488|gb|ACF84828.1| unknown [Zea mays]
gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
Length = 495
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 55/247 (22%), Positives = 92/247 (37%), Gaps = 62/247 (25%)
Query: 22 ACLALYL--MYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETEL-AYLTGSPT 77
CLAL + MYE+ QG S W Y++ L ES PL+W E L G+
Sbjct: 81 GCLALTVAVMYERAQGADSPWDAYLQLLPD--------CESVPLVWPAGEAECLLAGTEL 132
Query: 78 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
+ + E + ++ E ++G L D+ + F+ E + A V S +
Sbjct: 133 DKIVKQDKEFLCEDWKECIEPLMLSGEL------DVDPDDFSLEKYLSAKTLVSSRSFQI 186
Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML------------------------------- 166
+ +VPL L + + C+ +
Sbjct: 187 DSYHGS---GMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEEEEDDRSDASADDKPTTK 242
Query: 167 ---------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
A D+ +++++ R GE + G N+ LL YGF + DN YD + +
Sbjct: 243 NPTSSPPGSRANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNI 302
Query: 218 EAALNTE 224
+ AL T+
Sbjct: 303 DLALVTK 309
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 27/184 (14%)
Query: 60 SPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAGS--LFQQYPYDIPT 115
S + ++E EL G+ T + L+R+ I+ +Y L V +A S LF P
Sbjct: 194 SSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYRTL-VVRVLAQSRDLF-------PL 243
Query: 116 EAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCK--AMLAA 168
+ F+ E +K A V S + + L FA +L +SS+ + + A
Sbjct: 244 DKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPFA------DMLNHSSEVEPCHIYDA 297
Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
+ ++ + Y+AG+ ++ G PNS+LL YGFV NP D + + + P +
Sbjct: 298 SSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYGFVMPGNPNDSYDLVISTHPSAPFF 357
Query: 229 QDKR 232
+ K+
Sbjct: 358 ERKQ 361
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 37/205 (18%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L L+ E+ + SFW PYI L P+ + ++ L +P ++ +
Sbjct: 3 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 54
Query: 84 RA----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
R E K +L TV + + Q D+ + + + A A S L
Sbjct: 55 RCRFLLEFEKEVQQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH- 105
Query: 140 VSLARRFALVPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVW 190
VP+ PL+ +++ + + + ++D +V+++ ++ K E+I +
Sbjct: 106 -------GEVPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLN 158
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
G PN L++YGFV NPYD++
Sbjct: 159 YGCYPNDFFLLDYGFVITQNPYDQV 183
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 207 DEDNPYDRLVVEAALNTED 225
D+ N D + + L+ D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
+Q P K G + + GP N++LL+ YG+ ++DNPY +E L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488
>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
Length = 546
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 70
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 105
Y + L+RA I E L VW+ G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
GE +V+ G + N +LL +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 323 ALIPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV 382
Query: 207 DEDNPYDRLVVEAALNTED 225
D +N D + + L+ D
Sbjct: 383 DVNNLKDYVNIRVGLSPTD 401
>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
Length = 492
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 132/337 (39%), Gaps = 48/337 (14%)
Query: 37 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-EG 87
+S W PY L + ++S + WS ELA L G EI ++
Sbjct: 104 ESKWAPYFNVLPTK-------LDSLVFWSPEELAELQASAVLKKVGKDKAEEIFHQSISK 156
Query: 88 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH-LQKVSLARRF 146
+ E ++D ++ S Y +DIP E + V QK SLA
Sbjct: 157 VTPEGTDVD-IFHRVASTIMAYAFDIPD----IEQEDEEGANEDDLVDDDEQKTSLA--- 208
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
++PL +L + A L + +++ P K GE I+ G P S LL YG+V
Sbjct: 209 -MIPLAD-MLNADADNNARLHYDGEELEMRTINPIKTGEEILNDYGQLPRSDLLRRYGYV 266
Query: 207 -DEDNPYDRLVVEAALNT-EDPQYQD---KRMVAQRNGKLSVQVFHV------------- 248
D+ +D V E + +T D YQD + V R G++ ++
Sbjct: 267 TDKYATFD--VAEISTSTITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAHDVG 324
Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
HA E ISD L L + + ++ SS S V L + R
Sbjct: 325 HATEEWPCISDELVALVYLLLVGEETLAAIQSSKMSFPSRSKMETELVGKALQRILERRE 384
Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
Y T+ EDE +L N + ++A Q VRM +K++
Sbjct: 385 REYATTVVEDENLLQSGNHSNRVKMAIQ-VRMGEKVV 420
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 69/151 (45%), Gaps = 14/151 (9%)
Query: 69 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
+ L G+ EI++ + + +Y+EL F A L +P P E +T+E F A
Sbjct: 260 ITMLDGTLLLEEIMQARQHLHAQYDEL----FPA--LCNNFPDIFPPELYTWEKFLWACE 313
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVD---DAVQLVVDRPYKA 183
S + + R L+PL L S C ++ VD ++++ + RP ++
Sbjct: 314 LWYSNSMKIMYSDGKLRTCLIPLAGFL--NHSLCPHVMHYGKVDPATNSLKFCLSRPCRS 371
Query: 184 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 213
GE + G +S L+ YGF+ + DN YD
Sbjct: 372 GEECCLSYGNFSSSHLITFYGFLPQGDNSYD 402
>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 544
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 70
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400
Query: 71 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 105
Y + L+RA I E L VW+ G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435
>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
Length = 1017
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 30/52 (57%)
Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
AGE ++ G + N++ L+N GFVD DN D + + L+ DP + + ++
Sbjct: 838 AGEQFFIYYGDRTNTEFLVNNGFVDPDNRNDYVNIRLGLSPTDPLAEKRAII 889
>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
Length = 440
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 53/233 (22%), Positives = 93/233 (39%), Gaps = 24/233 (10%)
Query: 19 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
S L L +L+ EK G +S W PY+ L + A P+ E E+ L P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSPWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLK 156
Query: 79 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
A+ E+ ++ + + LF + I F++ A+ V + V
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212
Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWC 191
H ++ L+ L P LL +S + A ++ ++ ++ E + +
Sbjct: 213 HRRRECLSAEPDTCALAPYLDLLNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEVFICY 272
Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNG 239
G N +LL+ YGFV NP+ R ++ L + D Q K + + +G
Sbjct: 273 GHHDNQRLLLEYGFVSIQNPHACVYVSREILVKYLPSTDKQMNKKISILKDHG 325
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 37/205 (18%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L L+ E+ + SFW PYI L P+ + ++ L +P ++ +
Sbjct: 142 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 193
Query: 84 RA----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
R E K +L TV + + Q D+ + + + A A S L
Sbjct: 194 RCRFLLEFEKEVQQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH- 244
Query: 140 VSLARRFALVPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVW 190
VP+ PL+ +++ + + + ++D +V+++ ++ K E+I +
Sbjct: 245 -------GEVPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLN 297
Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
G PN L++YGFV NPYD++
Sbjct: 298 YGCYPNDFFLLDYGFVITQNPYDQV 322
>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
Length = 538
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
++VP L A ++ C A L + + + +P K GE + G PNS++L YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313
Query: 207 D 207
+
Sbjct: 314 E 314
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
D AV L++ P G ++ GP+PN++L++ YGF NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 28/44 (63%)
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
LV+ P G ++ GP+PNS+L++ YGF +NP D +V++
Sbjct: 245 LVIHTPTPRGSELLNNYGPKPNSELILGYGFSLPNNPDDTIVLK 288
>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 39.3 bits (90), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 90/214 (42%), Gaps = 30/214 (14%)
Query: 17 KLSELACLALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVESPLLWS-ETEL 69
+L+ ALY+ EKK+ + SFW LP + ELD +P++W E+E
Sbjct: 144 QLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF----------APIVWEVESE- 192
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFV 128
LTGS A+ E R + + +V F + ++ +E F A++
Sbjct: 193 --LTGSKA-ADFFELLPRSSRNHAKKVSVRFNEDYTAVSEFLTAAKSEPLNKMEFLWAWM 249
Query: 129 AVQSCVVHL---QKVSLARRFALVPLGPPLLAY--SSKCKAMLAAVDDAVQLVVDRPYKA 183
+ S +++ + A F L P L + KC + + +V VD + A
Sbjct: 250 CINSRCLYMSFPSSKAEADNFTLAPY-VDFLNHDCDEKCAIKIDSRGFSVISCVD--HAA 306
Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
G+ ++ GP N LL Y F E N ++ L V
Sbjct: 307 GQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340
>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
Length = 447
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 74/380 (19%), Positives = 152/380 (40%), Gaps = 65/380 (17%)
Query: 13 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
L + + L + + +E QG S W Y + L +Q S + WS+ EL+ L
Sbjct: 76 LENDNIGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLMFWSKEELSLL 128
Query: 73 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAV 130
G+ +I E I+ E+ + + ++F DI +T ++FK+ + +
Sbjct: 129 KGTTVVDKI--GLEDIENEFERVRDIVKQNENVFG----DIAN--YTLDLFKRMGSLILS 180
Query: 131 QSCVVHLQKV---------------------SLARRFALVPLGPPLLAYSSKCKAMLAAV 169
+S V K + A+VP+ L + + A
Sbjct: 181 RSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRTDSVNAHTEYE 240
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-----ED-----NPYD------ 213
++ ++++ + KAG+ I PN+ L+ YG VD +D N D
Sbjct: 241 ENCLRMISLQDIKAGDQIFNTYNDPPNADLIRRYGHVDYSPLSQDPDFMGNKNDVVELPA 300
Query: 214 RLVVEAALNTEDPQYQDKRM--VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 271
+++E AL ++++R+ + G+ S ++ H + + ++L L +
Sbjct: 301 DILLELALPDAKESHKERRVEFLLDECGEDSFELTH------DDLVPELLKICVLLFTES 354
Query: 272 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 331
+E ++ S P + + + L K R+ Y +TL +D + L + + P+
Sbjct: 355 EAEFKTREKSRK--LPKASGFTKGKAEFLIKAIKQRMEQYGSTLEDDISKLDNKDSLPEN 412
Query: 332 RVATQLVRM-EKKMLNACLQ 350
+V + E+++LN ++
Sbjct: 413 NFKALVVTVGERRILNKAIE 432
>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
Length = 562
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 29/59 (49%)
Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
L + + K D + V K+GE + G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 20/143 (13%)
Query: 80 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
EI++ E ++ +Y+EL +L YP P E +T+E F A S + ++
Sbjct: 283 EIMQAKEHLRVQYDEL------VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKF 336
Query: 140 VSLARRFALVP--------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
+ R L+P L P ++ Y + ++ + ++ + RP + GE +
Sbjct: 337 LDGKLRTCLIPIAGFLNHSLHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSY 391
Query: 192 GPQPNSKLLINYGFVDE-DNPYD 213
G + L+ YGF+ + DN YD
Sbjct: 392 GNFSGAHLITFYGFLPQGDNRYD 414
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 25/145 (17%)
Query: 18 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
S+ L + LM+E+ +G+ SF+ PY+ L R ++ L W+E L++L
Sbjct: 143 FSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNWNEEGLSWLQDEGL 195
Query: 78 KAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TEAFTFEIFKQAFVA 129
E+ R + Y L AG LF + P D + ++ E F+ A++
Sbjct: 196 SLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGADPYSLENFRFAWLT 255
Query: 130 VQSCVVHLQKVSLARRF---ALVPL 151
+Q+ + RR AL+PL
Sbjct: 256 IQA-------RAFGRRLPYSALIPL 273
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 38.9 bits (89), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 83/193 (43%), Gaps = 27/193 (13%)
Query: 27 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
++ +K G + W YIR + +V P +S E L G+ + + +
Sbjct: 115 FVNKRQKIGISNPWTEYIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLG 167
Query: 87 GIKREYNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
+++E++ L + W Q++ +D T FTF+ +K +S VV L +
Sbjct: 168 SLEKEFDHLRQATEEIPWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG 221
Query: 142 LARRFALVP---LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PN 196
A+VP + S K K +AV QL + + GE + + G + P
Sbjct: 222 ----HAIVPCVDMANHACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPA 277
Query: 197 SKLLINYGFVDED 209
S+++ +YGFV+ +
Sbjct: 278 SEMVFSYGFVENE 290
>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
2508]
gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 533
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 84/205 (40%), Gaps = 30/205 (14%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 82
L L LM+E QG S W PY+ L Q ++P+ W+E ELA L S A++
Sbjct: 131 LILILMHEYLQGSSSNWSPYLSILPHQ-------FDTPMFWTEAELAELQASALVAKVGK 183
Query: 83 ERAEGIKRE-----YNELDTVWFMAGSLFQQ-----------YPYDIPTEAFTFEIFKQA 126
+ A+ + R E + V++ AG+ Q + A+ F++ K+
Sbjct: 184 DEADKMIRTKIVKVVQENEDVFYPAGTPKTQRLDEGELLKLGHRMGSAIMAYAFDLAKEE 243
Query: 127 FVAVQSCV-----VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 181
V + + +VP+ +L + A + + + R
Sbjct: 244 DDDEDEEEEEDGWVEDKIGGMNDTMGMVPMA-DMLNADAVFNAHINHGEACLTATSLREI 302
Query: 182 KAGESIVVWCGPQPNSKLLINYGFV 206
K GE I+ + GP +++LL YG+V
Sbjct: 303 KEGEEILNYYGPLSSAELLRRYGYV 327
>gi|242210759|ref|XP_002471221.1| predicted protein [Postia placenta Mad-698-R]
gi|220729780|gb|EED83649.1| predicted protein [Postia placenta Mad-698-R]
Length = 256
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 33/64 (51%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
++ LV+ + AG + GP+PN++L++ YGF NP D +V++ + Q
Sbjct: 77 SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLKLGGASAAQHAQHN 136
Query: 232 RMVA 235
VA
Sbjct: 137 NAVA 140
>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 1 [Galdieria
sulphuraria]
Length = 487
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 47/331 (14%)
Query: 38 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LD 96
S W PYI L G + WS +ELA L P E+ I + Y E L
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEV-----KINQYYREALY 214
Query: 97 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 155
T F + S P + + +F A VQS + V + +AL+P+ L
Sbjct: 215 TRVFESLS----SPVRVWLQNEKENVFFWALDMVQSRAFGIPDVG-NKTYALLPMMDMLN 269
Query: 156 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
+S+ + ++ + ++ G I + GP N LL YGF+ +NP D
Sbjct: 270 HRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYF 329
Query: 216 VVEAALNTEDPQYQDKRMVAQ------------------RNGKLSVQVFHVHAGREKEAI 257
V+ Y+ + AQ NGK + ++H H E + I
Sbjct: 330 QVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGK-TFHLYHDHYDDEIDII 388
Query: 258 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA--VLDQLADYFKARLAGYPATL 315
LR+ S T Q + + + +E V + K L ++
Sbjct: 389 ------LRVFMASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSV 442
Query: 316 SEDEAMLTDYN-LHPKKRVATQLVRMEKKML 345
EDE +L + + L K ++A Q R+EKK +
Sbjct: 443 EEDEQLLKNKDQLSTKLQLAIQF-RLEKKYI 472
>gi|403331500|gb|EJY64700.1| hypothetical protein OXYTRI_15263 [Oxytricha trifallax]
Length = 741
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 36/64 (56%), Gaps = 4/64 (6%)
Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN----TEDPQYQDKRMVAQ 236
YK G+ + G + N LL+NYGF +N Y+ L +N +E + ++K++ A+
Sbjct: 440 YKKGQQVFHCYGRRTNRFLLLNYGFCMNNNKYNSLSFRVWINFDWQSERQKDEEKKLKAE 499
Query: 237 RNGK 240
++GK
Sbjct: 500 KDGK 503
>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
Length = 459
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 53/208 (25%), Positives = 86/208 (41%), Gaps = 36/208 (17%)
Query: 26 LYLMYEKKQGKKSFWLPYIR------ELDRQRGRGQLAVESPLLWSETELAY---LTGSP 76
L LMYE S W PY++ +LD+ P+ W+E E+ TG P
Sbjct: 107 LALMYEYT-NPNSRWRPYLQLVPDFSQLDQ-----------PMFWTEDEIERDLCNTGIP 154
Query: 77 TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 134
+ +K EY L A +++ + E +FE++K+ AF+ S
Sbjct: 155 EASS--SDLTKMKLEYTSL------ALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFF 206
Query: 135 VHLQKVSLARRFALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
+ + +PL P+ L + +K A L D +++V R AGE +
Sbjct: 207 EPVNGREDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMVTTRTVAAGEEVFNTF 266
Query: 192 GPQPNSKLLINYGFVDE--DNPYDRLVV 217
G N +LL YGF + +N YD + +
Sbjct: 267 GQLANWQLLHMYGFAEAWPENIYDTVDI 294
>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
Length = 653
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 8/92 (8%)
Query: 4 LLFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 63
+F A + + + L ++L+YE ++G +SFW PY +D G L P
Sbjct: 88 FIFDNHASIFKATEDRDYLVLLVFLIYEHQKGTRSFWHPYFEAID----PGLL----PCF 139
Query: 64 WSETELAYLTGSPTKAEILERAEGIKREYNEL 95
WS+ + L S K +I + + + +++ L
Sbjct: 140 WSDQTIEELADSELKDQIRQERDNYEEDWDML 171
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)
Query: 170 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 221
D+A Q +VV RPY+ GE +++ G N +LL YGFV E N +D ++ AL
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371
Query: 222 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 248
P NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402
>gi|242210761|ref|XP_002471222.1| predicted protein [Postia placenta Mad-698-R]
gi|220729781|gb|EED83650.1| predicted protein [Postia placenta Mad-698-R]
Length = 264
Score = 38.5 bits (88), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 29/47 (61%)
Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
++ LV+ + AG + GP+PN++L++ YGF NP D +V++
Sbjct: 216 SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLK 262
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
+A L D+ +G+ + + GP+ N +LL YGFV+ +NP D V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407
>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 429
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 237
+ Y+ GE +++ G N +L+ YGFVD DN D E ++ Y KR +
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305
Query: 238 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 286
+LS + H A E ISD +L LR V T E + +
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362
Query: 287 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
ER D + ++RLA YP T+ EDE L N +R A Q +K++L
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420
Query: 347 ACLQV 351
++
Sbjct: 421 DASRI 425
>gi|407920105|gb|EKG13323.1| hypothetical protein MPH_09605 [Macrophomina phaseolina MS6]
Length = 574
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 48/219 (21%), Positives = 84/219 (38%), Gaps = 32/219 (14%)
Query: 12 LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE----- 66
LL N LS +A + L+ G+KS W PYI L + + +P+ ++E
Sbjct: 76 LLPNNVLSNIALIKELLL-----GEKSLWAPYINCLPKSE-----QLNTPIYFAEEMTQE 125
Query: 67 ------TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
+ A+L G+ R E + E+ +V G I TE +T+
Sbjct: 126 AINGRRNDTAWLLGTNLDKSWRPRKEQWEEEWKNAVSVLKRQG---------IATEGYTW 176
Query: 121 EIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 178
+ + A +S + + ++A++ LL + K + Q + +
Sbjct: 177 DAYAWAATIFTSRSFISDPGLSKESSQYAVLMPVIDLLNHRFPTKVAWFFNEGNFQFITE 236
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
P G I G + N +LL YGF +N D + +
Sbjct: 237 EPVPKGHEIFNNYGGKGNEELLNGYGFCIPNNHCDEVAI 275
>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
Length = 495
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 32/63 (50%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
+++PL L A + +C A L +++ +P KA E I G N +LL YG+V
Sbjct: 220 SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQIYNTYGNHANFELLRRYGYV 279
Query: 207 DED 209
+ D
Sbjct: 280 EVD 282
>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
7435]
Length = 846
Score = 38.1 bits (87), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
++VP L A ++ C A L + + + +P K GE + G PNS++L YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313
Query: 207 D 207
+
Sbjct: 314 E 314
>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
Length = 339
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 99 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 217
+ DR YKAGE + V G N LL+ YGF+ + N P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250
>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
Length = 490
Score = 38.1 bits (87), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 46/204 (22%), Positives = 82/204 (40%), Gaps = 35/204 (17%)
Query: 24 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
L L L+ E+ + SFW PYI L P+ + ++ L +P ++ +
Sbjct: 141 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPLLHQVNK 192
Query: 84 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
R + E+ QQ + +P+ F + Q + S + + +
Sbjct: 193 RCRFLLEFEKEI-----------QQKLHTVPSVDHPF--YGQDVNS--SSLGWAMSAASS 237
Query: 144 RRFALVPLGPPLLAYSSKC------------KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
R F L P LL C + + ++D +V++V ++ + SI +
Sbjct: 238 RAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGSVNSLDMSVKVVAEKKIEQNASITLNY 297
Query: 192 GPQPNSKLLINYGFVDEDNPYDRL 215
G PN L++YGFV NPYD++
Sbjct: 298 GCHPNDFFLLDYGFVITPNPYDQV 321
>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
Length = 495
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
Length = 498
Score = 38.1 bits (87), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 57 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 172
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257
Query: 173 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 208
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294
>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
Length = 495
Score = 38.1 bits (87), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
10762]
Length = 381
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
+ +++ +R Y AGE ++ G N KLL++YGF+ E P
Sbjct: 204 NGYEMLANRDYDAGEEVLFTYGAHSNDKLLVHYGFICESPP 244
>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
Length = 289
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 47 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103
>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
Length = 480
Score = 38.1 bits (87), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 57 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 125 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 183
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 172
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 184 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 239
Query: 173 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 208
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 240 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 276
>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 473
Score = 38.1 bits (87), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 57 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201
Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 172
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257
Query: 173 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 208
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294
>gi|428174289|gb|EKX43186.1| hypothetical protein GUITHDRAFT_110913 [Guillardia theta CCMP2712]
Length = 437
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 108/268 (40%), Gaps = 38/268 (14%)
Query: 21 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----- 75
+ LA++L+ E Q K SFW PY+ L + V P+ +S+ L
Sbjct: 165 IGALAVFLLLES-QNKSSFWRPYLCSLPKH-------VPLPMFYSKERRQQLKEQLPEDQ 216
Query: 76 --PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 133
A + R + + Y +L V LF +YP E F++E F A + S
Sbjct: 217 RVKFDALVEARRDVVDLHYMQLLPV------LFLKYPTLFSPEVFSYEKFAWAISIIMSR 270
Query: 134 VVHLQKVSLA-----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGES 186
A R + L P + + A D ++ ++ + GE
Sbjct: 271 TWGKTYFDSALGPRGRNITVHTLAPAADMPNHDSSGLEANRDPRGRMTLNAQKNLSVGEQ 330
Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE-DPQY--QDKRMVAQRNG-KLS 242
+ G + +++ L +YGFV +N + R + + +E DP D + A R + S
Sbjct: 331 FFISYGSKCDAEFLAHYGFVPFNNSHKR--CKGRVYSEGDPNMGGHDSNVTAWRKKWRKS 388
Query: 243 VQVFHV---HAGRE-KEAISDMLPYLRL 266
+ ++H A RE K+A+ ++L R+
Sbjct: 389 LSLYHALVRKARREGKDALKNLLESSRV 416
>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 490
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 27/193 (13%)
Query: 27 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
++ +K G + W YIR + +V P +S E L G+ + + +
Sbjct: 115 FVNKRQKIGISNPWTEYIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLG 167
Query: 87 GIKREYNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
+++E++ L + W Q++ +D T FTF+ +K +S VV L +
Sbjct: 168 SLEKEFDHLRQATEEIPWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG 221
Query: 142 LARRFALVP---LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PN 196
A+VP + S K + +AV QL + + GE + + G + P
Sbjct: 222 ----HAIVPCVDMANHACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPA 277
Query: 197 SKLLINYGFVDED 209
S+++ +YGFV+ +
Sbjct: 278 SEMVFSYGFVENE 290
>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
Length = 471
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 25/48 (52%)
Query: 166 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
L AV L R GE +++ G N L ++YGF+ EDNPYD
Sbjct: 202 LEGGGGAVGLFARRAITEGEPLLLSYGQLSNDFLFMDYGFIVEDNPYD 249
>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 37.7 bits (86), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 85/211 (40%), Gaps = 24/211 (11%)
Query: 17 KLSELACLALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVESPLLWS-ETEL 69
+L+ ALY+ EKK+ + SFW LP + ELD +P++W E+E
Sbjct: 144 QLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF----------APIVWEVESE- 192
Query: 70 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFV 128
LTGS A+ E R + + V F + ++ +E F A++
Sbjct: 193 --LTGSKA-ADFFELLPRSSRNHAKKVLVRFNEDYTAVSEFLTAAKSEPLNKMEFLWAWM 249
Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR--PYKAGES 186
+ S +++ S L P + + C A D+ +V + AG+
Sbjct: 250 CINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFLVISCVDHAAGQE 309
Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
++ GP N LL Y F E N ++ L V
Sbjct: 310 LLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340
>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
Length = 429
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 30/59 (50%)
Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
R YK GE I + GP PN L + YGF E N D + ++ + + + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 37.7 bits (86), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
D+ V +V+D+ KAG I GP N LL +GF E NP+D
Sbjct: 307 DECVDIVLDKDVKAGVEIFNSYGPLSNVFLLSRHGFAVEGNPHD 350
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,616,468,379
Number of Sequences: 23463169
Number of extensions: 222431939
Number of successful extensions: 563042
Number of sequences better than 100.0: 766
Number of HSP's better than 100.0 without gapping: 355
Number of HSP's successfully gapped in prelim test: 411
Number of HSP's that attempted gapping in prelim test: 561820
Number of HSP's gapped (non-prelim): 1001
length of query: 376
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 232
effective length of database: 8,980,499,031
effective search space: 2083475775192
effective search space used: 2083475775192
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)