BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018962
(348 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/348 (87%), Positives = 323/348 (92%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 162 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 221
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 222 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 281
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 282 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 341
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGY
Sbjct: 342 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 401
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 402 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 461
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 462 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/348 (87%), Positives = 323/348 (92%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 347
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGY
Sbjct: 348 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 407
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 408 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 467
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 468 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/347 (88%), Positives = 323/347 (93%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 81 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 140
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 141 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 200
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 201 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 260
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGY
Sbjct: 261 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 320
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 321 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 380
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 381 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLL 427
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 629 bits (1622), Expect = e-178, Method: Compositional matrix adjust.
Identities = 300/347 (86%), Positives = 320/347 (92%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GI
Sbjct: 155 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGI 214
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREY ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 215 KREYEELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 274
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAYSS CKAML AVD AV+LVVDRPYKAGE IVVWCGPQPNSKLL+NYGFVDE
Sbjct: 275 VPLGPPLLAYSSNCKAMLTAVDGAVELVVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDE 334
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+ VEAALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLGY
Sbjct: 335 DNPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLGY 394
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEMQSVISS GP+CPVSPCME+AVLDQL YF+ RLAGY ++SEDE ML D NL+
Sbjct: 395 VSDPSEMQSVISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNLN 454
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKKRVATQLVR+EKKML ACLQ T D+I LPD T+ PCPAPYAPLL
Sbjct: 455 PKKRVATQLVRLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLL 501
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 298/347 (85%), Positives = 322/347 (92%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE EL YL+GSPTK E+LERAEGI
Sbjct: 174 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLSGSPTKKEVLERAEGI 233
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K+EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 234 KKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFAL 293
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAGESI VWCGPQPNSKLL+NYGFVDE
Sbjct: 294 VPLGPPLLAYRSNCKAMLTAVDGAVELVVDRPYKAGESIAVWCGPQPNSKLLLNYGFVDE 353
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DN YDRLVVEAALNTEDPQYQDKRMVAQRNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGY
Sbjct: 354 DNRYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGY 413
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
V+ SEMQSVISS GP+CPVSPCMERA+L+Q+ADYFK RLAGYP TLSEDE +L D NL+
Sbjct: 414 VTHPSEMQSVISSQGPVCPVSPCMERAMLEQVADYFKRRLAGYPTTLSEDEFLLADGNLN 473
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKKRVATQLVR+EKK+L++CL+VT D I LPD TVSPCPAPYAPLL
Sbjct: 474 PKKRVATQLVRLEKKLLHSCLEVTIDFINQLPDHTVSPCPAPYAPLL 520
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 288/347 (82%), Positives = 317/347 (91%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLW ++EL YL+GSP K E+++R E I
Sbjct: 155 MYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWLKSELDYLSGSPIKDEVIQREEAI 214
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A+QSCVVHLQKVSLARRFAL
Sbjct: 215 RKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAAIQSCVVHLQKVSLARRFAL 274
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVVWCGPQPNSKLLINYGFVDE
Sbjct: 275 VPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVVWCGPQPNSKLLINYGFVDE 334
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGY
Sbjct: 335 NNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGY 394
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEM+SVISS GP+CPVSPCMERA LDQLADYFKARLAGYP TL+EDE+MLTD NL+
Sbjct: 395 VSDPSEMESVISSQGPVCPVSPCMERAALDQLADYFKARLAGYPTTLAEDESMLTDDNLN 454
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKKRVATQ VR+EKKML+ACLQ T D I LPD T+SPCPAPYAPLL
Sbjct: 455 PKKRVATQYVRLEKKMLHACLQATTDFINQLPDHTISPCPAPYAPLL 501
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 288/348 (82%), Positives = 316/348 (90%), Gaps = 1/348 (0%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLWS++EL YL+GSP K E+++R E I
Sbjct: 155 MYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWSKSELDYLSGSPIKDEVIQREEAI 214
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A+QSCVVHLQKVSLARRFAL
Sbjct: 215 RKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAAIQSCVVHLQKVSLARRFAL 274
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVVWCGPQPNSKLLINYGFVDE
Sbjct: 275 VPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVVWCGPQPNSKLLINYGFVDE 334
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGY
Sbjct: 335 NNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGY 394
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD-YNL 299
VSD SEMQSVISS GP+CPVSPCMERA LDQLADYFKARLAGYP L+EDE+MLTD NL
Sbjct: 395 VSDPSEMQSVISSQGPVCPVSPCMERAALDQLADYFKARLAGYPTILAEDESMLTDGGNL 454
Query: 300 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
+PKKRVATQ VR+EKKML+ACLQ T D I LPD T+SPCPAPYAPLL
Sbjct: 455 NPKKRVATQYVRLEKKMLHACLQATIDFINQLPDHTISPCPAPYAPLL 502
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 606 bits (1563), Expect = e-171, Method: Compositional matrix adjust.
Identities = 284/347 (81%), Positives = 314/347 (90%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K EI++R EGI
Sbjct: 154 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKDEIVKRIEGI 213
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ VSLARRFAL
Sbjct: 214 RKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQNVSLARRFAL 273
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KLL NYGFVDE
Sbjct: 274 VPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKLLTNYGFVDE 333
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DN DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+ G+E+EA+SDM+PY+RLGY
Sbjct: 334 DNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSDMIPYMRLGY 393
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF RLA YP TL+EDE+MLTD +L+
Sbjct: 394 VSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDESMLTDGSLN 453
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PK+RVATQLVR+EKKML+ACLQ D+I LPD +VSPCPAPYAP L
Sbjct: 454 PKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPAPYAPSL 500
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 599 bits (1544), Expect = e-169, Method: Compositional matrix adjust.
Identities = 281/348 (80%), Positives = 310/348 (89%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 167 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 226
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFAL
Sbjct: 227 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFAL 286
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDE
Sbjct: 287 VPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDE 346
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY
Sbjct: 347 DNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGY 406
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L
Sbjct: 407 MSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLS 466
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 467 PRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 280/348 (80%), Positives = 309/348 (88%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 169 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 228
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFAL
Sbjct: 229 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFAL 288
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDE
Sbjct: 289 VPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDE 348
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY
Sbjct: 349 DNPYDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGY 408
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD SEMQSVISS GP+C +SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L
Sbjct: 409 MSDPSEMQSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLS 468
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 469 PRKRVATRLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 280/348 (80%), Positives = 309/348 (88%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 167 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 226
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFAL
Sbjct: 227 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFAL 286
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDE
Sbjct: 287 VPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDE 346
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR++VEAALNTE PQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY
Sbjct: 347 DNPYDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGY 406
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L
Sbjct: 407 MSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLS 466
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 467 PRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 570 bits (1469), Expect = e-160, Method: Compositional matrix adjust.
Identities = 265/347 (76%), Positives = 302/347 (87%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI
Sbjct: 158 MYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGI 217
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 218 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 277
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DE
Sbjct: 278 VPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDE 337
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY
Sbjct: 338 DNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGY 397
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML D NL
Sbjct: 398 ISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLE 457
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 458 PKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 504
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 265/348 (76%), Positives = 302/348 (86%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI
Sbjct: 189 MYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGI 248
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 249 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 308
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DE
Sbjct: 309 VPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDE 368
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY
Sbjct: 369 DNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGY 428
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML D NL
Sbjct: 429 ISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLE 488
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 489 PKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPELK 536
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 265/347 (76%), Positives = 302/347 (87%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI
Sbjct: 162 MYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGI 221
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 222 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 281
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DE
Sbjct: 282 VPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDE 341
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY
Sbjct: 342 DNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGY 401
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML D NL
Sbjct: 402 ISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLE 461
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 462 PKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 508
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 569 bits (1466), Expect = e-160, Method: Compositional matrix adjust.
Identities = 264/347 (76%), Positives = 303/347 (87%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I
Sbjct: 165 MYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAI 224
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 225 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDE
Sbjct: 285 VPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDE 344
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY
Sbjct: 345 DNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGY 404
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL EDEAML D NL
Sbjct: 405 ISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLE 464
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 465 PKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 266/348 (76%), Positives = 299/348 (85%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGI
Sbjct: 169 MYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGI 228
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 229 RREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 288
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGFVDE
Sbjct: 289 VPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDE 348
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q F V G+EKE IS+MLPYLRLGY
Sbjct: 349 DNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKETISEMLPYLRLGY 408
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD EMQ ++SS G CPVSPC ERAVLDQL Y K+RLAGYP TL EDEAML D NL
Sbjct: 409 ISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTTLDEDEAMLADGNLE 468
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPA YAP L
Sbjct: 469 PKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTVSPCPALYAPNLK 516
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 261/347 (75%), Positives = 303/347 (87%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I
Sbjct: 165 MYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAI 224
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 225 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDE
Sbjct: 285 VPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDE 344
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY
Sbjct: 345 DNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGY 404
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD EMQS++SS G CP+SPC ERAVLDQL Y ++RLAGYP TL EDEAML D +L
Sbjct: 405 ISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGSLE 464
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PKK VAT+LVR+EKKM++ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 465 PKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTVSPCPAPYAPEL 511
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 261/348 (75%), Positives = 298/348 (85%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGI
Sbjct: 176 MYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGI 235
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 236 KKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 295
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDE
Sbjct: 296 VPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDE 355
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRLGY
Sbjct: 356 DNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGY 415
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+SD EMQ ++SS G CPVSPC ERAVLDQL Y K+RLAGYP L EDEAML D +L
Sbjct: 416 ISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGSLE 475
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
PKK VAT+LVR+EKKML+ CL+ + I LPD TVSPCPA YAP L
Sbjct: 476 PKKEVATRLVRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 262/347 (75%), Positives = 300/347 (86%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL Y TGSP K +LER GI
Sbjct: 160 MYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKEIMLERNSGI 219
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ V+LARRFAL
Sbjct: 220 KREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQNVNLARRFAL 279
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++LL+NYGFVDE
Sbjct: 280 VPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARLLLNYGFVDE 339
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++ GREKEA+ DMLPYLRL Y
Sbjct: 340 DNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLDMLPYLRLAY 399
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
VSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLAGYP T SEDE +L D +
Sbjct: 400 VSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLAGYPTTASEDEIVLADPTTN 459
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
PK++VATQLV +EKKMLN+CL ++I LPD+ V+PCP+PY+P+L
Sbjct: 460 PKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPSPYSPIL 506
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 520 bits (1338), Expect = e-145, Method: Compositional matrix adjust.
Identities = 249/316 (78%), Positives = 269/316 (85%), Gaps = 18/316 (5%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 167 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 226
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---------- 110
KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ
Sbjct: 227 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLD 286
Query: 111 --------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 162
V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVW
Sbjct: 287 CYASSCTQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW 346
Query: 163 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 222
CGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V
Sbjct: 347 CGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRV 406
Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+G
Sbjct: 407 GKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSG 466
Query: 283 YPATLSEDEAMLTDYN 298
YP T ED+A+ N
Sbjct: 467 YPTTPKEDDALEASCN 482
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 233/351 (66%), Positives = 279/351 (79%), Gaps = 4/351 (1%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+ EL Y TGS K +LER EG
Sbjct: 81 MYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELDEYFTGSRMKEVVLERLEG 140
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
IKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFVAVQSCVVHLQ VSL RRFA
Sbjct: 141 IKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVAVQSCVVHLQGVSLPRRFA 200
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
LVPLGPPLLAY S CKAML A D V+L VDR YK GE I+VWCGPQPN++LL+NYGFVD
Sbjct: 201 LVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQILVWCGPQPNTRLLLNYGFVD 260
Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F + GREKEA+ +MLPYLRLG
Sbjct: 261 PDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYLRLG 320
Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--Y 297
+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+ R+A Y T+ ED A+L D
Sbjct: 321 HVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDRALLEDGSS 380
Query: 298 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-PCPAPYAPLL 347
+++PK+RVATQL+ +EK++L+ L V LPD +V+ PC + P L
Sbjct: 381 DINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAPPCCGDFVPKL 431
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 233/351 (66%), Positives = 279/351 (79%), Gaps = 4/351 (1%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+ EL Y TGS K +LER EG
Sbjct: 81 MYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELDEYFTGSRMKEVVLERLEG 140
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
IKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFVAVQSCVVHLQ VSL RRFA
Sbjct: 141 IKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVAVQSCVVHLQGVSLPRRFA 200
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
LVPLGPPLLAY S CKAML A D V+L VDR YK GE I+VWCGPQPN++LL+NYGFVD
Sbjct: 201 LVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQILVWCGPQPNTRLLLNYGFVD 260
Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F + GREKEA+ +MLPYLRLG
Sbjct: 261 PDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYLRLG 320
Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--Y 297
+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+ R+A Y T+ ED A+L D
Sbjct: 321 HVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDRALLEDCSS 380
Query: 298 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-PCPAPYAPLL 347
+++PK+RVATQL+ +EK++L+ L V LPD +V+ PC + P L
Sbjct: 381 DINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAPPCCGDFVPKL 431
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 229/348 (65%), Positives = 272/348 (78%), Gaps = 4/348 (1%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TGS K +LER G
Sbjct: 178 MYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTGSTMKEVVLERLAG 237
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
IKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCVVHLQ VSLARRFA
Sbjct: 238 IKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCVVHLQGVSLARRFA 297
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
LVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VWCGPQPNSKLL+NYGFVD
Sbjct: 298 LVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQPNSKLLLNYGFVD 357
Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
EDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F ++ G+E EA+ DMLPY+RL
Sbjct: 358 EDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQIYKGKEMEAVLDMLPYMRLA 417
Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 299
+++D EM++V + GP+CPVS C ERAVL+QL YF+ RLAGY ++ + + D
Sbjct: 418 HLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRLAGYKSSHATEGG---DAKK 474
Query: 300 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
+ KKRVA +L+ +EK +L L ++I LPD +SPC PY P L
Sbjct: 475 NAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCIGPYLPNL 522
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 224/348 (64%), Positives = 265/348 (76%), Gaps = 4/348 (1%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TGS K +LER G
Sbjct: 82 MYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPEELNEYFTGSTMKEVVLERLAG 141
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
IKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCVVHLQ VSLARRFA
Sbjct: 142 IKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCVVHLQGVSLARRFA 201
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
LVPLGPPLLAY S CKAML AV D VQL VD YK G+ I VWCGPQPNSKLL+NYGFVD
Sbjct: 202 LVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDPIAVWCGPQPNSKLLLNYGFVD 261
Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
EDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F ++ G+E EA+ DMLPY+RLG
Sbjct: 262 EDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTFQIYKGKEMEAVRDMLPYMRLG 321
Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 299
+++D E+++V + P+C VS C ERAVL+Q+ +F+ RLAGY S D D
Sbjct: 322 HLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFERRLAGYK---SSDTTKAVDAKK 378
Query: 300 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
K+ VA +L+ +EK +L L ++I LPD +SPC PY P L
Sbjct: 379 DAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISPCIGPYLPNL 426
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 199/255 (78%), Positives = 229/255 (89%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGI
Sbjct: 171 MYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGI 230
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 231 KKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 290
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
VPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDE
Sbjct: 291 VPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDE 350
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
DNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRLGY
Sbjct: 351 DNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGY 410
Query: 241 VSDTSEMQSVISSLG 255
+SD EMQ ++SS G
Sbjct: 411 ISDPDEMQCILSSEG 425
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/242 (71%), Positives = 206/242 (85%)
Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
++ QKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69
Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
Q NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKE 129
Query: 226 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
K+ +++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPT 189
Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 345
TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP
Sbjct: 190 TLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAP 249
Query: 346 LL 347
L
Sbjct: 250 EL 251
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 166/240 (69%), Positives = 203/240 (84%)
Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129
Query: 226 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
K+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189
Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 345
TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 190 TLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 166/269 (61%), Positives = 203/269 (75%), Gaps = 29/269 (10%)
Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129
Query: 226 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
K+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189
Query: 286 TLSEDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKM 316
TL+EDEAM L D +L PK+ VAT+LVR+EKKM
Sbjct: 190 TLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKM 249
Query: 317 LNACLQVTADMIMLLPDVTVSPCPAPYAP 345
L+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 250 LHACLQATNEFITDLPDHTVSPCPAQYAP 278
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 165/322 (51%), Positives = 222/322 (68%), Gaps = 9/322 (2%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYEKK G++S W +I+ELDR +GRGQ+ +SPLLW E ++ YL GSP AEI ER +G
Sbjct: 99 MYEKKNGRQSVWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKG 158
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
I++EY ELDTVWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFA
Sbjct: 159 IEKEYAELDTVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFA 218
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
LVPLGPPLL+YSS KAML +A VQL VDR Y GE I WCGPQPN +LL+NYG
Sbjct: 219 LVPLGPPLLSYSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAWCGPQPNRRLLLNYGI 278
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
V ++NP+D++ + L DP +Q KR V Q+N + Q F + R+K +LPYLR
Sbjct: 279 VTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYLR 336
Query: 238 LGYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
L + +D + ++ +++L C P+SP ER VL QLA + + RL Y T EDE ++
Sbjct: 337 LAHCTDAASLK--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVIIR 394
Query: 296 DYNLHPKKRVATQLVRMEKKML 317
P+++VA +L+R+EK +L
Sbjct: 395 STTAGPRQKVAARLLRIEKAIL 416
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 211/321 (65%), Gaps = 10/321 (3%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGI 60
YEKK+GK+ W +I+ELDR +GRG +SPLLW E + A L GSP EI R +GI
Sbjct: 91 YEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLWDEGQAAELLAGSPVVGEIEARLQGI 150
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++EY ELDTVW++AGSLF + P+ PTE F+F +F+QAF AVQS VVHLQ V+L +RFAL
Sbjct: 151 RKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVFRQAFTAVQSSVVHLQGVALGKRFAL 210
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP+GPPLL YSS KAML ++ V+L VDR Y+ GE+++ WCGPQPNS+LLINYG V
Sbjct: 211 VPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAYQPGEAVLAWCGPQPNSRLLINYGIV 270
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
DE NPYD+L + + ++DP Y+ KR G + Q F + A A +LPYLRL
Sbjct: 271 DESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLSTQQTFQLQAAASLPA--QLLPYLRL 328
Query: 239 GYVSDTSEMQSVI--SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + ++++ V GP+ P + E VL+QL + + R + Y T+ EDEA++ D
Sbjct: 329 VHSTREADVEGVKWEEEAGPVAPEN---ELTVLNQLITHLRLRQSRYRTTIEEDEAIIAD 385
Query: 297 YNLHPKKRVATQLVRMEKKML 317
P+ VA +L+++EK +L
Sbjct: 386 PAKGPRPTVAARLLKIEKGIL 406
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 130/160 (81%), Positives = 148/160 (92%)
Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 248
++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248
Query: 249 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 308
SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308
Query: 309 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
LVR+EKK+LNACLQ TAD+I LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPILK 348
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 103/110 (93%), Positives = 107/110 (97%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+LERA+GI
Sbjct: 81 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLERADGI 140
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
KREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQ
Sbjct: 141 KREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQ 190
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 151/321 (47%), Positives = 200/321 (62%), Gaps = 9/321 (2%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYEKK KKSFW PYI+ELD+Q+ RG A ESPLLW + EL + L GSP + +R G
Sbjct: 127 MYEKKLKKKSFWYPYIKELDKQQARGPQAAESPLLWGDQELDSLLKGSPLLPAVRQRQAG 186
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
I++EY LDTVWFMAGSLF +YP+D+PTE F+FE+F+QAF VQ+ +VHLQ V +A+RFA
Sbjct: 187 IRKEYEALDTVWFMAGSLFNKYPFDLPTETFSFELFQQAFAVVQASIVHLQGVPIAKRFA 246
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
LVPLGPPL+AYSS K M+ +D +V+LVV P +AG + WCGPQPNS+LL+NYG
Sbjct: 247 LVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVSGPVEAGRPVAAWCGPQPNSRLLLNYGV 306
Query: 178 VDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 235
VDE NP+D+L L T DP + KR V G + Q F V R +LPY
Sbjct: 307 VDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLSEAGLATQQSFDVSVARPLP--PQLLPY 364
Query: 236 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ L + ++ SV S +E A L L Y + R A Y L D ++
Sbjct: 365 MMLALATTPEQVASV--SFSDTAGHDRELEAAALAALMAYVQRRTAAYAHPLWRDLEIIN 422
Query: 296 DYNLHPKKRVATQLVRMEKKM 316
D + P+++VA +L ++EK +
Sbjct: 423 DPSSTPRQKVAARLTKIEKSI 443
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 91/110 (82%), Positives = 100/110 (90%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL YL+GSP K E++ R E I
Sbjct: 19 MYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELDYLSGSPLKDEVVARDEAI 78
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 79 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQ 128
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 99/325 (30%), Positives = 152/325 (46%), Gaps = 27/325 (8%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPTKAEILERAEG 59
MYE+ G+KS W PY++ SP+LW+ E + L GSP E+ +R+
Sbjct: 161 MYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPALEEVQQRSAA 213
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
++ EY +L S F + P P E F+ E FK AF + S V+L L FA
Sbjct: 214 LEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADL---FA 263
Query: 120 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQPNSKLLINYG 176
LVP L + + +A L + D AV VDR YK GE + G + N+ LLI YG
Sbjct: 264 LVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNADLLITYG 322
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
FVDE+N D L +E L D K+ + Q+ S Q F ++ R + +L Y+
Sbjct: 323 FVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP---TQLLTYM 379
Query: 237 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
RL + D + ++ + + E L L + +L Y + ++ +L +
Sbjct: 380 RLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGGVDDEIRLLKN 437
Query: 297 YNLHPKKRVATQLVRMEKKMLNACL 321
+ ++RVA QL EKK+L + +
Sbjct: 438 KKISQRERVAAQLRLCEKKILTSTM 462
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 154/338 (45%), Gaps = 26/338 (7%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++EK +GK+SFW YI L V LW+E +LA L GSP A +
Sbjct: 185 VHEKAKGKESFWSSYIGVLPTVE-----EVYPTYLWAEEDLALLEGSPVIAATESMRRKL 239
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ EY ++ L ++P +P E T+E F+ AF + S + L +S AL
Sbjct: 240 EVEYATVEN------DLLDKFPEILPREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVAL 293
Query: 121 VPLG------PPLLAY-SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
VP P +Y ++ + + + D V + DR YK E + + GP+ NS LL+
Sbjct: 294 VPYADLFNHNPFANSYIDARQQGLFFSKTDEVVVYADRSYKKMEQVYISYGPKGNSDLLL 353
Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 233
YGF + NPY+ + V +L+ D Y+ K+ G + F ++ R + ++L
Sbjct: 354 LYGFSLDRNPYNSVDVTVSLDENDELYERKKAFLSEAGLPPTKAFPLYNDRYPD---ELL 410
Query: 234 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
YLRL ++ + L + E VLD L + KA +AGYP T +D +
Sbjct: 411 QYLRLIQLNTDQLRGRTLEDLSFEKKQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKL 470
Query: 294 LTD----YNLHPKKRVATQLVRMEKKMLNACL-QVTAD 326
+ D L +R+A + R EK +L + VT D
Sbjct: 471 MNDPGMFRALSKTQRMAVKHRRQEKVILRRTIAAVTKD 508
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 162/332 (48%), Gaps = 39/332 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEG 59
+YEK +G SF+ PY+ L L +PL LWS +L L GSPT + + +
Sbjct: 174 LYEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDK 227
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
+ REY ++ ++ Q P + ++ F+ F+ AF + S + ++R A
Sbjct: 228 LLREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIA 277
Query: 120 LVPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
LVP LL +S C A + V +AV + VDR Y+ E + V GP+ N +L
Sbjct: 278 LVPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQEL 335
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 231
L+ YGF E NP+D + + L+ DP Y +K + + GK +Q F ++ R +
Sbjct: 336 LLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYMDRYP---VE 392
Query: 232 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
M +LR + +++Q+ ++ VS E + LD+L +Y +L YP +L +DE
Sbjct: 393 MAEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRKYPTSLEDDE 447
Query: 292 AMLTDY----NLHPKKRVATQLVRMEKKMLNA 319
++ D L +R+A + EK++L+A
Sbjct: 448 KIIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 159/356 (44%), Gaps = 52/356 (14%)
Query: 1 MYEKK-QGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 56
+YE+ +G++S W PY+ ++D V W + +LA+L GSP A
Sbjct: 179 IYERNVRGEESPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSL 230
Query: 57 AEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
++REY+ L + G L +YP P EAF F+ ++ AF + S + L+ +
Sbjct: 231 QMKLRREYDAL-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQ 285
Query: 115 ARRFALVPLGPPLLAYSSKCKA----------MLAAVDDAVQLVVDRPYKAGESIVVWCG 164
ALVP L+ +S +A + + D+ V L DR Y+ E I + G
Sbjct: 286 GETLALVPYAD-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYG 344
Query: 165 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQR 209
P+ N++LL+ YGF E NP++ + V ++ DP ++K ++
Sbjct: 345 PKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQ 404
Query: 210 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 269
G+ + F +A R +ML YLRL ++ ++ +S E AVL
Sbjct: 405 VGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAVL 461
Query: 270 DQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 321
+ +L+ YP + +D A++ D +L +R+A + R EK++L +
Sbjct: 462 TSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 153/355 (43%), Gaps = 49/355 (13%)
Query: 6 QGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
+G+ SF+ Y+ E+D V W + +LA+L GSP A ++R
Sbjct: 264 KGEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRR 315
Query: 63 EYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
EY++L + G L ++P P E +TFE ++ AF + S + L+ + + R A+
Sbjct: 316 EYDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAM 370
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA----------VQLVVDRPYKAGESIVVWCGPQPNSK 170
VP L+ +S+ +A + A + V L DR Y+ E + + G + N++
Sbjct: 371 VPYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAE 429
Query: 171 LLINYGFVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVAQRNGKLSVQV 217
LL+ YGF E NPY+ + V ++ EDP +K G+
Sbjct: 430 LLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVD 489
Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 277
F +A R +ML YLRL ++ +S +S E +VL + K
Sbjct: 490 FPCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEASVLRSVVAAVK 546
Query: 278 ARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQVTADMI 328
+L +P T +D A++ D + +R+A + R EK++L L I
Sbjct: 547 YQLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAALEKQI 601
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 140/321 (43%), Gaps = 38/321 (11%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPT 87
V W + +LA+L GSP A ++REY+ L + G L ++P P
Sbjct: 123 VNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-----LGGQDGLIAKFPDRFPA 177
Query: 88 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD----- 142
E FT+E + AF + S + L+ + + R A+VP L+ +S+ A + A +
Sbjct: 178 EHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYAD-LINHSAFSGAFIDARESGDWL 236
Query: 143 -----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE- 196
+ V L DR Y+ E + + G + N++LL+ YGF E NPY+ + V ++
Sbjct: 237 FKNGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRT 296
Query: 197 ------------DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DP Q+K G+ F +A R +ML +LRL ++
Sbjct: 297 AALAAANEGIEVDPLAQEKVEFLASVGRDQTVDFPCYADRYP---VEMLEFLRLMMMTPE 353
Query: 245 SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--- 301
++ +SP E AVL + + K +L YP + +D ++ D L
Sbjct: 354 DTRGKPLADFDYSRTISPANEAAVLSSVVEAVKYQLNLYPQSEEDDANIIKDKALFRLLS 413
Query: 302 -KKRVATQLVRMEKKMLNACL 321
+R+A + R EK++L L
Sbjct: 414 YNQRMAVRHRRNEKRLLKRTL 434
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 144/329 (43%), Gaps = 34/329 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E+ +G +SFW YI L G + W ELAYL GSP + +
Sbjct: 176 ILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEGSPAASATASMMAKL 230
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E+ A L D E FTFE ++ AF + S + L+ A+
Sbjct: 231 RAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRAIRLKASRAGELLAM 279
Query: 121 VPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
VP P + + + A +D V L DR YK E + + GP+ N+
Sbjct: 280 VPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKFEQVFISYGPKSNAD 339
Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSVQVFHVHAGREKEAI 229
LL+ YGF + NP++ + + + +D Y K A+ G+ +S F ++A R +
Sbjct: 340 LLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSSAAFPLYADRFPD-- 397
Query: 230 SDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 288
+++ +LR+ + D + + + +S E AVLD + D A +A YPA
Sbjct: 398 -ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRDACDAAVAAYPAKSG 456
Query: 289 EDEAMLTDYNLHPKKRVATQLVRMEKKML 317
+D + D L +R+A +LV EK++L
Sbjct: 457 DD---VPDAFLSRNQRMAKRLVNTEKRIL 482
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/324 (29%), Positives = 145/324 (44%), Gaps = 27/324 (8%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-GSPTKAEILERAEG 59
MYE+++ + S W PY++ + SPLLW + E L GS A++ ++
Sbjct: 126 MYEQERSQDSPWYPYLKVF-------PASTLSPLLWEQEEQEELLRGSSALAKVKDQLTS 178
Query: 60 IKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
+++ ++ L DT+ + D P E FTF FK AF V S V+L L F
Sbjct: 179 LRQTFDALKDTL---------KDNKDFPMEKFTFSAFKAAFSVVLSRAVYLPSAEL---F 226
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
ALVP G + SS+ + V+L VD+ YK G+ + + ++ LI YGF
Sbjct: 227 ALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSADFLIRYGF 286
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
+DE + D + +E L + D KR + Q G Q F V+ R + +L Y R
Sbjct: 287 LDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNRFP---TQLLTYTR 343
Query: 238 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
L + D+ + I V E L L + +L + T+ +D L
Sbjct: 344 LARIQDSGLFAKITFEKDLI--VCQTNEYETLMLLMADCRTKLLSFSDTMEDDMQTLKRK 401
Query: 298 NLHPKKRVATQLVRMEKKMLNACL 321
NL K+RVA QL EK++L +
Sbjct: 402 NLSYKQRVAAQLRLKEKRILTDTM 425
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 101/216 (46%), Gaps = 21/216 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ EL YL S + +ER + YN+
Sbjct: 161 KSSRWSNYISALPRQ-------PYSLLYWTRAELDTYLEASQIRERAIERITNVTGTYND 213
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L +F +YP+ P E F E FK +F + S +V L S+ R ALVP
Sbjct: 214 LRL------RIFSKYPHLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD- 264
Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
+L +SS+ + L V DRPY+ GE + + G + N +LL++YGFV + N
Sbjct: 265 MLNHSSEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGRKSNGELLLSYGFVPREGTN 324
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
P D + + +L D Y++K +++G Q F
Sbjct: 325 PSDSVELSLSLKKSDKCYKEKLEALKKHGLSVSQCF 360
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ TEL YL S + +ER + Y +
Sbjct: 162 KSSRWYNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L + +F ++P+ P E F E FK +F + S +V L S+ RFALVP
Sbjct: 215 LRS------RIFSKHPHLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265
Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
+L ++ + + L V DRPY+ GE + + G + N +LL++YGFV + N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K +++G + Q F V
Sbjct: 326 PSDSVELALSLRKNDKCYKEKLDALKKHGLSTPQCFPVR 364
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 140/320 (43%), Gaps = 25/320 (7%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYE+++ + S W PY++ + L E + L GS A++ ++ +
Sbjct: 126 MYEQERSQDSPWYPYVKVFPAS------TLSLLLWEQEEQEELLRGSSALAKVKDQLTSL 179
Query: 61 KREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
++ ++ L DT+ + D P E FTF FK AF V S V+L L FA
Sbjct: 180 RQTFDALKDTL---------KDNKDFPMEKFTFSAFKTAFSVVLSRAVYLPSAEL---FA 227
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 178
LVP G + SS+ + V+L VD+ YK G+ + + ++ LI YGF+
Sbjct: 228 LVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSADFLIRYGFL 287
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
DE + D + +E L + D KR + Q G Q F ++ R + +L Y RL
Sbjct: 288 DESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR---FPTQLLTYTRL 344
Query: 239 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 298
+ D+ + I VS E L L + +L T+ ++ L N
Sbjct: 345 ARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLMADCRTKLLSSSDTMEDEMQTLRRKN 402
Query: 299 LHPKKRVATQLVRMEKKMLN 318
L K+RVA QL EK++L
Sbjct: 403 LSYKQRVAAQLRLKEKRILT 422
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ TEL YL S + +ER + Y +
Sbjct: 162 KSSRWFNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L + +F ++P P E F E FK +F + S +V L S+ RFALVP
Sbjct: 215 LRS------RIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265
Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
+L ++ + + L V DRPY+ GE + + G + N +LL++YGFV + N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K +++G + Q F V
Sbjct: 326 PSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ TEL YL S + +ER + Y +
Sbjct: 162 KSSRWFNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L + +F ++P P E F E FK +F + S +V L S+ RFALVP
Sbjct: 215 LRS------RIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265
Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
+L ++ + + L V DRPY+ GE + + G + N +LL++YGFV + N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K +++G + Q F V
Sbjct: 326 PSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ TEL YL S + +ER + Y +
Sbjct: 162 KSSRWFNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L + +F ++P P E F E FK +F + S +V L S+ RFALVP
Sbjct: 215 LRS------RIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265
Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
+L ++ + + L V DRPY+ GE + + G + N +LL++YGFV + N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVIFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K +++G + Q F V
Sbjct: 326 PSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L WS+ EL YL S + +ER + YN+
Sbjct: 155 KSSRWFSYISALPRQ-------PYSLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND 207
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
M +F +YP P E F E FK +F + S +V L S+ + ALVP
Sbjct: 208 ------MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD- 258
Query: 127 LLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
++ +S + + L + +V DRPY+ GE + + G + N +LL++YGFV ++ N
Sbjct: 259 MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTN 318
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K + ++ G Q F +
Sbjct: 319 PSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIR 357
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L WS+ EL YL S + +ER + YN+
Sbjct: 152 KSSRWFSYISALPRQ-------PYSLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND 204
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
M +F +YP P E F E FK +F + S +V L S+ + ALVP
Sbjct: 205 ------MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD- 255
Query: 127 LLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
++ +S + + L + +V DRPY+ GE + + G + N +LL++YGFV ++ N
Sbjct: 256 MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTN 315
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K + ++ G Q F +
Sbjct: 316 PSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIR 354
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 158/351 (45%), Gaps = 36/351 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + ++PL ++E E+ YL + ++ + + R+Y
Sbjct: 94 NSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY---- 142
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FTF+ ++ A +V + + +R AL+PL
Sbjct: 143 -AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 199
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+S DD + V + +K+GE I ++ G + N++ +I+ GF E+N +
Sbjct: 200 DMCNHTNSLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLH 259
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 260 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNED 318
Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
G++ + + + PVS E + +L + +AR L Y T+ +D
Sbjct: 319 ELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDN 374
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 342
+L ++ +A +L R+EK++L L+ +D L + P P
Sbjct: 375 KVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 425
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 161/351 (45%), Gaps = 33/351 (9%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 87 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 135
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 192
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 193 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 252
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD- 243
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 253 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 311
Query: 244 --------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+++L
Sbjct: 312 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 371
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 346
+++L + ++A +L EK++L ++ A + P + +SP APL
Sbjct: 372 NHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGKAPL 417
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 138/316 (43%), Gaps = 20/316 (6%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFWLPY++ L + SPL ++ +L L SPT +E++ + I R+Y
Sbjct: 168 SFWLPYLKILPH-------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFN 220
Query: 70 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLL 128
+ F L + P I + ++ ++ A +V + + + R AL+PL
Sbjct: 221 L-FQGHELASKLP--IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCN 277
Query: 129 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
+ + + +D + AG + ++ G + N++LLI+ GFV N DRL
Sbjct: 278 HTNGQITTDFSLKNDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLT 337
Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 248
+ ++ DP + K V R + ++F +H G SD L +LR+ +++ +++
Sbjct: 338 IRLGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLR 395
Query: 249 SVISSLGPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 301
+ ++ I VS ER LA L YP + ED +L +L
Sbjct: 396 TALACRQQISKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLST 455
Query: 302 KKRVATQLVRMEKKML 317
R+A QL EK +L
Sbjct: 456 HARLAVQLRHCEKNIL 471
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 155/338 (45%), Gaps = 49/338 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++++ K+S W PY+ L + P+ +SE +L++L GSP + ++ E+ I
Sbjct: 234 LLQERKNKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQVREKKADI 287
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KR+Y+++ +V F +Y TFE F A + S V LQ ++ + A
Sbjct: 288 KRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-INEQKTDAF 333
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
VPL L + K DD +Q + D P GE + G + NS+ +NY
Sbjct: 334 VPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCNSRFFLNY 389
Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKEAISDMLP 234
GF++ DN + + + + EDP + K+ + G + +V+ + +++ +S+ +
Sbjct: 390 GFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQNVSEFMS 447
Query: 235 YLRLGYVSDTSEMQSVISSLG----------------PICPVSPCMERAVLDQLADYFKA 278
YLR + D S++ ++SSL P+S E + ++++ +
Sbjct: 448 YLRFILIRDNSKL-LMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRISNMCQT 506
Query: 279 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
++ Y TL ED+ +L NL +R L EK++
Sbjct: 507 SISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 150/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P + +P E+FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPQAHKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEG 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED+ L
Sbjct: 394 ELKEHLLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++ L + +A +L EK++L ++ A
Sbjct: 454 NHGLSVRATMAVKLRLGEKEILEKAVESAA 483
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 62/366 (16%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K SFW PYI+ L + G + L ++ EL L GSPT ++ + +
Sbjct: 156 EKHNHSSFWTPYIKTLPKSYG-------TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQ 208
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFAL 120
Y + LFQ + AFT++ F A AV Q+ V +L+ +AL
Sbjct: 209 YTYIH-------DLFQIRKDILHINAFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YAL 260
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
+P + D+ + R YK GE + ++ GP+PNS LL++ GFV E
Sbjct: 261 IPAWDMCNHDHGDLQTFWDVNSDSTESHAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYE 320
Query: 181 DNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
+N +D L + L + +DK R++ N K+ Q + G D++ +LR+
Sbjct: 321 NNRFDALAIRVRLAPDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----LAVDLMAFLRI- 375
Query: 240 YVSDTSEMQSVISS-------------------------LGPICPVSPCMERAVLDQLAD 274
+ + E+Q V+ + P ++ E A L
Sbjct: 376 HAMNEQELQQVLGAYDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNELAALQLAEA 435
Query: 275 YFKARLAGYPATLS---------EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ L+ YP TL ED+A L +L P R T L EK++LN L
Sbjct: 436 KCLSLLSLYPTTLQVANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEILNRTL---- 491
Query: 326 DMIMLL 331
D I LL
Sbjct: 492 DAIRLL 497
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 157/351 (44%), Gaps = 36/351 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + ++PL ++E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FTF+ ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + +K+GE I ++ G + N++ +I+ GF E+N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNED 393
Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
G++ + + + PVS E + +L + +AR L Y T+ +D
Sbjct: 394 ELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDN 449
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 342
+L ++ +A +L R+EK++L L+ +D L + P P
Sbjct: 450 KVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 500
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 157/351 (44%), Gaps = 36/351 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + ++PL ++E E+ YL + ++ + + R+Y
Sbjct: 168 NSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY---- 216
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FTF+ ++ A +V + + +R AL+PL
Sbjct: 217 -AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 273
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + +K+GE I ++ G + N++ +I+ GF E+N +
Sbjct: 274 DMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLH 333
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 334 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNED 392
Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
G++ + + + PVS E + +L + +AR L Y T+ +D
Sbjct: 393 ELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDN 448
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 342
+L ++ +A +L R+EK++L L+ +D L + P P
Sbjct: 449 KVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 499
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 150/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 216
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 217 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWD 274
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 275 XCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 334
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------- 238
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 335 RVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVFALHFT-EPPISAQLLAFLRVFCXTEEE 393
Query: 239 --GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
++ S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 394 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 453
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + + A +L EK++L ++ A
Sbjct: 454 HDLSVRAKXAIKLRLGEKEILEKAVKSAA 482
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 149/326 (45%), Gaps = 22/326 (6%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M EKK G S W PYI L + S + WSE EL+ L GS L E +
Sbjct: 135 MREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELV 187
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
E+ +L+ + Q +P+D+ + F F F +C+ Q + L L
Sbjct: 188 ANEFAKLEEEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCL-EGQSLVLIPLADL 243
Query: 121 VPLGPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYG 176
P + A Y+ + + A L + + L P KAG+ +++ + + N++L ++YG
Sbjct: 244 ANHSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYG 303
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
+ + + + + D Y DK +A+ NG F + E+ ++MLPYL
Sbjct: 304 LTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYL 361
Query: 237 RLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
RL + D ++S+ S G + P+SP E + + D + L+GY T++EDE
Sbjct: 362 RLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDE 421
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKML 317
+L + ++ P+ +A + EKK+L
Sbjct: 422 KLLAEGDIDPRLEIAITIRLGEKKVL 447
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 152/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + ++A +L EK++L ++ A
Sbjct: 455 HDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 152/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + ++A +L EK++L ++ A
Sbjct: 455 HDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 100/219 (45%), Gaps = 21/219 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
+ S W YI L RQ S L W++ EL YL S + +ER + YN+
Sbjct: 157 ESSRWSNYISALPRQ-------PYSLLYWTQAELDRYLEASQIRERAIERINNVIGTYND 209
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L +F +YP P E F E FK +F + S +V L S+ ALVP
Sbjct: 210 LRL------RIFSKYPDLFPDEVFNIESFKWSFGILFSRLVRLP--SMGGNVALVPWAD- 260
Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
+L +S + L + DRPY+ GE + + G + N +LL++YGFV ++ N
Sbjct: 261 MLNHSCDVETFLDYDKTSKGIVFTTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGAN 320
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
P D + + +L D Y++K + + G + Q F +
Sbjct: 321 PSDSVELSLSLKKSDASYKEKLELLKNYGLSASQCFPIQ 359
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 152/330 (46%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P +AFT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED++ L
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
D++L + +A +L EK++L ++ A
Sbjct: 454 DHDLSVRAAMAIKLRLGEKEILEKAVKSAA 483
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 152/329 (46%), Gaps = 24/329 (7%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M EKK G S W PYI L + S + WSE EL+ L GS L E +
Sbjct: 135 MREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELV 187
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSL 114
E+ +L+ + Q +P+D+ + F F + + +AF ++ ++ L + +
Sbjct: 188 ANEFAKLEEEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWV 245
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLI 173
+ + P A+ + A L + + L P KAG+ +++ + + N++L +
Sbjct: 246 QQANHSPDITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELAL 304
Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 233
+YG + + + + + D Y DK +A+ NG F + E+ ++ML
Sbjct: 305 DYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANML 362
Query: 234 PYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 288
PYLRL + D ++S+ S G + P+SP E + + D + L+GY T++
Sbjct: 363 PYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIA 422
Query: 289 EDEAMLTDYNLHPKKRVATQLVRMEKKML 317
EDE +L + ++ P+ +A + EKK+L
Sbjct: 423 EDEKLLAEGDIDPRLEIAITIRLGEKKVL 451
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 116/243 (47%), Gaps = 25/243 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
+ E + + S W PY L + S L W+E E+ +LT SP + + LE
Sbjct: 94 ISEASREESSRWFPYFATLPK-------TPSSILQWTEEEVNTWLTASPVREKALECIRD 146
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
+ Y +L ++F ++P P++ +T FK AF + S +V L V + A
Sbjct: 147 VTETYRDL------RATIFLKHPEVFPSQVYTLAAFKWAFGILFSRLVRLPSVG---KLA 197
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
LVP +L +S + + L + +V V DR Y++GE + + G + + +L + YG
Sbjct: 198 LVPWAD-MLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQVFISYGKRSSGELFLAYG 256
Query: 177 FV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 235
F+ E N +D + +E ++++DP ++ K A G S Q F V R+ + +L Y
Sbjct: 257 FIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQRFPV---RKDGFPAQLLAY 313
Query: 236 LRL 238
RL
Sbjct: 314 ARL 316
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 150/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
S DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED++ L
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 454 NHDLSARATMAIKLRLGEKEILERAVKSAA 483
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/324 (23%), Positives = 146/324 (45%), Gaps = 24/324 (7%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 177 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 224
Query: 70 VWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPL 127
+F + + +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 225 AYFYKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMC 284
Query: 128 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
S DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+
Sbjct: 285 NHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRV 344
Query: 188 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--- 244
++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 345 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELK 403
Query: 245 ------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 298
S + + + PVS E + L D L Y T+ ED++ L +++
Sbjct: 404 EHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHD 463
Query: 299 LHPKKRVATQLVRMEKKMLNACLQ 322
L + +A +L EK++L ++
Sbjct: 464 LSARATMAIKLRLGEKEILERAVK 487
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 150/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 177 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 224
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 225 AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 282
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
S DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 283 MCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 342
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 343 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 401
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED++ L +
Sbjct: 402 LKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKN 461
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + +A +L EK++L ++ A
Sbjct: 462 HDLSARATMAIKLRLGEKEILERAVKSAA 490
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/220 (31%), Positives = 104/220 (47%), Gaps = 23/220 (10%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ EL YL S + +ER + YN+
Sbjct: 166 KSSKWSNYISALPRQ-------PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 218
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L +F +YP P E F E FK +F + S +V L S+ + ALVP
Sbjct: 219 LRL------RIFSKYPDLFPEEVFNLETFKWSFGILFSRLVRLP--SMDGKVALVPWAD- 269
Query: 127 LLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED-- 181
+L +S + + L D + Q VV DR Y+ GE + + G + N +LL++YGFV +
Sbjct: 270 MLNHSCEVETFLD-YDKSSQGVVFTTDRQYEPGEQVFISYGKKSNGELLLSYGFVPREGT 328
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
NP D + + +L D Y++K +++G + Q F V
Sbjct: 329 NPSDSVELSLSLKKSDKSYKEKLEALKKHGFSASQCFPVR 368
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/334 (28%), Positives = 148/334 (44%), Gaps = 38/334 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ +S + WSE EL + G+ ++L G+
Sbjct: 136 LSEVARGADSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGV 184
Query: 61 KREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLA 115
K EY +E D+V S + D+ + TF+ F AF ++S V + K++L
Sbjct: 185 K-EYVQSEFDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALI 239
Query: 116 RRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSK 170
LV P + + S K K + + L K+G+ I + + + N++
Sbjct: 240 PFADLVNHSPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLDKSNAE 298
Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
L ++YGFV+ + D V ++ DP Y DK +A+ NG F V +
Sbjct: 299 LALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPLPP 356
Query: 231 DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
MLPYLRL + T SV L P+SP E ++ + D K+ LA Y
Sbjct: 357 QMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALADY 414
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
T+ EDE + NL P+ +A + EKK+L
Sbjct: 415 HTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVL 448
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 150/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 176 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 224
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 225 -AYFY--RVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 281
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
S DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 282 DMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 341
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 342 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 400
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED++ L
Sbjct: 401 ELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLK 460
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 461 NHDLSARATMAVKLRLGEKEILERAVKSAA 490
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 77/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+++L
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + ++A +L EK++L ++ A
Sbjct: 454 NQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 348 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 396
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 397 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 453
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 454 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 513
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 514 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 572
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED + L
Sbjct: 573 ELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLR 632
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 633 NHDLSVRAAMAVKLRLGEKEILERAVKSAA 662
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKTVKSAA 483
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+++L
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + ++A +L EK++L ++ A
Sbjct: 454 NQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 185 SFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY----- 232
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 233 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 290
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 291 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 350
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 351 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 409
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+++L +
Sbjct: 410 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 469
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 470 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 498
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 172 SFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY----- 219
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 220 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 277
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 278 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 337
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 338 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 396
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+++L +
Sbjct: 397 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 456
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 457 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+++L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 105/220 (47%), Gaps = 23/220 (10%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
+ S W YI L RQ S L W+ EL YL S + +ER + YN+
Sbjct: 155 QSSRWSNYISALPRQ-------PYSLLYWTRAELDKYLEASQIRERAIERINDVTGTYND 207
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L +F ++P+ P E F E FK +F + S +V L S+ + ALVP
Sbjct: 208 LRL------RIFSKHPHLFPEEVFNMETFKWSFGILFSRLVRLP--SMDEKIALVPWAD- 258
Query: 127 LLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED-- 181
+L +S + + L D + Q VV DR Y+ E + + G + N +LL++YGFV +
Sbjct: 259 MLNHSCEVETFLD-YDKSSQGVVFTTDRTYQPSEQVFISYGKKSNGELLLSYGFVPREGT 317
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
NP D++ + +L D Y++K +++G + Q F +
Sbjct: 318 NPNDKVELLLSLKKSDKCYKEKSEAMKKHGLSTSQCFPIQ 357
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 146/343 (42%), Gaps = 56/343 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ +S + WSE EL + G+ ++L G+
Sbjct: 141 LREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGV 189
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVS 113
K EY + S F+ +I +E TF F AF ++S V +
Sbjct: 190 K-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR-- 237
Query: 114 LARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV- 161
+ AL+P L+ +S K K + D L K+GE I +
Sbjct: 238 -GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQ 294
Query: 162 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
+ + N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F +
Sbjct: 295 YDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIV 354
Query: 222 AGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLAD 274
G + MLPYLRL + T +V L PVS E A+ + +
Sbjct: 355 LG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRN 410
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
K+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 411 ACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 453
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 58/344 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ +S + WSE EL + G+ ++L G+
Sbjct: 295 LREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGV 343
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVS 113
K EY + S F+ +I +E TF F AF ++S V +
Sbjct: 344 K-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR-- 391
Query: 114 LARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV- 161
+ AL+P L+ +S K K + D L K+GE I +
Sbjct: 392 -GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQ 448
Query: 162 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
+ + N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F +
Sbjct: 449 YDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIV 508
Query: 222 AGREKEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLA 273
G E++ MLPYLRL + T +V L PVS E A+ +
Sbjct: 509 LG---ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAICQVIR 563
Query: 274 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
+ K+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 564 NACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 607
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 172 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 219
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 220 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 277
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 278 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 337
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 338 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 396
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+++L +
Sbjct: 397 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 456
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + ++A +L EK++L ++ A
Sbjct: 457 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 152/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQSLPGE-------YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED++ L +
Sbjct: 395 LKEHLLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + +A +L EK++L ++ A
Sbjct: 455 HDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 138/296 (46%), Gaps = 17/296 (5%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + WSE EL + G+ L E ++ E+ +++ + S +P + +
Sbjct: 153 STIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWA 212
Query: 92 FEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQL 147
F I + +AF ++ Q + L L+ P + Y+ + K A L + D L
Sbjct: 213 FGILRSRAFSRLRG-----QNLVLIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSL 267
Query: 148 VVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
KAGE +++ + + N++L ++YGF++ + + ++ DP + DK +
Sbjct: 268 RTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDI 327
Query: 207 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVS 261
A+ NG + F + G+ A MLPYLRL + D ++S+ + G + PVS
Sbjct: 328 AESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVS 385
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
E + + D K+ L+GY T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 386 RANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 152/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+ +L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + +A +L EK++L ++ A
Sbjct: 455 HDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 138/296 (46%), Gaps = 17/296 (5%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + WSE EL + G+ L E ++ E+ +++ + S +P + +
Sbjct: 153 STIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWA 212
Query: 92 FEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQL 147
F I + +AF ++ Q + L L+ P + Y+ + K A L + D L
Sbjct: 213 FGILRSRAFSRLRG-----QNLVLIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSL 267
Query: 148 VVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
KAGE +++ + + N++L ++YGF++ + + ++ DP + DK +
Sbjct: 268 RTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDI 327
Query: 207 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVS 261
A+ NG + F + G+ A MLPYLRL + D ++S+ + G + PVS
Sbjct: 328 AESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVS 385
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
E + + D K+ L+GY T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 386 RANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 151/354 (42%), Gaps = 53/354 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E G+ SFW PYI L + + L ++ EL L+GS E L+ I
Sbjct: 127 ILELSAGESSFWHPYISVLPD-------SFNTVLYFNIEELELLSGSAVLDEALKLHRSI 179
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQKVSL--- 114
R+Y + F L + P+ + FT+++++ A AV Q+ V + L
Sbjct: 180 ARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLYRWAVSAVMTRQNAVPWTESDGLGGD 235
Query: 115 -------ARRFALVPL--------GPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYK 154
A ALVPL G L Y S +C AM R +
Sbjct: 236 DVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSASMVRCYAM-------------RDFD 282
Query: 155 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 214
GE + ++ G + N++ I+ GFV EDN YD + ++ ++ +DP + K + + + LS
Sbjct: 283 KGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCE-DHDLS 341
Query: 215 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 274
+ R++ D+ +LR+ + D S+ ++ S I S R L L
Sbjct: 342 LSGTFALVARDRPVSEDLSTFLRILVLKDASQPEAF--SAEHILTSSDSNARDALTFLVV 399
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
+ L +P + E E ++ D + + ++A +L +E K+L + L+ + +
Sbjct: 400 RIELLLKAFPKSDEEYEDIIKDGASNARVKMAARLRLLESKVLASVLETLGNHV 453
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 147/331 (44%), Gaps = 30/331 (9%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R + AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTE 392
Query: 245 SEMQSVISSLGPI----------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
E++ + G I PVS E + L D L Y T+ ED+A+L
Sbjct: 393 EELREHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVL 452
Query: 295 TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
L + +A +L EK++L ++ A
Sbjct: 453 RSPALSARAAMAVKLRLGEKEILEKAVRSAA 483
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 153/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 170 SFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P +P ++FT++ ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +S+
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISAQLLAFLRVFCMSEEE 394
Query: 246 EMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LAGYPATLSEDEAMLTD 296
+ +I ++G I + D +L + +AR L Y T+ D++ L
Sbjct: 395 LKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVDKSFLET 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L P +A +L EK++L ++ A
Sbjct: 455 HDLTPHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 152/329 (46%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 170 SFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+ +L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + +A +L EK++L ++ A
Sbjct: 455 HDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 105/220 (47%), Gaps = 23/220 (10%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
K S W YI L RQ S L W+ EL YL S + +ER + YN+
Sbjct: 155 KSSRWNNYISALPRQ-------PYSLLYWTREELDRYLEASEIRERAIERITNVVGTYND 207
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
L +F ++P P E F E FK +F + S +V L S+ + ALVP
Sbjct: 208 LSI------RVFSKHPELFPEEVFNIETFKWSFGILFSRLVRLP--SMDGKVALVPWAD- 258
Query: 127 LLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED-- 181
+L ++ + + L D A Q VV DR Y+ GE + + G + N +LL++YGFV ++
Sbjct: 259 MLNHNCEVETFLD-YDKASQGVVFTTDRAYQPGEQVFISYGKKSNGELLLSYGFVPKEGS 317
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
NP D + + +L D Y++K +++G + Q F +
Sbjct: 318 NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQ 357
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 152/330 (46%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 87 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY---- 135
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 192
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 193 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 252
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 253 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 311
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED+ +L
Sbjct: 312 ELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLK 371
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + ++A +L EK++L ++ A
Sbjct: 372 NQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 147/339 (43%), Gaps = 35/339 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ +K S W YIR L R ++SPL WSE ELA L G+ + + E +
Sbjct: 71 LLHEKAKPHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFL 123
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSL 114
KREY+++ T + + P +T E F AF ++S +L V L
Sbjct: 124 KREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPL 177
Query: 115 A----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
A F L P + K + A + + E ++ + + N++
Sbjct: 178 ADFVNHGFGLTNEDP---GWKVKSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQ 234
Query: 171 LLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
L +YGFVD D N D + ++ + DK +AQ G S F+++ R +
Sbjct: 235 LATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGP 292
Query: 229 ISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGY 283
DM+ YLRL + SD+ ++++ + P+S E A+ + + + +A L Y
Sbjct: 293 PEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREY 352
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+T+ ED +L L +K++A + EK++L LQ
Sbjct: 353 SSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWLPYIQTLPSE-------YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ ++ +LG PVS E + L D L Y T+ ED+A+L
Sbjct: 394 ELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
L + ++A +L EK++L +Q A
Sbjct: 454 GPELPTRMKMAVKLRLGEKEILERTVQSAA 483
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED+A L
Sbjct: 394 ELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + +A +L EK++L ++ A
Sbjct: 454 GHDLSIRATMAVKLRLGEKEILERAVKSAA 483
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 101/214 (47%), Gaps = 21/214 (9%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
S W+ YI L RQ S L W+ EL AYL SP + ++R + YN+L
Sbjct: 163 SRWISYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLR 215
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
+F ++P P E + E F +F + S +V L S+ R ALVP +L
Sbjct: 216 D------RIFSRHPDLFPEEVYNIETFLWSFGILFSRLVRLP--SMDGRVALVPWAD-ML 266
Query: 129 AYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPY 184
+S + + L + DR Y+ GE + + G + + +LL++YGFV ++ NP
Sbjct: 267 NHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPN 326
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
D + + +L+ D Y++K +RNG + F
Sbjct: 327 DSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 147/329 (44%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 159 SFWQPYIQTLPSE-------YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY----- 206
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 207 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 264
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 265 MCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 324
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------- 238
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 325 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEE 383
Query: 239 --GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
++ + + + + PVS E + L D L Y T ED++ L +
Sbjct: 384 LKDHLLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKN 443
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L + R+A +L EK++L + A
Sbjct: 444 HDLSVRARMAIKLRLGEKEILEKAVTSAA 472
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y + ED++ L
Sbjct: 394 ELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 148/328 (45%), Gaps = 36/328 (10%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
+F + Q +P+ +P +AFT+E ++ V S + + L + G P
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKP 272
Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
+ + DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR
Sbjct: 273 QITTGYNLE------DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDR 326
Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 246
+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 327 VKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEEEL 385
Query: 247 MQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
+ + I +LG PVS E + L D L Y T+ ED+A L ++
Sbjct: 386 KEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKNH 445
Query: 298 NLHPKKRVATQLVRMEKKMLNACLQVTA 325
NL + +A +L EK++L ++ A
Sbjct: 446 NLSVRATMAIKLRLGEKEILEKAVKSAA 473
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 102/208 (49%), Gaps = 23/208 (11%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 74 SRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL- 125
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
+F ++P P E + E F+ +F + S +V L S+ + ALVP +L
Sbjct: 126 -----RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWAD-ML 177
Query: 129 AYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NP 183
++ + A L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ NP
Sbjct: 178 NHNPEVDAFLD-FDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNP 236
Query: 184 YDRLVVEAALNTEDPQYQDKRMVAQRNG 211
D + +LN D Y++K +R+G
Sbjct: 237 NDSVEFSVSLNKSDDCYREKLQALKRHG 264
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P +AFT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H + + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED++ L
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLR 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P +AFT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H + + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED++ L
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLR 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 147/339 (43%), Gaps = 35/339 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ +K S W YIR L R ++SPL WSE ELA L G+ + I E +
Sbjct: 71 LLHEKAKPHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSINGFKEFL 123
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSL 114
KREY+++ T + + P +T E F AF ++S +L V L
Sbjct: 124 KREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPL 177
Query: 115 A----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
A F L P + K + A + + E ++ + + N++
Sbjct: 178 ADFVNHGFGLTNEDP---YWHVKSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQ 234
Query: 171 LLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
L +YGFVD D N D + ++ + DK +AQ G S F+++ R +
Sbjct: 235 LATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGP 292
Query: 229 ISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGY 283
DM+ YLRL + SD+ ++++ + P+S E A+ + + + +A L Y
Sbjct: 293 PEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREY 352
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+T+ ED +L L +K++A + EK++L LQ
Sbjct: 353 SSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 82/336 (24%), Positives = 146/336 (43%), Gaps = 26/336 (7%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++K + SFW PYI L + PL + E A L G + +A+ +
Sbjct: 99 LLQEKHREGSFWKPYIDSLPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSL 152
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ +Y SL Q P E FT F A ++V S + L+K + L
Sbjct: 153 REDYL----------SLCQNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TL 198
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP+ +L + + +D V+ + AG+ + G + N +L+++GFV
Sbjct: 199 VPMAD-MLNHRRPPDVLWETTEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLLHFGFV 257
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
+DN +D + + DP K+M+ + + F + +LR+
Sbjct: 258 TDDNEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRI 317
Query: 239 GYV--SDTSEMQS-VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+D ++ S V+S + P+S E VL+ LA +ARL+ +P +L++DE +L
Sbjct: 318 AAAVPNDIEDISSRVMSGERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLR 377
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
+L P R + R EK+++ L++T + LL
Sbjct: 378 GESLSPNARNCVLVRRAEKQLIEDYLEMTRVCLKLL 413
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 140/331 (42%), Gaps = 30/331 (9%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EK +G SFW P++ L + PL +SE E A + GS LER +
Sbjct: 94 LEEKHRGADSFWKPFVDSLP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEV 142
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
+R+ E + + Q +P E F FE + A +++ S + L+ +
Sbjct: 143 QRQSFEQE---------YAQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGP--S 191
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
LVPL + + + + +D +++ R AG I G + + L++ GF
Sbjct: 192 LVPLSD-MFNHRQPPDVLWSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFLLHSGF 250
Query: 178 V-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
V D + D + + L DP K+ + + F V + A + +L
Sbjct: 251 VPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWSVFSFL 310
Query: 237 RLGYVSDTSEM---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
R+ + S + ++S I PVS E VL LA + RL +P TL EDE +
Sbjct: 311 RMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLEEDERL 370
Query: 294 LTDYNLHPKKRVATQLVRMEKKMLNACLQVT 324
L + L P +R L R EK++L L++T
Sbjct: 371 LREGPLSPNERSCVLLRRQEKRLLGDYLELT 401
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P +AFT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H + + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED++ L
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P +AFT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H + + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEE 393
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED++ L
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+++L + +A +L EK++L ++ A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 77/330 (23%), Positives = 147/330 (44%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY---- 231
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 232 -AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 288
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 289 DMCNHTTGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 348
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 349 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 407
Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
++ + + + + PVS E + L D L Y T+ ED A L
Sbjct: 408 ELKDHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLK 467
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + +A +L EK++L ++ A
Sbjct: 468 NSDLSVRATMAIKLRLGEKEILEKAVKSAA 497
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 100/214 (46%), Gaps = 21/214 (9%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
S W+ YI L RQ S L W+ EL AYL SP + ++R + YN+L
Sbjct: 163 SRWISYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLR 215
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
+F ++P P E + E F +F + S +V L S+ R LVP +L
Sbjct: 216 D------RIFSRHPDLFPEEVYNIETFLWSFGILFSRLVRLP--SMDGRVVLVPWAD-ML 266
Query: 129 AYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPY 184
+S + + L + DR Y+ GE + + G + + +LL++YGFV ++ NP
Sbjct: 267 NHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPN 326
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
D + + +L+ D Y++K +RNG + F
Sbjct: 327 DSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 143/314 (45%), Gaps = 28/314 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+++L
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 453
Query: 296 DYNLHPKKRVATQL 309
+ +L + ++A +L
Sbjct: 454 NQDLSVRAKMAIKL 467
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 147/330 (44%), Gaps = 38/330 (11%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFWLPYIR L ++ ++PL + + E+ L G+ ++L + R+Y
Sbjct: 170 SFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY----- 217
Query: 70 VWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALV 121
+F L Q +P +P ++FTF+ ++ A +V + + ++V+LA L+
Sbjct: 218 AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA----LI 271
Query: 122 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
PL + DD + V + YK E I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQE 331
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 241
N +D++ ++ ++ + Y K V R G +F ++ E+ + +L +LR+ +
Sbjct: 332 NAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPISAQLLAFLRV-FC 389
Query: 242 SDTSEMQ---------SVISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
E++ + I +LG + PVS E + L L Y T ED
Sbjct: 390 MKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEEDS 449
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
+ L L P R+A QL EK +L L
Sbjct: 450 STLEKSELSPHSRMAIQLRLAEKWILEKAL 479
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 101/213 (47%), Gaps = 20/213 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
E GK S W PYI L R+ G S LLW+ ++ A+L+ + K L+ ++
Sbjct: 121 EASLGKSSPWYPYIAALPRRPG-------SILLWTALDVEAHLSATSIKDRALQCVREVE 173
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
+N+L+ FM + P E F E FK AF + S +V L SL ++ AL+
Sbjct: 174 DTFNDLNKQVFMKNR------EEFPPEVFNLESFKWAFGILFSRLVRLP--SLGQKLALI 225
Query: 122 PLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 178
P G +L + ++ L + ++ +DR Y++ + + + G + N +LL+ YGFV
Sbjct: 226 PFGD-MLNHDTEVTTFLDFDSGSKSITCTLDRGYESNKEVFISYGKRSNGELLVAYGFVP 284
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
N D + + L+ D Y+ K + +G
Sbjct: 285 SGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 94/193 (48%), Gaps = 16/193 (8%)
Query: 32 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEA 89
S L W+ EL AYL SP + ++R + YN+L D ++ LF P E
Sbjct: 177 SLLYWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEV 229
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQL 147
+ E F+ +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 230 YNLETFRWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVF 286
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 205
DR Y+ GE + + G + + +LL++YGFV ++ NP D + + +LN D Y++K
Sbjct: 287 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQ 346
Query: 206 VAQRNGKLSVQVF 218
+RNG + F
Sbjct: 347 ALKRNGLSEFESF 359
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 98/242 (40%), Gaps = 47/242 (19%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++EK+ G+ S W PYI L R + S W + ELA + S E + R I
Sbjct: 114 LFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELAMIQESSLSYETMSRRAAI 168
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E+NE+ + FQ+Y + + ++ FK A+V C ++ + A+
Sbjct: 169 REEFNEMQPI-------FQRYEH-VFGGPVSYASFKHAYVTATVCS-RAWRIDGLEKLAM 219
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD---------------------AVQLVVDRPYKAGESI 159
VP + + AML D QL D+ Y AGE +
Sbjct: 220 VPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALFCAQLFADKNYAAGEQV 278
Query: 160 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
+ GP N+ L +++GF NP+D++ + ++ D ++K +Q H
Sbjct: 279 TISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK-----------LQYLH 327
Query: 220 VH 221
H
Sbjct: 328 AH 329
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 94/193 (48%), Gaps = 16/193 (8%)
Query: 32 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEA 89
S L W+ EL AYL SP + ++R + YN+L D ++ LF P E
Sbjct: 177 SLLYWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEV 229
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQL 147
+ E F+ +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 230 YNLETFRWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVF 286
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 205
DR Y+ GE + + G + + +LL++YGFV ++ NP D + + +LN D Y++K
Sbjct: 287 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQ 346
Query: 206 VAQRNGKLSVQVF 218
+RNG + F
Sbjct: 347 ALKRNGLSEFESF 359
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 153/334 (45%), Gaps = 36/334 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + +PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FT++ ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +S+
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMSEE 393
Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
+ +I +LG PVS E +L + +AR L Y T+ +D+
Sbjct: 394 ELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVEDDK 449
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ L ++L +A +L EK++L ++ A
Sbjct: 450 SFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/326 (23%), Positives = 140/326 (42%), Gaps = 23/326 (7%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
G+ SFW PYI L + + L +S EL L GS E L+ I R+Y+
Sbjct: 163 GESSFWHPYISTL-------PASFNTVLYFSVEELELLHGSTVLDEALKLHRSIARQYSY 215
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVVHLQKVSLARR------- 117
+ F L + PY + FT+++++ A AV + V L +
Sbjct: 216 FHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMTRQNAVPLTDTAGGDDEDGTDAM 271
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
A+VPL K + ++ R ++ G+ + ++ G + N++ I+ GF
Sbjct: 272 TAMVPLWDMCNHSDGKVFTDYDISANMLRCYAMRDFEKGQEVTIFYGRRTNAEFFIHNGF 331
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
V +N +D + ++ ++ +DP Y K + + +F + RE+ D+ +LR
Sbjct: 332 VFPENRHDSVDIKLGISKQDPLYAVKAKLCDDHELTPSGIFAL-VPRERPVCEDLSTFLR 390
Query: 238 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
+ + D S+ S I + R L+ L + L +P + E E ++ D
Sbjct: 391 ILVLKDASQAASFTDE--HIMVATDDNAREALNFLIVRIQLLLRAFPKSDQEYENIIADE 448
Query: 298 NLHPKKRVATQLVRMEKKMLNACLQV 323
+ + ++A QL +E+K+L A L+
Sbjct: 449 GSNGRLKMAAQLRLLERKILTAVLET 474
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 153/334 (45%), Gaps = 36/334 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FT++ ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEE 393
Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
+ +I +LG P+S E +L + +AR L Y T+ +D+
Sbjct: 394 ELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDK 449
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
L ++L +A +L EKK+L ++ A
Sbjct: 450 LFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/334 (23%), Positives = 154/334 (46%), Gaps = 36/334 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FT++ ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEE 393
Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
+ +I +LG P+S E +L + +AR L Y T+ +D+
Sbjct: 394 ELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDK 449
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ L ++L +A +L EK++L ++ A
Sbjct: 450 SFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 102/208 (49%), Gaps = 23/208 (11%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 166 SRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLR 218
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
+F ++P P + + E F+ +F + S +V L+ S+ + ALVP +L
Sbjct: 219 D------RIFSKHPDLFPEKVYNMENFRWSFGILFSRLVRLE--SMGGKVALVPWAD-ML 269
Query: 129 AYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NP 183
+S + A L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ NP
Sbjct: 270 NHSPEVDAFLD-YDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNP 328
Query: 184 YDRLVVEAALNTEDPQYQDKRMVAQRNG 211
D + +L D Y++K +++G
Sbjct: 329 NDSVEFLVSLKKSDECYKEKLQALKKHG 356
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 134/317 (42%), Gaps = 24/317 (7%)
Query: 12 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y +
Sbjct: 172 WLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY-----AY 219
Query: 72 FMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLA 129
F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 220 FYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCNH 279
Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN +DR+ +
Sbjct: 280 TNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKI 339
Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 240
+ ++ + Y K V R G + +F +H E + +L +LR+ Y
Sbjct: 340 KLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRDY 398
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+ + + + PVS E + L L Y ED +ML +L
Sbjct: 399 LVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDLS 458
Query: 301 PKKRVATQLVRMEKKML 317
R+A +L EK++L
Sbjct: 459 LHSRIAIKLRLAEKEIL 475
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 143/329 (43%), Gaps = 36/329 (10%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFWLPYIR L ++ ++PL + + ++ L G+ ++L + R+Y
Sbjct: 88 SFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY----- 135
Query: 70 VWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALV 121
+F L Q +P +P ++FTF+ ++ A +V + + ++V+LA L+
Sbjct: 136 AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA----LI 189
Query: 122 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
PL + DD + V + YK E I ++ G + N++ +I+ GF ++
Sbjct: 190 PLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQE 249
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL--- 238
N +D++ ++ ++ + Y K V R G VF ++ E + +L +LR+
Sbjct: 250 NAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPISAQLLAFLRVFCM 308
Query: 239 ------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
Y+ + +++ PVS E + L L Y T ED +
Sbjct: 309 MEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALLLKAYKTTAEEDSS 368
Query: 293 MLTDYNLHPKKRVATQLVRMEKKMLNACL 321
L +L P R+A QL EK +L L
Sbjct: 369 TLDKTDLSPHSRMAVQLRLAEKAILEKAL 397
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/334 (24%), Positives = 148/334 (44%), Gaps = 38/334 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ +S + WSE EL + G+ + + E +
Sbjct: 138 LREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYV 190
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E++ ++ + G +P + TF+ F AF ++S V + + AL
Sbjct: 191 QSEFDNVEAK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLAL 240
Query: 121 VPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPN 168
+P L+ +S+ + K L D L K+GE + V + + N
Sbjct: 241 IPFAD-LINHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLDKSN 298
Query: 169 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
++L ++YGF + ++ D + ++ DP Y DK +A+ NG F V G +
Sbjct: 299 AELALDYGFTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--ESL 356
Query: 229 ISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGY 283
M+ YLRL + T ++++ + G + PVS E ++ + K+ L Y
Sbjct: 357 PPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALTAY 416
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 417 HTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVL 450
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 102/209 (48%), Gaps = 25/209 (11%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL- 67
S W YI L RQ S L W+ EL AYL SP + ++R + YN+L
Sbjct: 159 SRWSSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLR 211
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
D ++ LF P E + E F +F + S +V L S+ + ALVP +
Sbjct: 212 DRIFSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWAD-M 261
Query: 128 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
L +S + + L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ N
Sbjct: 262 LNHSPEVETFLD-FDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTN 320
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
P D + + +L+ D Y++K +RNG
Sbjct: 321 PNDSVELLVSLDKSDKCYKEKLQALKRNG 349
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 41 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 89
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P E+FT+E ++ A +V + + +R AL+PL
Sbjct: 90 -AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 146
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +
Sbjct: 147 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 206
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 207 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEE 265
Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
+ + I +LG PVS E + L D L Y T+ ED+ +L
Sbjct: 266 ELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLK 325
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + +A +L EK++L ++ A
Sbjct: 326 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/321 (23%), Positives = 140/321 (43%), Gaps = 28/321 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYIR L ++ ++PL + + ++ L G+ ++L + + R+Y
Sbjct: 170 SFWSPYIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F L Q +P +P + F+F+ ++ A +V + + V +R AL+PL
Sbjct: 218 AYFY--KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + YK E I ++ G + N++ +I+ GF +DN +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------- 238
R+ ++ ++ + Y K V R G + VF +H + + +L +LR+
Sbjct: 336 RVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVFCMTEEE 394
Query: 239 --GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
Y+ + + + PVS E + L L Y T ED ++L
Sbjct: 395 LKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSILEK 454
Query: 297 YNLHPKKRVATQLVRMEKKML 317
+L R+A QL EK++L
Sbjct: 455 PDLSLHTRLAVQLRLAEKQIL 475
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 173 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 220
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P E+FT+E ++ A +V + + +R AL+PL
Sbjct: 221 AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 278
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +D
Sbjct: 279 MCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHD 338
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 339 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEE 397
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+ +L +
Sbjct: 398 LKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 457
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + +A +L EK++L ++ A
Sbjct: 458 PDLSVRATMAIKLRLGEKEILEKAVKSAA 486
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P E+FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+ +L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + +A +L EK++L ++ A
Sbjct: 455 PDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 74/317 (23%), Positives = 133/317 (41%), Gaps = 24/317 (7%)
Query: 12 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y +
Sbjct: 172 WLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY-----AY 219
Query: 72 FMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLA 129
F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 220 FYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCNH 279
Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN +DR+ +
Sbjct: 280 TNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKI 339
Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 240
+ ++ + Y K V R G + +F +H E + +L +LR+ Y
Sbjct: 340 KLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRDY 398
Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+ + + + PVS E + L L Y ED +ML +L
Sbjct: 399 LVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDLS 458
Query: 301 PKKRVATQLVRMEKKML 317
R+ +L EK++L
Sbjct: 459 LHSRITIKLRLAEKEIL 475
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 132/318 (41%), Gaps = 56/318 (17%)
Query: 5 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 63
+ G + YIR L R+ G S L W E E+ L GSP++ ER E +
Sbjct: 150 ESGNAGTFGEYIRALPRRTG-------SVLDWPEDEVETLLKGSPSRLAAAERQESVNAA 202
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
E+ + +P DI A + AF + S ++ L ++ ALVP
Sbjct: 203 IAEIRS----------SFP-DITEGALRW-----AFDILFSRLIRLD--AMGGELALVPW 244
Query: 124 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 181
+L + C A + AV L DR Y AGE + G +P+S+LLI+YGF E +
Sbjct: 245 AD-MLNHKPGCAAFIDLNGSAVNLTTDRAYAAGEQVWASYGQRPSSELLISYGFAPEVGE 303
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH-AGREKEAI----------- 229
NP D + ++ DP Q K V +R G V+ F + G ++ +
Sbjct: 304 NPDDEYSLTLGVDVNDPYAQAKADVLRRMGLSPVETFPLRLNGYPRQLLQYASFILCNPD 363
Query: 230 --SDMLPYLRLGYVSDTSEMQSVISSL------------GPICPVSPCMERAVLDQLADY 275
S++ R + + QS+ S+ G I P E AV + LAD
Sbjct: 364 KPSELEGLARTAFTGSANFGQSIFDSVRGLAQGQARGKQGVILGGVPG-EIAVREMLADM 422
Query: 276 FKARLAGYPATLSEDEAM 293
L+ YP +L +D+ +
Sbjct: 423 CAEALSAYPNSLEKDKGI 440
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 148/329 (44%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S + + + PVS E + L D L Y T+ ED+ +L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + +A +L EK++L ++ A
Sbjct: 455 PDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P+ +P E+FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAESVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ + I +LG PVS E + L D L Y T+ ED+ +L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L + +A +L EK++L ++ A
Sbjct: 455 PDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 99/213 (46%), Gaps = 20/213 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
E GK S W PYI L R+ G S LLW+ ++ +L+ + K L+ ++
Sbjct: 121 EASLGKSSPWYPYIAALPRRPG-------SILLWTALDVETHLSATSIKDRALQCVREVE 173
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
+N+L+ FM + P E F + FK AF + S +V L SL ++ AL+
Sbjct: 174 DTFNDLNKQVFMKNR------EEFPPEVFNLKSFKWAFGILFSRLVRLP--SLGQKLALI 225
Query: 122 PLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 178
P G +L + ++ L + ++ +DR Y++ + + G + N +LL+ YGFV
Sbjct: 226 PFGD-MLNHDTEVTTFLDFDSGSKSITCTLDRGYESNREVFISYGKRSNGELLVAYGFVP 284
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
N D + + L+ D Y+ K + +G
Sbjct: 285 SGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 148/330 (44%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 23 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 71
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 72 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 128
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +
Sbjct: 129 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 188
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 189 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 247
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+ +L
Sbjct: 248 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLK 307
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + +A +L EK++L ++ A
Sbjct: 308 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 337
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 148/330 (44%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 41 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 89
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 90 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 146
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +
Sbjct: 147 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 206
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 207 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 265
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+ +L
Sbjct: 266 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLK 325
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + +A +L EK++L ++ A
Sbjct: 326 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 148/330 (44%), Gaps = 28/330 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393
Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
S + + + PVS E + L D L Y T+ ED+ +L
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLK 453
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ +L + +A +L EK++L ++ A
Sbjct: 454 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 130/314 (41%), Gaps = 32/314 (10%)
Query: 36 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 95
W + L GSP + G R+Y +F LF P + T FT+E F
Sbjct: 185 WGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY--QLFDNNPGILQTSNFTYEAF 237
Query: 96 KQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPY 153
A VQS V + AL+P ++ K + V + + Y
Sbjct: 238 IWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGGKITTFIDPVKHVLTCSAAKSY 294
Query: 154 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ--YQDK-RMVAQRN 210
K GE + ++ GP+PNS+ + GF + N D + L+ ED + DK ++ +R
Sbjct: 295 KKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERC 354
Query: 211 GKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVS--DTSEM-----------QSVISSLGP 256
G Q + E + ++++P+ R+ +S +T ++ + P
Sbjct: 355 GLRVGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKP 414
Query: 257 --ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 314
+S E+ L D KARL+GYP TL++DE + + N ++R ++ EK
Sbjct: 415 EAFNIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQEMKN-NPTTQRRYVLYILINEK 473
Query: 315 KMLNACLQVTADMI 328
K+L ++ +I
Sbjct: 474 KILERNIKYVEQLI 487
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 151/333 (45%), Gaps = 36/333 (10%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 177 SFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY----- 224
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 225 AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 282
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + + GE I ++ G + N++ +I+ GF ++N +D
Sbjct: 283 MCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 342
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 343 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 401
Query: 246 EMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEA 292
+ +I +LG PVS E +L + +AR L Y T+ ED++
Sbjct: 402 LKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKS 457
Query: 293 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
L ++L +A +L EK++L ++ A
Sbjct: 458 FLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 490
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 145/340 (42%), Gaps = 39/340 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIK 61
E+ +G S W PY++ L + ++PL W++ + A L GSP A+ +ER++ +
Sbjct: 132 ERIKGGASEWAPYVKTL-------RANPDAPLFWTDAKDFALLKGSPVAADAIERSKSAR 184
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
EY + V + P P EA+ F + A+ + + A+ +ALV
Sbjct: 185 TEYASITEV-------IKSDPSSYPPEAYEFLTEARFVDALATVCAKATWLPTAQCYALV 237
Query: 122 PL----------GPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP- 165
PL P +L S+ +C VD A ++ A S V+
Sbjct: 238 PLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDVDTASVVLRCATKAAANSEVIQLDAL 297
Query: 166 -QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAG 223
+ N +L +N G+VD+ +P D + ++ + T D + K+ V + G + Q F V+
Sbjct: 298 QRNNGELFLNTGYVDQKHPGDYIYMKTDIQTSDRLFTAKKQVLEGMGFTAADQYFPVYKD 357
Query: 224 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
R + + YLR V D EM +V I VS E +L L + +A Y
Sbjct: 358 RMP---TQLYSYLRFSRVQDPGEMMAVSFEEDKI--VSVMNEYEILQILMGDCRELMAEY 412
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 322
++ +L + P + + +RM EKK++ A +
Sbjct: 413 DTNEEDELNLLKLSDQMPVREIEAAKLRMSEKKLIGATMN 452
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 147/329 (44%), Gaps = 28/329 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 170 SFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY----- 217
Query: 70 VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
+F + Q +P +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 218 AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DD + V + + GE I ++ G + N++ +I+ GF ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335
Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
R+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394
Query: 246 EMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ +I +LG PVS E + L L Y T+ ED++ L
Sbjct: 395 LKEHLIGEHAIDRIFTLGNSEFPVSWDNEVKLWTFLEARASLLLKTYKTTIEEDKSFLAT 454
Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
++L +A +L EK++L ++ A
Sbjct: 455 HDLTFHATMAIKLRLGEKEILEKAVKSAA 483
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 151/334 (45%), Gaps = 36/334 (10%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFWLPYI+ L + + L + E E+ YL + +I + + R+Y
Sbjct: 169 NSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY---- 217
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
+F + Q +P +P ++FT++ ++ A +V + + +R AL+PL
Sbjct: 218 -AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTED 393
Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
+ +I +LG PVS E +L + +AR L Y T+ +D+
Sbjct: 394 ELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIHDDK 449
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+L +L +A +L EK++L ++ A
Sbjct: 450 FILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 136/301 (45%), Gaps = 20/301 (6%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
+ +S + WSE EL+ + G+ + + + ++ E+ +++ + Q +P+ I +
Sbjct: 156 STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVEEEVILRNK--QLFPFPITLD 213
Query: 89 AF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 144
F F I + +AF +++ ++ + L A V A+ + A L + D
Sbjct: 214 DFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTEDH--AHEVRGPAGLFSWDLL 271
Query: 145 VQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
L KAG+ + + + + N+ + ++YGF++ + D + ++ D Y DK
Sbjct: 272 FSLRSPLKLKAGDQLFIQYDLNKSNADMALDYGFIEPSSARDAFTLTLEISESDEFYGDK 331
Query: 204 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGP 256
+A+ NG F + G+ M+PYLRL + T SV LG
Sbjct: 332 LDIAETNGIGETAYFDIKIGQSLPPT--MIPYLRLVALGGTDAFLLESIFRNSVWGHLG- 388
Query: 257 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
PVS E + + D K+ L+GY T+ EDE ++ + NL + ++A + EK++
Sbjct: 389 -LPVSRANEELICKVVRDACKSALSGYHTTIEEDEKLMEEGNLSTRLQIAVGIRLGEKRV 447
Query: 317 L 317
L
Sbjct: 448 L 448
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 142/313 (45%), Gaps = 27/313 (8%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
+ +S + WSE ELA L G+ + L E ++ E+ +L+ + D+ +
Sbjct: 148 STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNK-------DLFSS 200
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------GPPLLA--YSSKCK-AMLA 139
T + F AF ++S + + L+PL P + Y+ + K A L
Sbjct: 201 RITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLINHNPAITTEDYAYEIKGAGLF 257
Query: 140 AVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
+ D L KAGE + + + + N++L ++YGFV+ + + + + DP
Sbjct: 258 SRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPNRNSYTLTIEIPESDP 317
Query: 199 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSL-- 254
+ DK +A+ N F V G+ A ML YLRL + SD ++S+ ++
Sbjct: 318 FFGDKLDIAETNKMGETGYFDVVDGQTLPA--GMLQYLRLVALGGSDAFLLESIFNNTIW 375
Query: 255 GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 313
G + PVS E + + D K+ L+G+ T+ EDE +L + L P+ +A ++ E
Sbjct: 376 GHLELPVSRSNEELICRVVRDACKSALSGFSTTIEEDEKLLEEGKLDPRLEMALKIRIGE 435
Query: 314 KKMLNACLQVTAD 326
K++L Q+ D
Sbjct: 436 KRVLQQIDQIFKD 448
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 144/331 (43%), Gaps = 34/331 (10%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K SFW PYI +PL + E E+ +L GS +++L + + I R+
Sbjct: 177 EKYSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQ 229
Query: 64 YNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV--SLARRF 118
Y +F LFQ P +P E FTF+ ++ A V + + +V S R
Sbjct: 230 Y-----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWA---VSTVMTRQNQVPTSDGRHL 279
Query: 119 --ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
AL+P+ + + D+ + + R + + ++ G + N++ LI+ G
Sbjct: 280 ITALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAEFLIHNG 339
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
FV +N +DR+ V+ ++ D + K V R G + F VH G++ ++L +L
Sbjct: 340 FVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPPELLVFL 398
Query: 237 RL-----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATL 287
R+ G + D TSE QS +S LG C V+ E L + Y ++
Sbjct: 399 RVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIRSYRTSI 458
Query: 288 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 318
+ E L ++ R A QL E ++L+
Sbjct: 459 KDVETELQAPDMTYHSRAALQLKLAEMQILS 489
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 138/325 (42%), Gaps = 26/325 (8%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PY+ L + PL ++ E+ GS E L++ + I R+Y L
Sbjct: 171 SFWKPYLDVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL-- 221
Query: 70 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLL 128
F +L + I E FT++ ++ A V + + K +L+PL
Sbjct: 222 --FKLLNLPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCN 278
Query: 129 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
+ + K D+ + R + GE I + G + ++ LL+ GFV N YD +
Sbjct: 279 HANGEMKTDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLYSGFVYPGNVYDGMA 338
Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSD--- 243
++ L++ D Y K + KL V Q +H+ AG+E + ++L +LR+ + D
Sbjct: 339 IQLGLSSSDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQDLEL 396
Query: 244 ------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
+ Q++ S + +S E LA Y ++ EDE L D
Sbjct: 397 RDRLLGDNRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEKLKDG 456
Query: 298 NLHPKKRVATQLVRMEKKMLNACLQ 322
NL ++R A QL+ +EK L L+
Sbjct: 457 NLSAQERSALQLLLIEKCTLENVLE 481
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 96/213 (45%), Gaps = 19/213 (8%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ K GK S W PYIR L RG +++ + W + EL L S + R I
Sbjct: 113 LLHKHAGKASPWEPYIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLI 167
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+++ + V +YP ++ E T E FK A+ S ++ + +
Sbjct: 168 SNQFDLVQAV-------VNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITM 216
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP + + S +A+LA ++ ++V D+ Y G +V+ G PNS L +++GF
Sbjct: 217 VPF-VDMFNHDSSARALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLPNSSLALDFGFT 275
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
DNP+D + + + DP +K + + +G
Sbjct: 276 LPDNPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 101/224 (45%), Gaps = 25/224 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E+ G+ S W PYI L A+ S + W + EL + + E ++R I
Sbjct: 52 LVEQHLGQASQWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVI 106
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
E+ + V Q+ P+ I E FKQA+ +S + S R
Sbjct: 107 GSEFASVLPV-------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILT 153
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP + S C+A+L+ ++ +++ D+ Y GE +V+ G PN+ L +++GF
Sbjct: 154 VPF-VDFFNHDSNCRALLSYDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFT 212
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 222
NPYD++ V AL+ DP + K + +G +V VHA
Sbjct: 213 ISCNPYDQVEVWMALSHRDPLRKMKLALLHAHGMPTV----VHA 252
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 140/346 (40%), Gaps = 48/346 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
E+ +G S W PY+ L + PLLW+ EL L GSP + + + R E
Sbjct: 130 ERHKGPLSEWAPYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESAD 183
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFAL 120
EY + + P D P +A+ F + + AFV A+ + + ++ A +A+
Sbjct: 184 DEYASI-------ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAM 235
Query: 121 VPL-------------------------GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 155
VPL G P LA ++ AA + + + +
Sbjct: 236 VPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQT 295
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLS 214
+ V + L + G VDE + D L A+ D Y+ KR + + G
Sbjct: 296 ARVVCVDPLARNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSAD 355
Query: 215 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 274
Q F V A R +L Y+R V D E+ SV I VSP E VL L
Sbjct: 356 GQTFPVFADRMP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQ 410
Query: 275 YFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 319
+ LA Y ++ E E + L + L ++RVA +L EK+++NA
Sbjct: 411 DAREMLAEYESSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 107/234 (45%), Gaps = 19/234 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFY 221
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
V Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 222 RV-------IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V R + AGE I ++ G + N++ +I+ GF ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 151/365 (41%), Gaps = 64/365 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++K SFW PY+ L Q P+ ++ +L +L GSP +I ++ +
Sbjct: 219 LLQEKSRPNSFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDL 272
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K++YN++ V F QY +F F A + S + + + + A
Sbjct: 273 KKDYNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAF 318
Query: 121 VPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
VPL P L + YS + + + D+ + DR G+ I G + NS+
Sbjct: 319 VPLADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRKCNSRF 370
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQRNGKLSVQV 217
L+NYGFV +DN + + V A DP Q K R++ +G +
Sbjct: 371 LLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGINEITH 430
Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCMERAVLD 270
F + + + + Y+R + D +++Q +++ + P I P+ E + D
Sbjct: 431 FL------EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNELDMWD 484
Query: 271 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIML 330
+ L+ YP TL +D+ +L +L +R L EK++L Q + M L
Sbjct: 485 LIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSEKMKQL 544
Query: 331 LPDVT 335
L +
Sbjct: 545 LSNFN 549
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 97/206 (47%), Gaps = 20/206 (9%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ +K+ + S W PYI L ++ S + W++ EL YL SP E ++R + +
Sbjct: 52 LAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVV 106
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E+ + + P+ + E FK A+ V S ++ + + AL
Sbjct: 107 RMEFAAAENALLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLAL 155
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP + + C+AML+ +D ++V DR Y G+ +V+ G N+ L +++GF
Sbjct: 156 VPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFA 214
Query: 179 DEDNPYDRLV-VEAALNTEDPQYQDK 203
NP+D++ + +L+ +DP K
Sbjct: 215 LPFNPHDQVAGIWLSLSEKDPLRDSK 240
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 139/314 (44%), Gaps = 29/314 (9%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
+ +S + WSE ELA L G+ + L E ++ E+ +L+ + D+ +
Sbjct: 148 STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNK-------DLFSS 200
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK----------AML 138
T + F AF ++S + + L+PL L+ ++ K A L
Sbjct: 201 RITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLAD-LINHNPAIKTEDYAYEIKGAGL 256
Query: 139 AAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 197
+ D L KAGE + + + + N++L ++YGFV+ + + + + D
Sbjct: 257 FSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESD 316
Query: 198 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL- 254
P + DK +A+ N F + G+ A ML YLRL + D ++S+ ++
Sbjct: 317 PFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIFNNTI 374
Query: 255 -GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
G + PVS E + + D K+ L+G+ T+ EDE +L L P+ +A ++
Sbjct: 375 WGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIG 434
Query: 313 EKKMLNACLQVTAD 326
EK++L Q+ D
Sbjct: 435 EKRVLQQIDQIFKD 448
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 148/340 (43%), Gaps = 34/340 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++E+ G S W PYI L G SP+ W E +LA L GS +
Sbjct: 50 LHERAMGNASRWAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYF 102
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSL 114
++ +++L F S +D P F F+ F A V++ ++ V L
Sbjct: 103 QQRFDQLQAEVFGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPL 156
Query: 115 ARRFALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWCGPQ-PNSK 170
A P PP A + K L LV++ AG++I + GPQ + +
Sbjct: 157 ADMVRSQPSWPPDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQ 216
Query: 171 LLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKE 227
LL+++G +D + P L +E L+ ED Y DK + + N +L+ H+ A R +
Sbjct: 217 LLVDHGVIDPLVNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPD 273
Query: 228 AISDMLPYLRLGYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYP 284
A + L +D ++S+ + PVS ER QL D A LA YP
Sbjct: 274 AGLLPVLRLLNLSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYP 333
Query: 285 ATLSEDEAMLTDYNLHPKKRVATQL-VRM-EKKMLNACLQ 322
++ ED A++ +L P R + + VR+ EK+ L+A L+
Sbjct: 334 TSIDEDLALMASGSLQPGSRRQSAVRVRLGEKEALDATLR 373
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 89/201 (44%), Gaps = 35/201 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEG 59
+ EK + ++S W PY+R L QL +ES + W + ELA+L SPT E +E +
Sbjct: 56 LSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKI 109
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
IK E++ L+ F + D+ E + F A+ Q
Sbjct: 110 IKSEFHVLEANVF-------PWCRDVLGEV-SLTDFMHAYSTDQ---------------- 145
Query: 120 LVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+P + C+ L+ D V D+ YKAG+ I + G PNS L ++YGF
Sbjct: 146 -IPFAD-FFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGF 203
Query: 178 VDEDNPYDRLVVEAALNTEDP 198
NP++++ V ++ DP
Sbjct: 204 AVASNPHEQVEVPMGVSLTDP 224
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 148/344 (43%), Gaps = 49/344 (14%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 67
S W PY+ L + +++SPL WS+ ELA L G+ + E ++ EYN L
Sbjct: 127 SSKWRPYLDSLPK-------SLDSPLFWSDEELAELVGTQLLGSVTGYLEFLENEYNNLV 179
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---- 123
+ V +F Y TF+ FK AF ++S ALVP+
Sbjct: 180 EEVLEPNNKIFNPAVY-------TFDGFKWAFGILRSRTFSPLT---GEDIALVPIADLV 229
Query: 124 --------GPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP-QPNSKLL 172
G P + +S+ + D + + + AGE +++ G + N+ L
Sbjct: 230 NHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSAGEQVLMQYGATKSNADLA 289
Query: 173 INYGFVDEDNPY--------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
++YGFV+ D D L + ++ +D DK + + NG F + G+
Sbjct: 290 LDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADILEINGFQCSMQFDLSRGQ 349
Query: 225 EKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKAR 279
+M+ +LRL +S D+ ++++ + G + PVS E A+ + + KA
Sbjct: 350 GPS--DEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVSRDNEEALCTSMLEGLKAA 407
Query: 280 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 323
L GY T+ +D +L +L + +A +VR+ +K + LQ
Sbjct: 408 LDGYSTTVEQDMELLARGDLSTRMEIAV-VVRLGEKRVMQELQT 450
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 138/308 (44%), Gaps = 25/308 (8%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
+ +S + WSE ELA L G+ + L E ++ E+ +L+ + D+ +
Sbjct: 148 STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNK-------DLFSS 200
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL----LAYSSKCKAMLAAVDDA 144
T + F AF +++ + ++ F + P + AY K A L + D
Sbjct: 201 RITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIKTEDYAYEIKG-AGLFSRDLL 254
Query: 145 VQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
L KAGE + + + + N++L ++YGFV+ + + + + DP + DK
Sbjct: 255 FSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDK 314
Query: 204 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-C 258
+A+ N F + G+ A ML YLRL + D ++S+ ++ G +
Sbjct: 315 LDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLEL 372
Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 318
PVS E + + D K+ L+G+ T+ EDE +L L P+ +A ++ EK++L
Sbjct: 373 PVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQ 432
Query: 319 ACLQVTAD 326
Q+ D
Sbjct: 433 QIDQIFKD 440
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 143/335 (42%), Gaps = 40/335 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ +S + WSE EL + G+ + + E +
Sbjct: 95 LREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYV 147
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E++ ++ AG + D+ TF+ F AF ++S V + + AL
Sbjct: 148 QSEFDNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLAL 197
Query: 121 VPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 169
+P + + K K L D L K+GE I V + + N+
Sbjct: 198 IPFADLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNA 256
Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 229
+L ++YGF + ++ D + ++ DP Y+DK +A+ NG F V G +
Sbjct: 257 ELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLP 314
Query: 230 SDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
M+ YLRL + T V L PVS E ++ + + K+ LA
Sbjct: 315 PQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAA 372
Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 373 YHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 143/335 (42%), Gaps = 40/335 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ +S + WSE EL + G+ + + E +
Sbjct: 99 LREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYV 151
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E++ ++ AG + D+ TF+ F AF ++S V + + AL
Sbjct: 152 QSEFDNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLAL 201
Query: 121 VPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 169
+P + + K K L D L K+GE I V + + N+
Sbjct: 202 IPFADLINHDGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNA 260
Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 229
+L ++YGF + ++ D + ++ DP Y+DK +A+ NG F V G +
Sbjct: 261 ELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLP 318
Query: 230 SDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
M+ YLRL + T V L PVS E ++ + + K+ LA
Sbjct: 319 PQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAA 376
Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 377 YHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 137/339 (40%), Gaps = 43/339 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIK 61
E+ +G S W PY++ L ++PL W+E E A L GSP + +ER+ +
Sbjct: 131 ERIKGGASDWAPYVKTL-------AANPDAPLFWTEAEDFALLKGSPIVNDAVERSRSAR 183
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKVSLARRF 118
EY + V + P P EA+ F E F A V + L S +
Sbjct: 184 EEYAAIVEV-------IKGDPTAFPAEAYEFFTEERFVDALATVCAKATWLPTASC---Y 233
Query: 119 ALVPL-------GPPLLAY---SSKCKAMLAAVD---DAVQLVVDRPYKA-GESIVVWCG 164
ALVPL G P+ S+K A D D+ +V+ KA S VV
Sbjct: 234 ALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAVVKAPANSRVVQLD 293
Query: 165 P--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVH 221
P + N +L +N G VD+ +P D L + + D + K+ V + G Q F V+
Sbjct: 294 PLQRNNGELFLNTGRVDQKHPGDYLYMRTEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY 353
Query: 222 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
E + + YLR V D EM +V I VS E +L L + ++
Sbjct: 354 ---EDRMPTQLYSYLRFARVQDPGEMMAVSFEEDKI--VSVMNEYEILQLLMGDCRELMS 408
Query: 282 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 320
Y ++ +L + + + +RM +K L C
Sbjct: 409 EYDTNEEDELNLLKLSDTMRVREIEAAKLRMSEKKLIGC 447
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 143/337 (42%), Gaps = 31/337 (9%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K + S W PY+ L + L + ++ L GSPT L++ I R+
Sbjct: 165 EKHKEYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQ 217
Query: 64 YNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
Y +F LFQ+ + + FT+E + A V + ++ + SL
Sbjct: 218 Y-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH- 269
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P+ SK A + + R +K E I + GP+ NS ++ GFV
Sbjct: 270 ALIPMWDMCNHEDSKITTDFNATLNCCECYALRDFKKAEQIFISYGPRTNSDFFVHSGFV 329
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
DN D + ++ DP ++++ + + +V F + G E ISD +L +LR
Sbjct: 330 YMDNEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLR 387
Query: 238 LGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDE 291
+ + E+ I S + + + +E V + + + RL A YP TL ED
Sbjct: 388 V-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDL 446
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
+L + L K++A QL EK++L L+ I
Sbjct: 447 QLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/317 (24%), Positives = 124/317 (39%), Gaps = 54/317 (17%)
Query: 5 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 63
+ G + YIR L R+ G S L W E E+ L GSP++ ER + +
Sbjct: 167 ESGTAGTFGEYIRALPRRTG-------SVLDWPEDEVDKLLKGSPSRLAAAERQDSVNAA 219
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
+E+ + +P T + AF + S ++ L ++ ALVP
Sbjct: 220 IDEIRSY----------FPE------ITVGALRWAFDILFSRLIRLD--AMGGELALVPW 261
Query: 124 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 181
+L + C A + DAV L DR Y GE + G +P+S+LLI+YGF E +
Sbjct: 262 -ADMLNHKPGCAAFIDLNGDAVNLTTDRSYVKGEQVWASYGQRPSSELLISYGFAPEVGE 320
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR--------------EKE 227
NP D + ++ DP K V + G V+ F + E
Sbjct: 321 NPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLRLNGYPRQLLQYASFILCNPE 380
Query: 228 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP-----------CMERAVLDQLADYF 276
S++ + + + QS+ S+ + E AV + LAD
Sbjct: 381 KPSELKGLAQSAFTGSANIGQSIFDSVRGLTNGKARGKQGVILGGVAGEIAVREMLADLC 440
Query: 277 KARLAGYPATLSEDEAM 293
L+ YP TL +D+ +
Sbjct: 441 AEALSAYPNTLEKDKGL 457
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 92/196 (46%), Gaps = 23/196 (11%)
Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 204
+ R + AGE +++ GP+ N LL YGFV++DNP D + ++ D +D R
Sbjct: 40 VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99
Query: 205 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 249
++ + GKL V V GR EKE ++P RL V D E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158
Query: 250 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY--NLHPKKRVA 306
SL +SP E A D L G+ TL+EDEA L+ +L +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218
Query: 307 TQLVRMEKK-MLNACL 321
RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 148/351 (42%), Gaps = 58/351 (16%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEILERAEGI 60
+K+ KSFW P++ L VE P+L+++ EL +L GSP ++ ER E I
Sbjct: 433 EKKNHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVKERRECI 483
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
++Y Q IP A T + F A + S + L ++ R
Sbjct: 484 TQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FINKKRTD 528
Query: 119 ALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
A VPL AY++ K ML A +D + G+ I CG + N
Sbjct: 529 AFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIYYSCGRKCN 579
Query: 169 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKE 227
S+ L+NYGFV ++N + + + + +D K +M+ +R K +F +H E++
Sbjct: 580 SRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFRIHINYEEK 637
Query: 228 AISDMLPYLRLGYVSDTSEMQ-----SVISSLGPIC--PVSPCMERAVLDQLADYFKARL 280
++ + +LR + D ++ S P+ P S E+ + ++ +
Sbjct: 638 SVLEFFGFLRFVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIENEKQMWTEIHKICAEIM 697
Query: 281 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
YP TL ED+ +L L ++ L EK++L + + M LL
Sbjct: 698 IQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEILMYYITMADRMKKLL 748
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 146/357 (40%), Gaps = 50/357 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--E 58
M + Q S W PY+ L ++ ++S + WSE ELA L S +I + E
Sbjct: 94 MMSEGQQSDSRWAPYLAVLPQK-------LDSLVFWSEEELAELQASSVAKKIGRSSAEE 146
Query: 59 GIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL--- 109
+ + L F S+ Y +DIP E E KQ + L
Sbjct: 147 MFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE----EPAKQENGGAEGETDDLVSD 202
Query: 110 ----QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
+K L+ ++PL L A + + A + ++ +++ +P AGE I G
Sbjct: 203 DGEDEKTILS----MIPLADMLNADAERNNARIYYENEDLEMRTIKPIMAGEEIFNDYGQ 258
Query: 166 QPNSKLLINYGFVDED-NPYDRLVVEAA----LNTEDPQ---------------YQDKRM 205
P S LL YG+V E+ YD + + +A L TE PQ +++
Sbjct: 259 LPRSDLLRRYGYVTENYAQYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVA 318
Query: 206 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 265
+A R G L A E+ AI D L L ++ D ++++++S + S
Sbjct: 319 LADREGILEDSYDVNIANAEERAIPDELLALLYLFLLDNENLEAIVTSQSALPSRSKLAT 378
Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
V L + R A Y TL EDE +L NL + +A Q+ EK++L ++
Sbjct: 379 ELVGKVLVKVLRHREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRLAVE 435
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 100/219 (45%), Gaps = 12/219 (5%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+AL+PL + V D + + + +KAGE + ++ G + N+ L ++ GF
Sbjct: 14 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 73
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--MLPY 235
V E+N YD + ++ DP Q + + GKLS+ + R+ + D +L +
Sbjct: 74 VFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPIDGQLLAF 130
Query: 236 LRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 289
LR+ + + ++ I+S LG I C + +E L K L+ Y TL E
Sbjct: 131 LRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKTTLDE 189
Query: 290 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
DE +L + P + +A ++ EK+++ L+ I
Sbjct: 190 DEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 73 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 132
M +F +YP P E F E FK +F + S +V L S+ + ALVP ++ +S
Sbjct: 73 MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129
Query: 133 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLV 188
+ + L + +V DRPY+ GE + + G + N +LL++YGFV ++ NP D +
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVE 189
Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
+ +L D Y++K + ++ G Q F
Sbjct: 190 LSLSLKKSDESYKEKLELLKKYGLSGSQCF 219
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 96/205 (46%), Gaps = 16/205 (7%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EKK G+ S W PYI L +Q G+L + + W+E+EL + S E +++ I
Sbjct: 112 LIEKKLGQGSEWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQI 166
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++++ + ++ + F Y A T +F V + V + AL
Sbjct: 167 EKDFLAIKHIFECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLAL 223
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
+P L + +A++ + DD ++Q++ DR Y GE +++ G N+ L++
Sbjct: 224 IPFAD-FLNHDGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLML 282
Query: 174 NYGFVDEDNPYDRLVVEAALNTEDP 198
++GF N YD++ ++ + DP
Sbjct: 283 DFGFTIPYNIYDQVQIQFDIPKHDP 307
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 123/262 (46%), Gaps = 14/262 (5%)
Query: 77 LFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSS 132
+ Q +P+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 9 VIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNG 68
Query: 133 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 192
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 69 LITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 128
Query: 193 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT-------- 244
++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 129 VSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 187
Query: 245 -SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 303
S + + + PVS E + L D L Y T+ ED+++L +++L +
Sbjct: 188 DSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRA 247
Query: 304 RVATQLVRMEKKMLNACLQVTA 325
++A +L EK++L ++ A
Sbjct: 248 KMAIKLRLGEKEILEKAVKSAA 269
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 12/213 (5%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+AL+PL + V D + + + +KAGE + ++ G + N+ L ++ GF
Sbjct: 451 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 510
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--MLPY 235
V E+N YD + ++ DP Q + + GKLS+ + R+ + D +L +
Sbjct: 511 VFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPIDGQLLAF 567
Query: 236 LRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 289
LR+ + + ++ I+S LG I C + +E L K L+ Y TL E
Sbjct: 568 LRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKTTLDE 626
Query: 290 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
DE +L + P + +A ++ EK+++ L+
Sbjct: 627 DEKLLAEAQATPNRLLAIKMRATEKRIIRETLE 659
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 114/248 (45%), Gaps = 11/248 (4%)
Query: 88 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 146
E+FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 12 ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71
Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
V + ++AGE I ++ G + N++ +++ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 72 CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 131
Query: 207 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLGPI- 257
R G + VF +H E + +L +LR+ +++ + ++ +LG
Sbjct: 132 LARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFTLGNSE 190
Query: 258 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
PVS E + L D L Y T+ ED+A L L P+ +A +L EK++L
Sbjct: 191 YPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLGEKEIL 250
Query: 318 NACLQVTA 325
++ A
Sbjct: 251 EKAVRSAA 258
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 90/206 (43%), Gaps = 42/206 (20%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEG 59
+ EK + ++S W PY+R L QL +ES + W ELA+L SPT E +E +
Sbjct: 56 LSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKI 109
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL----- 114
IK E++ L +A Q +P+ C L +VSL
Sbjct: 110 IKSEFHLLT----LANK--QVFPW---------------------CRDALGEVSLTDFMH 142
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
A +P + C+ L+ D V D+ YKAG+ I + G PNS L
Sbjct: 143 AYSTDQIPFAD-FFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILA 201
Query: 173 INYGFVDEDNPYDRLVVEAALNTEDP 198
++YGF NP++++ V ++ DP
Sbjct: 202 VDYGFAVASNPHEQVEVPMGVSLTDP 227
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 102/226 (45%), Gaps = 27/226 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EKK G+KS W+PYI L + + S + W E EL+ + S E +++ I++
Sbjct: 9 EKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEK 63
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+++ F+A + Q P I TE E F A+ V S + ++R +L+P
Sbjct: 64 DFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIP 110
Query: 123 LGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
+ +L D+ +Q+ DR Y G+ + + G N+ L++++G
Sbjct: 111 FADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFG 170
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 219
F N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 171 FTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 138/340 (40%), Gaps = 51/340 (15%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----- 64
SFWLPYIR L ++ + PL + + ++ L G+ ++L + + R+Y
Sbjct: 170 SFWLPYIRSLPQE-------YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYK 222
Query: 65 -----NELDTVWF---------MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
L +V M G LF Q+ A + + +Q + +
Sbjct: 223 LVQDKGMLGSVELRLFASLTPVMGGKLFDQW-------AVSSVMTRQNQIPTEDG----S 271
Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
+V+LA L+PL + DD + V + YK E I ++ G + N++
Sbjct: 272 RVTLA----LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFYGTRSNAE 327
Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
+I+ GF +D+ +DR+ ++ ++ + Y K V R G + VF +H E +
Sbjct: 328 FVIHNGFFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPPISA 386
Query: 231 DMLPYLRLG---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
+L +LR+ Y+ + + + PVS E + L L
Sbjct: 387 QLLAFLRVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLK 446
Query: 282 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
Y T ED +ML +L R+A QL EK++L L
Sbjct: 447 TYKTTSEEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 102/221 (46%), Gaps = 22/221 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EKK G+KS W+PYI L + + S + W E EL+ + S E +++ I++
Sbjct: 124 EKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEK 178
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+++ F+A + Q P I TE E F A+ V S + ++R +L+P
Sbjct: 179 DFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIP 225
Query: 123 LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
+ +L D+ + ++ DR Y G+ + + G N+ L++++GF
Sbjct: 226 FADFMNHDGLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPY 285
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 219
N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 286 NIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326
>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
Length = 206
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)
Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G
Sbjct: 10 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67
Query: 226 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 279
E++ MLPYLRL + T ++++ + G + PVS E A+ + + K+
Sbjct: 68 -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126
Query: 280 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 96/201 (47%), Gaps = 22/201 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ +K+ + S W PYI L ++ S + W++ EL YL SP E ++R + +
Sbjct: 52 LAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVV 106
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E+ + V L QQ + + T ++ V S ++ + + AL
Sbjct: 107 RMEFAAAENVCM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLAL 153
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP + + C+AML+ +D ++V DR Y G+ +V+ G N+ L +++GF
Sbjct: 154 VPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFA 212
Query: 179 DEDNPYDRLV-VEAALNTEDP 198
NP+D++ + +L+ +DP
Sbjct: 213 LPFNPHDQVAGIWLSLSEKDP 233
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 133/300 (44%), Gaps = 27/300 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
++ G W PY+R L + G L WSE E+ +L GS ++ LE +
Sbjct: 141 QRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAAAEA 193
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+ E+ V A + + + AF + AF + S +V L L + AL+P
Sbjct: 194 SWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEALLP 247
Query: 123 LGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
LL + + L +A + AV L +R Y+AGE +++ G + + +LL++YGF +
Sbjct: 248 WAD-LLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGFCPD 306
Query: 181 --DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY- 235
NP+D RL++E L D K +++G + Q+F + R A +++ Y
Sbjct: 307 LGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL---RMAAAPFELVHYT 361
Query: 236 -LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
V E + + L + P ++ A L+ + KA LA YP + D A L
Sbjct: 362 AFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAALAAYPRSFDGDRAEL 421
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 95/217 (43%), Gaps = 24/217 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M EK++G+ S W PYI L G+ + S + W ELA L SP ER +
Sbjct: 117 MAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALL 171
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ EY E+ V S Y D+ + E FK + V S S +
Sbjct: 172 QSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTM 220
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLIN 174
+PL + + + + + DD +VV R Y+ G+ + + G N+ L ++
Sbjct: 221 IPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALD 279
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 210
YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 280 YGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 95/217 (43%), Gaps = 24/217 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M EK++G+ S W PYI L G+ + S + W ELA L SP ER +
Sbjct: 117 MAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALL 171
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ EY E+ V S Y D+ + E FK + V S S +
Sbjct: 172 QSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTM 220
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGE-----SIVVWCGPQPNSKLLIN 174
+PL + + + + + DD +VV R Y+ G+ + + G N+ L ++
Sbjct: 221 IPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALD 279
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 210
YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 280 YGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 144/348 (41%), Gaps = 46/348 (13%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 57
Q + S W PY+ L + ++S + WS++ELA L S +I ++
Sbjct: 96 QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKSY 148
Query: 58 ---EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQK 111
+G+K E+ S+ Y +DIP + + A V +K
Sbjct: 149 IAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDEK 205
Query: 112 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
L+ ++PL L A + + A L ++ +++ +P GE I G P S L
Sbjct: 206 TILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSDL 261
Query: 172 LINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQDKRMVAQRNGKLS 214
L YG+V D + YD L+V N T+D + + +A+R G
Sbjct: 262 LRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTRLELAEREGVYE 320
Query: 215 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 274
VH+ ++ +I D L + D S ++++++S I S LA
Sbjct: 321 DSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKLTTELAGQVLAT 380
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+AR Y TL EDE +L + +L + +A Q+ EKK+L A +Q
Sbjct: 381 LLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 135/313 (43%), Gaps = 29/313 (9%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
+S + WSE ELA + G+ + L E +K E+ +++ + D+
Sbjct: 170 TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHK-------DLFPSR 222
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL----------LAYSSKCKAMLA 139
T + F AF ++S + + L+P + A+ K A L
Sbjct: 223 ITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLF 279
Query: 140 AVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
+ D L KAG+ + + + + N+ L ++YGF+++ + + + + D
Sbjct: 280 SWDVLFSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDL 339
Query: 199 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVI--SSL 254
+ DK +A+ NG F + E+ MLP+LRL + T ++S+ S
Sbjct: 340 FFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVW 397
Query: 255 GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 313
G + PVS E + + + +A L+GY T+ EDE L + NL + R+A + E
Sbjct: 398 GHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGE 456
Query: 314 KKMLNACLQVTAD 326
K++L +Q+ D
Sbjct: 457 KRVLQQIIQIFKD 469
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/351 (21%), Positives = 146/351 (41%), Gaps = 43/351 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++K+ + SFW PY+ L + P+ +++++L +L GSP ++ ++ +
Sbjct: 244 LLQEKKIQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDL 297
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K++Y ++ V A Q +F+ F A + S + + + + A
Sbjct: 298 KKDYCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAF 343
Query: 121 VPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
VPL P L + YS + + + D+ ++ G+ I G + NS+
Sbjct: 344 VPLADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSKCNSRF 395
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 231
L+NYGFV +DN + + V + Q K +++ + + F + + SD
Sbjct: 396 LLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPNDVSFSD 454
Query: 232 MLPYLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKARLAGYPAT 286
+ ++R + + E +++ I P +S E A + + + L YP T
Sbjct: 455 FMSFIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALNQYPTT 514
Query: 287 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 337
L +D +L L +R L EKK+LN Q + M L + S
Sbjct: 515 LEQDLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 135/313 (43%), Gaps = 29/313 (9%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
+S + WSE ELA + G+ + L E +K E+ +++ + D+
Sbjct: 170 TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHK-------DLFPSR 222
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL----------LAYSSKCKAMLA 139
T + F AF ++S + + L+P + A+ K A L
Sbjct: 223 ITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLF 279
Query: 140 AVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
+ D L KAG+ + + + + N+ L ++YGF+++ + + + + D
Sbjct: 280 SWDVLCSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDL 339
Query: 199 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVI--SSL 254
+ DK +A+ NG F + E+ MLP+LRL + T ++S+ S
Sbjct: 340 FFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVW 397
Query: 255 GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 313
G + PVS E + + + +A L+GY T+ EDE L + NL + R+A + E
Sbjct: 398 GHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGE 456
Query: 314 KKMLNACLQVTAD 326
K++L +Q+ D
Sbjct: 457 KRVLQQIIQIFKD 469
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 92/197 (46%), Gaps = 19/197 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EKK+G+KS W+PYI L + + S + W E E + + S E +++ I++
Sbjct: 124 EKKKGQKSRWVPYISRLPQP-----AEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEK 178
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
E++ F+A + Q YP I E E F A+ V S + ++ +L+P
Sbjct: 179 EFS------FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIP 225
Query: 123 LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
+ +L+ D+ + ++ DR Y G+ + + G N+ L++++GF
Sbjct: 226 FADFMNHDGLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTVPY 285
Query: 182 NPYDRLVVEAALNTEDP 198
N +D + ++ + +DP
Sbjct: 286 NIHDEVQIQMDVPNDDP 302
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 107/234 (45%), Gaps = 19/234 (8%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 108 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFY 160
Query: 69 TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
V Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 161 KV-------IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 213
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +
Sbjct: 214 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 273
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 274 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 142/337 (42%), Gaps = 31/337 (9%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K + S W PY+ L + L + ++ L GSPT L++ I R+
Sbjct: 165 EKHKEYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQ 217
Query: 64 YNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
Y +F LFQ+ + + FT+E + A V + ++ + SL
Sbjct: 218 Y-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH- 269
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P+ +SK A + + R +K E I + G + NS ++ GFV
Sbjct: 270 ALIPMWDMCNHENSKITTDFNATLNCCECYALRDFKKAEQIFISYGARTNSDFFVHSGFV 329
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
DN D + ++ DP +++ + + +V F + G E ISD +L +LR
Sbjct: 330 YMDNEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLR 387
Query: 238 LGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDE 291
+ + E+ I S + + + +E V + + + RL A YP TL ED
Sbjct: 388 V-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDL 446
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
+L + L K++A QL EK++L L+ I
Sbjct: 447 QLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 85/190 (44%), Gaps = 26/190 (13%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + ER E I
Sbjct: 119 LMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRERLEMI 173
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
E+ ++ + LF + + E FK + V S + +++ L
Sbjct: 174 TTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTL 219
Query: 121 VPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
V + P+L + +S K + + + DR Y + I + G N++L ++Y
Sbjct: 220 VMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAELALDY 277
Query: 176 GFVDEDNPYD 185
GF +NPYD
Sbjct: 278 GFTVPENPYD 287
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/342 (21%), Positives = 139/342 (40%), Gaps = 45/342 (13%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW PYI L P+ ++ ++L +L GSP +I ++ ++++YN++
Sbjct: 228 SFWKPYIDILPSSYPSF------PIFYNNSDLEWLKGSPFLKQIKDKLADLQKDYNDICN 281
Query: 70 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG----- 124
V F QY F F A + S + + ++ + A VPL
Sbjct: 282 V----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLADMLNH 327
Query: 125 --PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
P L + YS + + + D+ ++ G+ I G + NS+ +NYGFV E
Sbjct: 328 KRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRKCNSRFFLNYGFVVE 379
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
N + + + + DP Q K + + + + F + ++ A+ D + ++R
Sbjct: 380 GNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDETAVIDFMSHIRFLV 438
Query: 241 VSDTSEMQSVISSLGP-------ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
+ D ++++ +++ P+ E + + K L YP T +D+ +
Sbjct: 439 IRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKTLKQYPTTFEQDQEI 498
Query: 294 LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 335
L L +R L EK++L Q + M LL +
Sbjct: 499 LQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 9 DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 253
K V R G + VF +HA E + +L +LR+ +++ + +I +
Sbjct: 69 MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127
Query: 254 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 308
LG PVS E +L + +AR L Y T+SED+A L +L +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183
Query: 309 LVRMEKKMLNACLQVTA 325
L EK++L ++ A
Sbjct: 184 LRLGEKEILEKAIKSAA 200
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 98/203 (48%), Gaps = 15/203 (7%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +ER +
Sbjct: 111 MGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAVERIKVW 165
Query: 61 KREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHLQKVSLA 115
+++Y+ +LD + F +QY +++ A T I +AF A V S V +
Sbjct: 166 EQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQPDDLPED 221
Query: 116 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
AL+PL L + K A D+ + L+V + AG+ I GP+ N +LLINY
Sbjct: 222 GVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNEQLLINY 280
Query: 176 GFVDEDNPYDRLVVEAALNTEDP 198
GF NP D +V + + P
Sbjct: 281 GFCIAGNPTDYRIVLLGVKPDSP 303
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 90/201 (44%), Gaps = 32/201 (15%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIK 61
E +GK SFW PYI L Q+AV + P W + ++AYL G+ I E E +K
Sbjct: 122 EYLKGKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVK 178
Query: 62 REYNELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLAR 116
RE+ + A L ++ + D+P A+T ++K AF S L + R
Sbjct: 179 REFKQ-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRR 229
Query: 117 RFALVPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVW 162
AL+P G PLL ++ +V D +L+ PY+ G +
Sbjct: 230 LSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQVYNN 289
Query: 163 CGPQPNSKLLINYGFVDEDNP 183
G + NS+LL+ YGF+ + P
Sbjct: 290 YGLKTNSELLLAYGFILPETP 310
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 142/312 (45%), Gaps = 39/312 (12%)
Query: 40 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFK 96
E+ YL G+ + + + + I R++ L V+ GS ++ +P +A FTF+ ++
Sbjct: 195 EMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAFTFDTYR 252
Query: 97 QAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKCKAMLAA 140
AV + H+ V AL+P+ GP AY+ A
Sbjct: 253 WCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--------A 304
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
+ ++ + + +K GE + + G + NS LLI+ GFV +++P+D++ + ++ +DP Y
Sbjct: 305 LTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVSQKDPLY 364
Query: 201 QDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEMQSVISS 253
K +++ + N ++S Q +L +LR+ ++++ + ++ +SS
Sbjct: 365 SLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQKNELSS 424
Query: 254 LGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 310
L I V + V + L + K L G+ E M+ D +L + ++A Q
Sbjct: 425 LREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSKLALQFR 484
Query: 311 RMEKKMLNACLQ 322
E ++L+AC+
Sbjct: 485 IEEHRILSACVN 496
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 134/343 (39%), Gaps = 57/343 (16%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
K G SF+ PY L P+ WS EL L GS ++I +R + I+ +
Sbjct: 125 KTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIADRGQAIQDD 178
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
Y + V G+L T + FK A + V S LQ + R ALVP
Sbjct: 179 YEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDGHRTSALVPH 225
Query: 124 GPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
L Y + K V + + +AG + G + N + L+NYGF EDN
Sbjct: 226 ADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLLNYGFAVEDN 285
Query: 183 PY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
+ + +E ++ D +QDK R E IS
Sbjct: 286 RELDGFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETNQIS------ 325
Query: 237 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
G V+ Q+V S+G P S E + + RLA YP T+S+D A L D
Sbjct: 326 --GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTISQDMADLQD 380
Query: 297 YNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 333
+P+ +R A VR EK++L+ + DM+ + D
Sbjct: 381 EASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 76/162 (46%), Gaps = 15/162 (9%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
+ L +S+ EL+YL GS +R I+ +Y+ + T LF YP E F+
Sbjct: 138 TTLFFSDDELSYLEGSSLHHFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 191
Query: 92 FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
+ +K A + +S VV K L + + P + + K + AVD +
Sbjct: 192 LDAWKWALSVIWSRSFVVDEGKRGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 247
Query: 150 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYDRLV 188
P K GE I V G N++LL++YGFV E+NP+D +V
Sbjct: 248 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHDAVV 289
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 139/333 (41%), Gaps = 23/333 (6%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
++ + S W PY+ L + L + ++ L GSPT L++ I R+
Sbjct: 164 ERHKENSKWKPYLDILPT-------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQ 216
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVP 122
Y+ + V+ + D+ FT+E + A V + + +R AL+P
Sbjct: 217 YSYFNKVFQNNNNAVSAILRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIP 272
Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
+ + + A + + R +K GE I + GP+ NS ++ GFV +N
Sbjct: 273 MWDMCNHENGRITTDFNATSNYCECYALRDFKKGEQIFISYGPRTNSDFFVHSGFVYMEN 332
Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYV 241
D + ++ D +++ + + +V F + G E ISD +L +LR+ +
Sbjct: 333 KQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FS 389
Query: 242 SDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLT 295
+E+ I S + + + +E V + + + RL A YP TL ED +L
Sbjct: 390 MRKAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL- 448
Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
+ L K++ QL EK++L L+ I
Sbjct: 449 ETTLPQIKKLTIQLRVTEKRILQGALEYVEQWI 481
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 67/326 (20%), Positives = 134/326 (41%), Gaps = 31/326 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ G+KS W Y L +++ + W +++ +L GS E IK
Sbjct: 147 ERAIGEKSKWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKI 201
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
++ ++ F L P + F + ++ A + S + LQ + ++ ALVP
Sbjct: 202 QFLRIEKTIFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVP 252
Query: 123 LG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
P K + + ++ + + D+ Y + I G + N +LL+ YGF
Sbjct: 253 YADFINHNPFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGF 312
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLP 234
+ E NP+D + + AL+T+D Y K ++ +++ VF+ +E +
Sbjct: 313 IIERNPFDSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------LYE 366
Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
++RL S+L + +E+ + + K L Y T++E++ +
Sbjct: 367 FMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEKILN 426
Query: 295 TDYN---LHPKKRVATQLVRMEKKML 317
N L ++ + + + EKK+L
Sbjct: 427 NLSNIIVLTKNQKTSIKQRKCEKKIL 452
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 153/385 (39%), Gaps = 83/385 (21%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------AEI 53
MYE QG+KS W PY L + ++P+ WSE+EL L S + AE
Sbjct: 672 MYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADAES 724
Query: 54 LERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
+ R G R ++L + GS Y +D+ + E +V
Sbjct: 725 MFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDGWV 784
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
+ + +VP+ +L ++ A + ++++ + RP KAGE I
Sbjct: 785 EDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIF 834
Query: 161 VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
+ GP PNS+LL YG+V E + YD VVE + + V + N +S QV
Sbjct: 835 NYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VMRLNFGISGQVLE 885
Query: 220 V--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-----------------ICPV 260
H E+E D R +T E+ S + GP + V
Sbjct: 886 KLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPEDLQEQLKTFLKGV 940
Query: 261 SPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 310
A+ D+ LA +A + YP + SED +L +L + R+A ++
Sbjct: 941 KKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQDLSQRTRMAIEVR 1000
Query: 311 RMEKKMLNACLQVTA--DMIMLLPD 333
EKK+L + T+ D+ M + D
Sbjct: 1001 LGEKKLLQEAIASTSSVDVEMTVDD 1025
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 90/334 (26%), Positives = 142/334 (42%), Gaps = 39/334 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E +G S W PY+ L RQ S A L S + ++L G+
Sbjct: 136 LSEVARGADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGV 183
Query: 61 KREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLA 115
K EY +E D+V S + D+ + TF+ F AF ++S V + K++L
Sbjct: 184 K-EYVQSEFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRGDKLALV 238
Query: 116 RRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSK 170
LV P + + S K K + + L K+G+ I + + + N++
Sbjct: 239 PFADLVNHSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDLDKSNAE 297
Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
L ++YGFV+ + D V ++ DP Y DK +A+ N F + ++
Sbjct: 298 LALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL--DEPLPP 355
Query: 231 DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
MLPYLRL + T SV L P+SP E ++ + D K+ LA Y
Sbjct: 356 QMLPYLRLLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACKSALAAY 413
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
T+ EDE + NL P+ +A + EKK+L
Sbjct: 414 HTTIEEDEELSERENLQPRLTIAIGVRAGEKKVL 447
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 150/368 (40%), Gaps = 61/368 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
MYE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE
Sbjct: 102 MYEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEE 154
Query: 60 IKREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
+ R +L + GS Y +D+ E + +
Sbjct: 155 MFRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGW 214
Query: 100 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 159
V + + V++A +L ++ A + D+ + + R KAGE I
Sbjct: 215 VEDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEI 264
Query: 160 VVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQD------KRM 205
+ + GP PNS+LL YG++ E + P+D V+ +L +E QD +RM
Sbjct: 265 LNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQDIMAETMERM 322
Query: 206 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPV 260
++ + V +G + P + G D E +++ I L G +
Sbjct: 323 --DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKLDGNLISD 380
Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 320
+ + + + + + Y T++EDE +L +L ++R+A Q+ EKK+L
Sbjct: 381 KRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVRLGEKKLLQEA 440
Query: 321 LQVTADMI 328
++M+
Sbjct: 441 CDHFSEMV 448
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 107/230 (46%), Gaps = 34/230 (14%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+VP+ +L ++ A + +D + +V RP KAGE I+ + GP PNS+LL YG+
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129
Query: 178 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 213
V + P+D +V++ L + DP+ ++D ++ + +G+
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187
Query: 214 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 273
+ + +E +++ L+ + ++++ G + P + +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240
Query: 274 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 323
+ LA YP T EDEA+L NL ++R+A ++ EK++L LQ+
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQM 290
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 147/354 (41%), Gaps = 54/354 (15%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIK 61
EK + K S W PY++ L+ S L ++E E L G+ E +R + K
Sbjct: 171 EKAKAKGSEWWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAK 223
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFAL 120
EY L + P P + + F + + AF+ A+ Q ++ A +A+
Sbjct: 224 EEYEALRAA-------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAM 275
Query: 121 VPL--------GPPLLA-------------------YSSKCKAMLAAVDDA-VQLVVDRP 152
VPL PP ++ + +C VD A V L +
Sbjct: 276 VPLMDAIPICGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTR 335
Query: 153 YKAGESIV-VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
AG I+ + + NS+L +++G VD+ +P D L+ DP Y K+ V + G
Sbjct: 336 ASAGSKILQIDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQG 395
Query: 212 KLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 270
Q F V+ R + L YLR V+++ E+ +V + + VSP E L
Sbjct: 396 FADKGQTFPVYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQ 450
Query: 271 QLADYFKARLAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 323
L + R++ Y T EDE +L ++ K R A++L R EK+++ +
Sbjct: 451 LLMADCRDRMSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 15/159 (9%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
+ L +S+ EL+YL GS +R I+ +Y+ + T LF YP E F+
Sbjct: 124 TTLFFSDDELSYLEGSSLHYFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 177
Query: 92 FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
+ +K A + +S VV K L + + P + + K + AVD +
Sbjct: 178 LDAWKWALSVIWSRSFVVDEGKSGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 233
Query: 150 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYD 185
P K GE I V G N++LL++YGFV E+NP+D
Sbjct: 234 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHD 272
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 26/312 (8%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
+++PLLWSE EL L G+ ++L+ G + T + LF P P
Sbjct: 144 LDTPLLWSEDELQELQGT----QVLQTLGGYLTFFRS--TFQQLQSGLFTSNPAAFPPSI 197
Query: 90 FTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 146
FT F A AV+S + K++LA L L A +SK A + Q
Sbjct: 198 FTLPRFLWAVAAVRSRSHPPLDGPKIALA---PLTELVSHRRAANSKLSVRSAGLFGRGQ 254
Query: 147 LVV---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 202
++V R + GE + + GP + + +L++YG +D +P + + D D
Sbjct: 255 VLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPDSDRFIDD 314
Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPIC 258
K + + N V+++ +++ +ML +LRL + SD ++S+ + G +
Sbjct: 315 KLDILESNDLPQSVVYNLTP--DEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDVWGFMQ 372
Query: 259 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK--KRVATQLVRM-EK 314
PVS E AV + L++ +A L GY T+ +D A L K +R A L+R+ EK
Sbjct: 373 EPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAELRAQGSRAKGSRREAALLIRLGEK 432
Query: 315 KMLNACLQVTAD 326
+ L+A + D
Sbjct: 433 EALDAVARFFED 444
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 130/306 (42%), Gaps = 27/306 (8%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
+PLLWSE E L G+ + + + +L LF P P FT
Sbjct: 146 TPLLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFT 199
Query: 92 FEIFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 148
F A AV+S + K++LA LV L A ++K + + Q+
Sbjct: 200 LPNFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVA 256
Query: 149 V---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
V R + GE++ + P + + +L++YG +D +P + L+ D DK
Sbjct: 257 VVEATRAIRKGEALGMDYAPGKLDGPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKA 316
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSP 262
+ + G + + +++ +M+ +LRL + D ++S+ + VS
Sbjct: 317 DIVEGAGLRPSMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSE 369
Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNAC 320
E AV LA+ +A LAGYP TL +D A L + R A LVR+ EK+ L+A
Sbjct: 370 GNEEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAV 429
Query: 321 LQVTAD 326
+ D
Sbjct: 430 ARFFED 435
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 14/182 (7%)
Query: 151 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
R A + +VVW G +PN +LL+ G + ++N D L A L D Y
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 261
K V + G + + F V+A R +L YLRL V+D + + + +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
E +L L + RLA Y + ED + +L PK+R+A +L EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340
Query: 322 QV 323
+
Sbjct: 341 EA 342
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 95/207 (45%), Gaps = 23/207 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +ER +
Sbjct: 111 MAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAVERIKIW 165
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR--- 117
+ +Y D+ + G+ +P E +T+E++ A + S + +S A +
Sbjct: 166 EEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSGAVQPGD 217
Query: 118 ------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
AL+PL L + K A D + L+V + AG+ I GP+ N +L
Sbjct: 218 LPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGPRNNEQL 276
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDP 198
LINYGF NP D +V + + P
Sbjct: 277 LINYGFCIAGNPTDYRIVHLGVKPDSP 303
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/368 (22%), Positives = 150/368 (40%), Gaps = 61/368 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
MYE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE
Sbjct: 753 MYEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEE 805
Query: 60 IKREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
+ R +L + GS Y +D+ E + +
Sbjct: 806 MFRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGW 865
Query: 100 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 159
V + + V++A +L ++ A + D+ + + R KAGE I
Sbjct: 866 VEDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEI 915
Query: 160 VVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
+ + GP PNS+LL YG++ E + P+D V+ +L +E QD ++ +
Sbjct: 916 LNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQD--IMTETMD 971
Query: 212 KL------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPV 260
K+ + V +G + P + G D E +++ I L G +
Sbjct: 972 KMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKVDGNLISD 1031
Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 320
+ + + + + + Y T++EDE +L +L ++R+A ++ EKK+L
Sbjct: 1032 KRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVRLGEKKLLQEA 1091
Query: 321 LQVTADMI 328
++M+
Sbjct: 1092 FDHFSEMV 1099
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 116/251 (46%), Gaps = 14/251 (5%)
Query: 88 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 146
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405
Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 465
Query: 207 AQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG 255
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 466 LARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLG 524
Query: 256 PI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 314
PVS E + L D L Y T ED+A+L + +L + ++A +L EK
Sbjct: 525 NSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKLRLGEK 584
Query: 315 KMLNACLQVTA 325
++L +Q A
Sbjct: 585 EILEKAVQSAA 595
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 253
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 254 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412
Query: 313 EKKMLNACLQVTA 325
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 95/211 (45%), Gaps = 11/211 (5%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
DN D + ++ D +++ + + SV F + G E ISD +L +LR
Sbjct: 78 CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 238 LGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
+ + +E+ + S L + C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+L + L K++A QL EK++L+ L+
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALE 224
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 253
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274
Query: 254 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334
Query: 313 EKKMLNACLQVTA 325
EK++L ++ A
Sbjct: 335 EKEILEKAVKSAA 347
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 153/374 (40%), Gaps = 53/374 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
+YE QG+ S W PY L + ++P+ WS+ EL L S + +I + AE
Sbjct: 107 IYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRHKIGKADAEN 159
Query: 60 --------IKREYNELDTVWFMAGS--------LFQQYPYDIPTEAFTFEIFKQAFVAVQ 103
I R E+ F AG+ + + I AF E ++
Sbjct: 160 MFQKTLLPIIRSNAEI----FNAGNKTDAELIEIAHRMGSTIMAYAFDLENDEEEEEEAD 215
Query: 104 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 163
V S+ +VP+ +L ++ A + ++++ + RP KAGE I+ +
Sbjct: 216 GWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEILNYY 271
Query: 164 GPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTE-----DPQYQDKRMVAQRN 210
GP PNS+LL YG+V E + P+D +VE+AL + Q + + +
Sbjct: 272 GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIPGQVLEQIRGALEEDE 329
Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSLGPICP--VSPCMERA 267
V G + P D E +++ + + P + +R
Sbjct: 330 EFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKGIKKAQPDAIPDKRKRD 389
Query: 268 VLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+ Q LA +A +A YP ++SEDE +L +L+ + R+A + EKK+L + ++
Sbjct: 390 EIHQAVLAKTLEALVARYPTSISEDENLLK-QDLNQRTRMAIAVRLGEKKLLQEAITASS 448
Query: 326 DMIMLLPDVTVSPC 339
+ + D P
Sbjct: 449 GDVEMTMDDESGPA 462
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 7/190 (3%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A +
Sbjct: 134 MGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKANLL 188
Query: 61 KREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 189 KEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEENVSV 248
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
L+P +L + K A V +V AG+ I GP+ N +L++NYGF
Sbjct: 249 LLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYGFCL 307
Query: 180 EDNPYDRLVV 189
+NP D +V
Sbjct: 308 PNNPCDYRIV 317
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 142/350 (40%), Gaps = 44/350 (12%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 57
Q S W PY+ L Q + S + WS++ELA L S +I ++
Sbjct: 96 QVDDSKWAPYLAILPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTY 148
Query: 58 ---EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQK 111
+G++ E+ S+ Y +DIP + + A V +K
Sbjct: 149 ITPQGLQHSSTEM---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEK 205
Query: 112 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
L+ ++PL L A + + A L ++ +++ +P GE I G P S L
Sbjct: 206 TILS----MIPLADMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDL 261
Query: 172 LINYGFV-DEDNPYDRLVVEAAL----------NTEDPQY-QDKRMV----AQRNGKLSV 215
L YG+V D + YD + A L + P+ QDK V A+R G
Sbjct: 262 LRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDE 321
Query: 216 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 275
VH+ ++ +I D L + D S +++++ S + S LA
Sbjct: 322 SFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAIL 381
Query: 276 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
+AR Y T+ EDE +L + +L + +A Q+ EKK+L A ++ A
Sbjct: 382 LQARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREAA 431
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 141/333 (42%), Gaps = 33/333 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E + + SFW YI L + L + + + SPT + + I R
Sbjct: 170 EHVKNENSFWHHYISSLPSD-------YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIAR 222
Query: 63 EYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF- 118
+Y +F +LFQ + I FT+++++ A V + + S +
Sbjct: 223 QY-----AYF--NNLFQNSNDEASLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVE 275
Query: 119 ----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L+PL + +++ + VD + L +P+K GE ++++ G + NS L++
Sbjct: 276 NGINGLIPLWD-MCNHTNGYLSTQYKVDRSECLAC-KPFKKGEQVLIFYGERSNSDFLVH 333
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
GFV ++NP+D + ++ D + + + + G F++++G E ++L
Sbjct: 334 NGFVYDENPHDSFRLRLGISKSDKLHGLRCELLKDLGIPDSGDFYLYSGSEP-VRENLLA 392
Query: 235 YLRLGY--VSDTSEMQSVISSLGPI----CPVSPCMERAVLDQLADYFKARLAGYPATLS 288
+LR+ V + + +S S L + C + +E V + L D L Y
Sbjct: 393 FLRIFNMDVENLNHWKSHSSRLSDLMWKDCALDTKIESKVWNFLYDRINLLLKTYKG--D 450
Query: 289 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
E E + D N +R+ ++ EKK L++ L
Sbjct: 451 EVEVRVEDSNSTECRRLVRAQLKCEKKFLSSIL 483
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 253
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224
Query: 254 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284
Query: 313 EKKMLNACLQVTA 325
EK++L ++ A
Sbjct: 285 EKEILEKAVKSAA 297
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 7/190 (3%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A +
Sbjct: 124 MGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKANLL 178
Query: 61 KREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 179 KEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEENVSV 238
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
L+P +L + K A V +V AG+ I GP+ N +L++NYGF
Sbjct: 239 LLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYGFCL 297
Query: 180 EDNPYDRLVV 189
+NP D +V
Sbjct: 298 PNNPCDYRIV 307
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 86/170 (50%), Gaps = 9/170 (5%)
Query: 154 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
K+GE +++ + + N++L ++YGF+++ + + ++ DP + DK +A+ NG
Sbjct: 293 KSGEQVLIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGS 352
Query: 213 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERA 267
F + G MLPYLRL + T ++S+ + G + P+S E
Sbjct: 353 GETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEEL 410
Query: 268 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
+ + D K+ L+GY T+ EDE L +L+P+ +A + EKK+L
Sbjct: 411 ICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVL 459
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 143/358 (39%), Gaps = 63/358 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE----- 55
MYE +G +S W PYI L + E+P+ WS+ EL L S T++++ +
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKVGKASAEE 649
Query: 56 ----------RAEG----IKREYNELDTVWFMA--GSLFQQYPYDIPTEAFTFEIFKQAF 99
RA + Y++ D + GS Y +D F+
Sbjct: 650 MFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFD----------FQNED 699
Query: 100 VAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 158
+ + A+ +VP+ +L ++ A + DDA+ + R KAGE
Sbjct: 700 EEDEDETEEWVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEE 758
Query: 159 IVVWCGPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMVAQ 208
I + GP PNS+LL YG+V + + YD + V A+L Q DK
Sbjct: 759 IFNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARECL 817
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCMER 266
+ +L E + R + + +++S++ ++ P S +R
Sbjct: 818 DSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVDKR 877
Query: 267 A-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
+L + D +R YP ++SEDE +L ++ ++R A + EK+++
Sbjct: 878 KRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLI 932
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 148/379 (39%), Gaps = 78/379 (20%)
Query: 6 QGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKAEILE 55
+G SFW PY+ L +G ES PL LW+E E+ +L GSPT
Sbjct: 184 RGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLLSARA 243
Query: 56 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSL 114
++REY E L++++P+ E AF E F AF + S V L +
Sbjct: 244 LRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLPAENG 297
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VDDA-- 144
ALVP L +S+ C + + A DDA
Sbjct: 298 M--LALVPYAD-LANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSDDAGA 354
Query: 145 ----------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
V DR Y E + V G + N++LL+ YGFV + NPY+ +
Sbjct: 355 VANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFVSDRNPYNSVE 414
Query: 189 VEAALNTEDPQ----YQDKRMVAQRNGK--LSVQVFHVHAGREKEAISDMLPYLRLGYVS 242
V +L+ + KR G+ + F ++A R + +L + L
Sbjct: 415 VCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELMQLLRFASL--TE 472
Query: 243 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN---- 298
+ S + + PV+ E A L K L YP + ED+A L D +
Sbjct: 473 QDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSADEDDAALKDKSMAQL 532
Query: 299 LHPKKRVATQLVRMEKKML 317
L K+R++ +L R EK++L
Sbjct: 533 LSRKQRLSVRLRRSEKRIL 551
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 252
K V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412
Query: 313 EKKMLNACLQVTA 325
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/387 (21%), Positives = 150/387 (38%), Gaps = 74/387 (19%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I + AE
Sbjct: 593 IYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNAEE 645
Query: 60 I--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
+ + EL + GS Y +D F+
Sbjct: 646 MFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQNED 695
Query: 100 VAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 158
+ + A+ +VP+ +L ++ A + DDA+ + R KAGE
Sbjct: 696 EEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAGEE 754
Query: 159 IVVWCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED--------- 197
I+ + GP PNS+LL YG+V + P+ D L L++E
Sbjct: 755 ILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLDSAREHLD 814
Query: 198 -PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 256
++++ ++ + + + + + + E D+ L+ M I + P
Sbjct: 815 LDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLKAIRKVDP 866
Query: 257 ICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
C V V L A + YP T+ EDE +L+ NL +++ A + EK+
Sbjct: 867 SCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVTVRLGEKR 926
Query: 316 MLNACLQVTADMIMLLPDVTVSPCPAP 342
+L + ++ + D + PAP
Sbjct: 927 LLQEARVLLSE---IASDAILDDAPAP 950
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 80/203 (39%), Gaps = 36/203 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++EK+ G+ S W PYI L R + S W + EL + S E + R I
Sbjct: 114 LFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELTMIQESSLSYETMSRRAAI 168
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ E+NE+ +V F +D + A V+ V+ + +A A
Sbjct: 169 REEFNEMQSVPFA-----DFMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLYIALFCA- 222
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
QL D+ Y AGE + + GP N+ L +++GF
Sbjct: 223 -------------------------QLFADKNYAAGEQVTISFGPLCNASLALDFGFTVP 257
Query: 181 DNPYDRLVVEAALNTEDPQYQDK 203
NP+D++ + ++ D ++K
Sbjct: 258 YNPWDKVQLWLGISRRDSLRKEK 280
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 93/211 (44%), Gaps = 11/211 (5%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
+N D + ++ D +++ + + G SV F + G E ISD +L +LR
Sbjct: 78 YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 238 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+L + L K++A QL EK++L L+
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALE 224
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L + +++V +P AGE I G PNS LL YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319
Query: 179 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 226
D E NP D + + A L R + G L V+V F + +
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRVDFWLEEADDDT 374
Query: 227 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
E ++L ++RL +S T + + + + G + +E +L + D K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430
Query: 280 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 322
L YP T+ EDE++L + NL KR A + EK++L LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 76/171 (44%), Gaps = 18/171 (10%)
Query: 37 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 96
+E E++ L G+P +E + I+ +Y V +L YP DI + T + F
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPVL---QALTAAYPDDITPDLVTEDKFI 121
Query: 97 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 148
A S + ++ V A R LVP+ P ++ Y L A D+++L
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRY-----GRLDAATDSLRLR 176
Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 197
R AGE + GP PN KLL+ YGF DNP+D + EA N D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 149/371 (40%), Gaps = 72/371 (19%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE QG S W Y L + ++ + W +L YL GS ++I +
Sbjct: 106 IYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEADE 158
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR--- 117
L + AG F Q P+E+ ++ + + + L+ +
Sbjct: 159 MFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQEDE 217
Query: 118 -----------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 166
+VP+ +L ++ A + +D + + RP KAGE I+ + GP
Sbjct: 218 EWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYYGPH 276
Query: 167 PNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK--RMVAQRNGKLSVQ 216
PNS+LL YG+V + P+D +V++ L TE + D+ + VA+ ++
Sbjct: 277 PNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVWKQVAEHVDPEDLE 333
Query: 217 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---------------GPICP-- 259
V E S+ G++ +++Q V + L G + P
Sbjct: 334 DVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLKAIKKVRGDLIPDK 386
Query: 260 -----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 314
V C+ + L +L LA YP T EDEAML N+ ++++A ++ EK
Sbjct: 387 RKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTSRQKLAVEVRLGEK 439
Query: 315 KMLNACLQVTA 325
+++ LQV
Sbjct: 440 RLIKEALQVAG 450
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 142/355 (40%), Gaps = 36/355 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M+E+ +G +SF+ P+I + + P+ W+E EL L G+ + +
Sbjct: 89 MHERSKGGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHL 141
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+R++ + + F +PT T + + A + S VS ++
Sbjct: 142 ERDFENIHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSKGGKYLH 193
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLV---------VDRPYKAGESIVVWCGPQPNSKL 171
V L P + ++ + +DD V V AG ++ + G N+KL
Sbjct: 194 V-LCPAMDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQYSNAKL 251
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 231
L +YGFV +N + + DP ++ K+ V N Q + H + +
Sbjct: 252 LYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLLSNDVDE 311
Query: 232 -MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 290
+L LR+ +++ Q + I V E AV + L + +L+ Y TL ED
Sbjct: 312 RLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYATTLEED 369
Query: 291 EAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYA 344
EA+L + K R+A + VRME K QVT +I L S P A
Sbjct: 370 EAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPDA 418
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 84/197 (42%), Gaps = 28/197 (14%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+GK S W PYI L + A+ P W+E ++ L G+ I E + +K EY
Sbjct: 115 KGKSSLWAPYISTLTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYK 172
Query: 66 ELDTVWFMAGS----LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
+ + GS + Q Y+ FT F+ + + +S ++++ L+
Sbjct: 173 QARKILKQEGSPDYRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LL 224
Query: 122 PLGPPLLAYS--------------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
P G + +S ++ L + A +L+ Y+ G+ + G +
Sbjct: 225 PEGAKIDDFSILQPLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNYGKKT 284
Query: 168 NSKLLINYGFVDEDNPY 184
NS+LL+ YGFV E+N Y
Sbjct: 285 NSELLLGYGFVTENNDY 301
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/383 (21%), Positives = 153/383 (39%), Gaps = 71/383 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
+YE +G S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155
Query: 60 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
+ R +EL + GS Y +D+ + E ++
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
+ +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264
Query: 161 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 196
+ GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 265 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILKQIRGEFEG 322
Query: 197 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 256
+ +++D ++ + G+++ + + D+ L+ S ++ V S P
Sbjct: 323 EEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLKGIKKVQSDTIP 378
Query: 257 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
+ +AVL + + AR YP ++SED+ +L NL + R+AT + EKK+
Sbjct: 379 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARMATVVRLGEKKL 435
Query: 317 LNACLQVTADMIMLLPDVTVSPC 339
L + ++ + + D P
Sbjct: 436 LQEAIATFSEDVEMTMDDESGPA 458
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 107/252 (42%), Gaps = 40/252 (15%)
Query: 1 MYEK-KQGKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPT 49
+YEK G +S W P + L + G L + +P W +E A L G+P
Sbjct: 249 VYEKYALGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPL 307
Query: 50 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 109
+ ++ E R++ L F A L + +P PTE +T F+ A A + + +
Sbjct: 308 LDDAVKARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTV 361
Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAG 156
Q ++ PP A+L AV DDA+ L + R +AG
Sbjct: 362 QAETVGGASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAG 421
Query: 157 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR-----NG 211
E I V G + N++LL+ YGF DNPYD + L+ E PQ + + + A R
Sbjct: 422 EEIFVSYGAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRERVLHRA 477
Query: 212 KLSVQVFHVHAG 223
KLS+ V G
Sbjct: 478 KLSLSPHSVRCG 489
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 91/211 (43%), Gaps = 11/211 (5%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P+ + + A D + R ++ GE I + GP+ NS ++ GFV
Sbjct: 31 ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM-LPYLR 237
DN D + ++ D +++ + + SV F + G E ISDM L +LR
Sbjct: 91 YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148
Query: 238 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 149 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 207
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+L + L K++ QL EKK+L L+
Sbjct: 208 ELL-ETTLPQVKKMTVQLRVTEKKILLGALE 237
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 31/320 (9%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
P+ WSE EL++L GS +I ER I+++Y+ + V D F+
Sbjct: 144 PIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPSFARFSL 190
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 151
+ F A + V S L + + ALVP L Y + + DA +
Sbjct: 191 DRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLG 249
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKR 204
G + G + N + L+NYGF EDN + ++++ L+ D Q + DKR
Sbjct: 250 TIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSPADGQLFYDKR 309
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---PVS 261
+G ++ + + + RL + T E S + P P+S
Sbjct: 310 AYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEEEFSTMKMKSPAHSSPPIS 366
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNA 319
E L+ L + +L+ Y T+ ED +L Y L + A +R EK++
Sbjct: 367 FDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCRY 426
Query: 320 CLQVTADMIMLLPDVTVSPC 339
Q AD ++ L + ++ C
Sbjct: 427 -FQELADKVIPLFSLPLAEC 445
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 88/212 (41%), Gaps = 29/212 (13%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELD 68
S + Y+R L R+ G L W E ++ L GSP++ +ER + +E+
Sbjct: 87 SRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAAIDEIR 139
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
+P P + AF + S ++ L A ALVP +L
Sbjct: 140 A----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPWAD-ML 180
Query: 129 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDR 186
+ C A + AV L DR YK GE + GP+P+S+LLI+YGF +NP D
Sbjct: 181 NHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGENPDDE 240
Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
V ++ D K +R G V+ F
Sbjct: 241 FEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 84/383 (21%), Positives = 155/383 (40%), Gaps = 71/383 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
+YE QG S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690
Query: 60 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
+ R EL + GS Y +D+ + E ++
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
+ + +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799
Query: 161 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 196
+ GP PNS+LL YG+V E + P+D +VE+ L E
Sbjct: 800 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILEQIRGEFEE 857
Query: 197 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 256
+ +++D ++ + G+++ + + D+ L+ ++ ++Q S P
Sbjct: 858 EEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIKKLQ---SDTIP 913
Query: 257 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
+ +AVL + + AR YP ++SED+ +L NL + R+AT + EKK+
Sbjct: 914 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARMATVVRLGEKKL 970
Query: 317 LNACLQVTADMIMLLPDVTVSPC 339
L + ++ + + D P
Sbjct: 971 LQEAIATFSEDVEMTMDDESGPA 993
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 65/152 (42%), Gaps = 20/152 (13%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP--TEAF 90
P L+SE EL YL G+ + E+ E IK +Y + V IP F
Sbjct: 260 PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV--------------IPEFKSEF 305
Query: 91 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 150
+FE F+ AF+ S V + KV + +VPL L S + DDA
Sbjct: 306 SFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQEDSEWVFDDATNCFTV 364
Query: 151 RPYKA---GESIVVWCGPQPNSKLLINYGFVD 179
+ K + I G + NSKL +NYGFVD
Sbjct: 365 KALKKIQRNQQIHFSYGSKCNSKLFLNYGFVD 396
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)
Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
Y+ GE I ++ G + N+ L++ GFV DN + + +LN + Q++ ++ + ++ G
Sbjct: 74 YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133
Query: 213 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 263
S F+V G IS ++L + R LG+ +QS + L P C P
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188
Query: 264 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 313
+E ++ +++ Y RL TL +DEA+L + L K + Q +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248
Query: 314 KKMLNACLQ 322
KK+L+ L+
Sbjct: 249 KKILSEALE 257
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 62/321 (19%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
+ S L +S+ E L GSP +++ + K + L SLF++ + A
Sbjct: 130 LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGRL-------KSLFKEQQFS--KCA 180
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVD---- 142
T++ K A+ V S ++ + +L F V L P L Y + + AA D
Sbjct: 181 MTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAP-FLDYFNHAEDAQAAYDFDYD 239
Query: 143 -DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLV---VEAALNT-- 195
A+++V +P K GE I + G Q NS LLI+YGF+D+ + V VE LNT
Sbjct: 240 ESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIP 299
Query: 196 -EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 252
DPQ +K + + + + +++F E IS L Y
Sbjct: 300 ASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKISKYLSYKNF-------------- 345
Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQL 309
++L L ++ YP T+ ED A++ T++ ++ + +
Sbjct: 346 --------------SLLPYLKSLIDMKMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSII 391
Query: 310 VRMEK----KMLNACLQVTAD 326
+R+++ K + A +QV D
Sbjct: 392 MRLQEKETLKEIGALIQVKID 412
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + ++ GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 252
K V R G + VF +H + + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
PVS E + L D L Y T+ ED+++L +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412
Query: 313 EKKMLNACLQVTA 325
EK++L ++ A
Sbjct: 413 EKEILERAVRSAA 425
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 77/189 (40%), Gaps = 28/189 (14%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 80
+ +PL WS+ E L GS YN LD W M A L Q
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205
Query: 81 YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 136
+ D+P ++ + FK A+ + + + S R+ + P+ K
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265
Query: 137 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
A +L + + GE I + + N++ L+ YGFV E NP+D + + A++ +
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGIAASIGND 325
Query: 197 DPQYQDKRM 205
P Y+DK +
Sbjct: 326 QPFYRDKSL 334
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 140/333 (42%), Gaps = 40/333 (12%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
S W PY+ L R + + L +S ++ L GSP+ E L++ GI ++Y
Sbjct: 86 STWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY----- 133
Query: 70 VWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
V+F +P +FTF+ F+ A V + ++ S AL+P+
Sbjct: 134 VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALIPM----W 189
Query: 129 AYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ C DD+ V+ + +P +AG+ + ++ G + N+ L + GFV +
Sbjct: 190 DMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNADRLFHNGFVYTEAEE 249
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVSD 243
D + ++ ++ D Y K + G L R E IS ++ +LR+ + +
Sbjct: 250 DWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPISPELRIFLRV-FSMN 307
Query: 244 TSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
T E++ ++ L +C + +L +F R L Y T EDE
Sbjct: 308 TGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLILGQYKTTKQEDE 367
Query: 292 AMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 322
A+L+ D LH R +L E+ +L + L+
Sbjct: 368 ALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 126/319 (39%), Gaps = 29/319 (9%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
P+ WS+ EL +L GS +I ER I+++Y+ + V D F+
Sbjct: 152 PIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARFSL 198
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 151
+ F A + V S L + + ALVP L Y + + DA +
Sbjct: 199 DRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLG 257
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKR 204
G + G + N + L+NYGF EDN + ++++ L+ D Q + DKR
Sbjct: 258 TIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKR 317
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPICPVSP 262
+G ++ + + + RL V+ E S + S P+S
Sbjct: 318 AYLHESGIYTMDA-RLSCSHSDANTREGFSFARL-IVATEDEFSSMKMKSPAHSSPPISF 375
Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 320
E L L D +L+ Y T+ ED +L Y L + A +R EK++
Sbjct: 376 DNEIRALQYLRDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCR-Y 434
Query: 321 LQVTADMIMLLPDVTVSPC 339
Q AD ++ L + ++ C
Sbjct: 435 FQELADKVIQLFSLPLAEC 453
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/323 (20%), Positives = 140/323 (43%), Gaps = 33/323 (10%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
G KS W Y L R+ + W+ ++ +L GS T L E IK ++
Sbjct: 156 GDKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLR 210
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP- 125
L+ F L +YP I F ++ A + S + LQ + ++ +LVP
Sbjct: 211 LEKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADF 261
Query: 126 ----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
P K + + + + + D+ Y + I G + N +LL+ YGF+ E
Sbjct: 262 MNHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELLLLYGFILER 321
Query: 182 NPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
NP+D + + +L+ +D ++ K+ ++ +++ +F+ +E + +LR
Sbjct: 322 NPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE------LYEFLRF 375
Query: 239 GYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AML 294
+S+ E+ S +S + +E+ + + + L Y +SE++ ++
Sbjct: 376 C-ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKVSEEKILNSLN 434
Query: 295 TDYNLHPKKRVATQLVRMEKKML 317
+++ + +++A + + EKK++
Sbjct: 435 SNFLISKNQKMALKQSKCEKKII 457
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 93/197 (47%), Gaps = 22/197 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E +GK+SFW PY+ L R+ ++PL +S+ +L +L G +A +E + I R
Sbjct: 94 EWLKGKESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWR 146
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA--- 119
E E A S+ Q+ YD TE +T+E++ A S + + R
Sbjct: 147 EEFE------AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHED 198
Query: 120 -LVPLGPPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
+P+ PL+ + A + D +++++ AG + GP+ N +LL+ Y
Sbjct: 199 DTMPILFPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEVYNNYGPKANEELLMGY 258
Query: 176 GFVDEDNPYDRLVVEAA 192
GF NP+D +++++
Sbjct: 259 GFTLLQNPFDSFLLKSS 275
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 103/240 (42%), Gaps = 31/240 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E +G +S W YI L + +PL ++E + A+L + + ER K
Sbjct: 89 ESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQERLHIWKH 141
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSLARRFALV 121
EY E A SL + PTE FTF+++ A V C + +
Sbjct: 142 EYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLIYKDSEST 186
Query: 122 PLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
P+ PL+ + + K K + D +VQL+ G + GP+ N +LL+ Y
Sbjct: 187 PILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGNEELLMGY 246
Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREKEAISDML 233
GF DNP+D + ++ A++ + P K + + + +LS VF + +KE +L
Sbjct: 247 GFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDKEIFQKIL 306
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/254 (24%), Positives = 103/254 (40%), Gaps = 49/254 (19%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K SFW PYI L + + L ++ ELA L SP L+ I R+
Sbjct: 168 EKNNPNSFWKPYIDVLPEK-------YSTVLYFNSEELAELRPSPVFESSLKLYRSIVRQ 220
Query: 64 Y----NELDTVWFMAGSLFQQYPYDIPT-----EAFTFEIFKQAFVAV---QSCVVHLQK 111
Y N++ T+ D+P + FTF+ ++ A V Q+ +V
Sbjct: 221 YAYFYNKIHTI-------------DLPVLKNLQDIFTFDNYRWAVSTVMTRQNNIVQGTA 267
Query: 112 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCG 164
+L A +PL C + L ++R Y+ E I ++ G
Sbjct: 268 FTLTN--AFIPLW-------DMCNHKHGKITTDFNLELNRGECYALQDYRRDEQIFIFYG 318
Query: 165 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
+PNS L ++ GFV DN YD L + ++ D K + + G V F ++ G
Sbjct: 319 ARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGA 378
Query: 225 EKEAISDMLPYLRL 238
++ ++L ++R+
Sbjct: 379 SPISV-ELLAFIRI 391
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/336 (22%), Positives = 134/336 (39%), Gaps = 25/336 (7%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 67
KSFW PY+ L + ++ + ++ E+ L GSP + L I R+Y+
Sbjct: 167 KSFWKPYLDALPS-------SYDTVMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFY 219
Query: 68 ----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
V +L + Y+ A + + +Q + Q + K L AL+PL
Sbjct: 220 SLLQKNVDPALSNLRANFTYNDYRWAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPL 279
Query: 124 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 183
+ R + GE + ++ G + ++ I+ GFVD +N
Sbjct: 280 WDFCNHQDGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNA 339
Query: 184 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ------VFHVHAGREKEAISDMLPYLR 237
+D L ++ L+ DP + + + LS + F + AG + +L +LR
Sbjct: 340 HDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLR 398
Query: 238 LGYVSDTSEMQSVISSLGPICPVSPC-MERAVLDQLADYFKAR----LAGYPATLSEDEA 292
L ++ S + + S C +E V D+ + KAR L YP T D
Sbjct: 399 LFCMTKDSLDRWLQSDNASNLMHEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLK 458
Query: 293 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
ML + +L +R+ L EK++L + ++ A I
Sbjct: 459 MLEE-DLSSHRRMCVLLRLAEKRILLSAIECAAQRI 493
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 100/240 (41%), Gaps = 55/240 (22%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYE+ +GK S W Y++ L RQ P LWSE E+ L G+ + E
Sbjct: 82 MYERSKGKGSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLL 134
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRF 118
+K ++ E L ++ P + P + FTFE +++A +S V ++ +
Sbjct: 135 MKEDWEE------NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGY 184
Query: 119 ALVPLGPPLLAYSSKCKA-----MLAA-----------VDDA---------------VQL 147
+VPL ++ K A ML A +DD +++
Sbjct: 185 GMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEM 241
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 205
V+ + AG I G N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 242 VMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 94/213 (44%), Gaps = 24/213 (11%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEILERAEGI----- 60
+ FW PYIR L G+ + +PL + E E L +L G + A ER I
Sbjct: 119 EHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASRERRLAIWRGNY 172
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVVHLQKVSLARRF 118
+R Y L + F + Y +D+ A T I +AF A + S + L+ + R
Sbjct: 173 ERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIPELKNAEVDRVS 228
Query: 119 ALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
L+PL + A + K K A D++ LVV AGE + GP+ N +L++NYG
Sbjct: 229 VLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGPRNNEQLMMNYG 285
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
F DNP + VV + P Q K Q
Sbjct: 286 FCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQH 318
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/331 (19%), Positives = 147/331 (44%), Gaps = 41/331 (12%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+ KKSFW PYI L + L W EL ++ GS + +K +Y
Sbjct: 153 KNKKSFWFPYIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYE 207
Query: 66 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 125
++ +FQ+ P++ F ++ ++ + + S + LQ+ ++ L+P
Sbjct: 208 MVNK------EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQE---TKKVVLIPY-I 257
Query: 126 PLLAYSSKCKAMLAA----VDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
LL ++ + ++ + D+ ++VV D+ + + + G + N +LL YGF+
Sbjct: 258 DLLNHNPFSSSFISYRKIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIA 317
Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL- 238
E NPYD +++ +++ +D +++K+ N K + + + + +M+ ++++
Sbjct: 318 ERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIKIC 374
Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPC----MERAVLDQLADYFKARLAGYPATLSEDE 291
++D + + I + + C +E+++ DY R L E+
Sbjct: 375 LFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDYENLR----NIMLKENL 430
Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
++D ++++ + +EKK+LN L+
Sbjct: 431 LHISD-----NQKISIKYNALEKKILNRFLE 456
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 128/305 (41%), Gaps = 20/305 (6%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
+ + ++E EL GS A + + ++ +Y +L L Q+ P + FT
Sbjct: 107 ASIFFTEDELQVCAGSSLYALTRQLEQRVRDDYRQLLV------PLLSQHRDLFPLDQFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLV 148
E +K A ++ S + VS LV +L +S +C A D + ++
Sbjct: 161 IEDYKWALCSIWSRAMDF-AVSGTTSVRLVAPLADMLNHSPDVKQCHAYDPTSGD-LSIL 218
Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
+ Y+ G+ + ++ G PN++LL YGFV DNP D + + P Y+ K +
Sbjct: 219 AAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWA 278
Query: 209 RNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 265
G S + K+ + +++L YLR+ + D S + + L G VS E
Sbjct: 279 LAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGKVSDGNE 334
Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQV 323
VL L D + L G+ L + EA L DY A + E+++L +
Sbjct: 335 MQVLQFLVDSIGSLLEGFGIPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVLTRAKRT 394
Query: 324 TADMI 328
D++
Sbjct: 395 AEDLL 399
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 94/211 (44%), Gaps = 22/211 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ K G+ S W PYI L Q + + + W+E+EL + S E + + I
Sbjct: 113 LIHKNLGQDSEWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQI 167
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++++ E+ V FQ P+ FT++ F A V S + + +L
Sbjct: 168 EKDFLEIKPV-------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSL 213
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+P L + +A++ + DD ++ DR Y GE +++ G N+ L++++GF
Sbjct: 214 IPFAD-FLNHDGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFT 272
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
N YD++ ++ + DP K + Q+
Sbjct: 273 IPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 74/183 (40%), Gaps = 12/183 (6%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G+ SFW PYI L + + P W E ++ +L G+ I E IKREY
Sbjct: 119 KGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQANIKREYK 176
Query: 66 ELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-VSLARRFAL 120
+ V F + Q Y FT F+ + + QS ++ + AR
Sbjct: 177 QARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLLPSAREIDD 236
Query: 121 VPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
+ PL ++ D + QL+ Y+ G+ + G + NS+LL+ Y
Sbjct: 237 FSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKTNSELLLAY 296
Query: 176 GFV 178
GF+
Sbjct: 297 GFI 299
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 89/211 (42%), Gaps = 7/211 (3%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G + FW PYIR L Q G ++ +P + +L +L G+ A +R E +K +Y
Sbjct: 95 RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 149
Query: 66 ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
+ T AG Y +D+ A + I + V S V +S + L+P+
Sbjct: 150 KGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGVFPETDLSEEKLSVLLPI- 208
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ + K A D V VV AG+ I GP+ N +L++NYGF NP
Sbjct: 209 IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 268
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
D +V P Y K Q L+V
Sbjct: 269 DHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 28/189 (14%)
Query: 4 KKQGK--KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
K Q K S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 77 KGQAKLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMIT 131
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
E+ ++ + LF + + E FK + V S + +++ LV
Sbjct: 132 TEFGQVQNALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLV 177
Query: 122 PLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
+ P+L + +S K + + + DR Y + I + G N++L ++YG
Sbjct: 178 MI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYG 235
Query: 177 FVDEDNPYD 185
F +NPYD
Sbjct: 236 FAVPENPYD 244
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 7/133 (5%)
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 150
E FK +F + S +V L S+ + ALVP +L +S + L + D
Sbjct: 2 ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQ 208
RPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K + +
Sbjct: 59 RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLK 118
Query: 209 RNGKLSVQVFHVH 221
+ G Q F +
Sbjct: 119 KYGLSGSQCFPIR 131
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)
Query: 12 WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 60
WL I L R+R R G L +S + WSE EL L GS + E +
Sbjct: 77 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 136
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
K E +L+ + P + + F I + +AF +++ +V + L
Sbjct: 137 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 196
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
A V AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 197 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 254
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
++ + + ++ DP + DK VA+ NG F + R +LPYLR
Sbjct: 255 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 312
Query: 238 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
L + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 313 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 372
Query: 293 MLTDYNLHPKKRVATQLVRMEKKML 317
L + NL + +A + EK +L
Sbjct: 373 -LKEGNLDSRLAIAVGIREGEKMVL 396
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)
Query: 12 WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 60
WL I L R+R R G L +S + WSE EL L GS + E +
Sbjct: 125 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 184
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
K E +L+ + P + + F I + +AF +++ +V + L
Sbjct: 185 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 244
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
A V AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 245 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 302
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
++ + + ++ DP + DK VA+ NG F + R +LPYLR
Sbjct: 303 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 360
Query: 238 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
L + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 361 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 420
Query: 293 MLTDYNLHPKKRVATQLVRMEKKML 317
L + NL + +A + EK +L
Sbjct: 421 -LKEGNLDSRLAIAVGIREGEKMVL 444
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)
Query: 12 WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 60
WL I L R+R R G L +S + WSE EL L GS + E +
Sbjct: 81 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 140
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
K E +L+ + P + + F I + +AF +++ +V + L
Sbjct: 141 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 200
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
A V AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 201 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 258
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
++ + + ++ DP + DK VA+ NG F + R +LPYLR
Sbjct: 259 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 316
Query: 238 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
L + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 317 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 376
Query: 293 MLTDYNLHPKKRVATQLVRMEKKML 317
L + NL + +A + EK +L
Sbjct: 377 -LKEGNLDSRLAIAVGIREGEKMVL 400
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/188 (23%), Positives = 83/188 (44%), Gaps = 24/188 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EK +G S W YI L + + P+L + EL + P E+++ + I
Sbjct: 606 LREKAKGSSSAWASYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSI 658
Query: 61 KREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 117
+ ++ L +V +AG+ F++Y + + + AFT +F A + ++ ++
Sbjct: 659 RESFSLL-SVDDLAGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYH 717
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+ + P+ V++V R K GE + GP+ N L + YGF
Sbjct: 718 YQADWMSQPIWG-------------GKVEIVARRDIKKGEELFASFGPRANDNLFLYYGF 764
Query: 178 VDEDNPYD 185
V +DNP+D
Sbjct: 765 VLKDNPFD 772
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)
Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
LL +SS+ +++ +++V ++ + GE++V+ GP N LL++YGFV NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177
Query: 187 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 229
LV + +++ +DP ++ + N G S Q+ V G +
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235
Query: 230 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
+L +R+ + D E+ V + + P+ ER + L LA +P
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295
Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
+ ED++ L ++ R+A Q ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 147/370 (39%), Gaps = 62/370 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +R +
Sbjct: 110 LYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKKRCRFL 161
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKVSLARR 117
+ +EL + + P +I A F A AV S + V+
Sbjct: 162 LQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGVTNKLC 213
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVWCGPQP 167
A++ PL+ + A +++ + ++V R + G +I + GP
Sbjct: 214 SAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLS 270
Query: 168 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH 219
N LL++YGFV DNP+DR+ L + ++ RM+A R G S QV
Sbjct: 271 NDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDR 326
Query: 220 --------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVS 261
V G +E +L LR+ + E + ++S + G VS
Sbjct: 327 LKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVS 386
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
E VL L + T+ EDEA L+D +L R+A Q +K+++ L
Sbjct: 387 SDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVL 446
Query: 322 QVTADMIMLL 331
+ +M L
Sbjct: 447 ESLKKRLMDL 456
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 85/201 (42%), Gaps = 31/201 (15%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
E K + Y++ L R G S L W E+++ L GSP+ LER +
Sbjct: 179 EDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASVA 231
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
E+ + Q+ +DI +F S ++ L+ SL ALV
Sbjct: 232 AAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLALV 273
Query: 122 PLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
P +L + C+A L V L DR Y+ GE + G +P+S+LLI+YGF
Sbjct: 274 PWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFAP 332
Query: 180 E--DNPYDRLVVEAALNTEDP 198
DNP D + ++ EDP
Sbjct: 333 AVGDNPDDEYALNLQIDEEDP 353
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 121/301 (40%), Gaps = 38/301 (12%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
P+ WS+ EL++L GS +I ER I+++Y+ + D F+
Sbjct: 227 PIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA-------------DPSFSRFSL 273
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 151
E F A + V S L + + ALVP L Y + + D +
Sbjct: 274 ERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGFTITSLE 332
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRM 205
+G + G + N + L+NYGF EDN + ++V+ L+ D Q +
Sbjct: 333 SICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQLDPGDGQ-----L 387
Query: 206 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDTSEMQSVISSLGPIC-- 258
+ + L + R + SD P R G+ ++ T + S + P
Sbjct: 388 LYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAATEDEFSSMKMRSPAHAS 445
Query: 259 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNLHPKKRVATQLVRMEKK 315
P+S E A L+ L +L YP +L E EA+L +Y L+ + A +R EK+
Sbjct: 446 PPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFFIRGEKQ 505
Query: 316 M 316
+
Sbjct: 506 V 506
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)
Query: 153 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 207
++ GE +++ G + N +L ++YGFV+ + + D + ++ DP + DK +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195
Query: 208 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 260
+ NG + F + G + ML +LRL + T SV L PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251
Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
S E A+ + D ++ L+GY T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 83/192 (43%), Gaps = 16/192 (8%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
+K G+ S W P++ L R +P+ T L + E+ R I+R
Sbjct: 113 QKLLGEASRWWPFVDSLPR-------TFTTPVFLRRTVFESLP-KDLREEVHTRITSIQR 164
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF---- 118
+ +L + + G + ++ + FT+ F A+ AV + + Q + + +
Sbjct: 165 TFLKLKVL--LGGHVEEEPEVQSLSTGFTWNNFVWAWTAVNTRCIFAQGSNSSSLWENDH 222
Query: 119 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
AL P L + K A V + +++ + + A E + + GP N +L ++YGF
Sbjct: 223 CALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISYGPHSNRRLFLDYGF 281
Query: 178 VDEDNPYDRLVV 189
V DNP D +VV
Sbjct: 282 VLPDNPNDVVVV 293
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/346 (23%), Positives = 131/346 (37%), Gaps = 39/346 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M + + SFW PY L + V P+ W++ ++ L GSP A +L + +
Sbjct: 52 MLHELRKPDSFWRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQA 104
Query: 61 KREYNELDTVWFMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVVHLQKV 112
+ + E + ++YP + + + +++ F+ + S +
Sbjct: 105 RDWHTE------HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDL 158
Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNS 169
+VP+ + + DD Q V +PY GE + + N
Sbjct: 159 KDTWEPHMVPMADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNF 218
Query: 170 KLLINYGFVDEDN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 222
+LL Y + EDN D + E + R + QR L+ Q + V
Sbjct: 219 ELLKTYAMMVEDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV-- 274
Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
+ +E D++ LRL ++ T S+ PVS E V D +A K L+
Sbjct: 275 -KSEEFPLDLVQALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQ 331
Query: 283 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 327
YP T ED AML D L R+A R +K L V A+M
Sbjct: 332 YPITAQEDAAMLAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 138/360 (38%), Gaps = 55/360 (15%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEG 59
+YE QG S W PY L + + + W ++L L GS ++I E A+
Sbjct: 104 IYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEADE 156
Query: 60 IKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 97
+ R EL + GS+ Y +D+ E +
Sbjct: 157 MFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEEDD 216
Query: 98 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 157
++ + L +VP+ +L ++ A + DD + + RP AGE
Sbjct: 217 EWIEDRDGKTML---------GMVPMAD-ILNADAEFNAHVNHGDDELTVTALRPIPAGE 266
Query: 158 SIVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALNTEDPQYQDKRMVA 207
I+ + GP PNS+LL YG+V + P+D + V L D ++ +
Sbjct: 267 EILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLKIGDDVWKQVQEYV 326
Query: 208 QRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISSL----GPICPVSP 262
V V +G + E + +R ++++V+ ++ G + P
Sbjct: 327 DPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKAIKKINGDLIPDKR 386
Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
+ + + L+ YP + EDEA+L +L ++R+A + EKK+L L+
Sbjct: 387 KRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLKEALE 446
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 125/298 (41%), Gaps = 30/298 (10%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 91
P+ W+E E+ L GS ++ ER + I+ +Y G + YP P + T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVD 150
E FK A + V S L ++ R ALVP L Y + K +
Sbjct: 259 LEEFKWARMCVCSRNFGLD-INGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTL 317
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKR 204
G + G + N + L+NYGF E+N + + + L+ DP Q K
Sbjct: 318 HRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKA 377
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVS 261
+ +G +V + G + +A+ L LR+ V+D +EM + + ++ + P+S
Sbjct: 378 RFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLS 434
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 317
E A +++L L YP TL ED A L + L P +R A V EK +L
Sbjct: 435 VRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 128/305 (41%), Gaps = 20/305 (6%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
+ + +++ EL GS A + + + +Y +L +L Q+ P + FT
Sbjct: 107 ASIFFTDDELQVCKGSSLYALTPQLEQRVHDDYRQLLV------ALLSQHRDLFPLDQFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLV 148
E +K A ++ S + A + PL +L +S +C A D + ++
Sbjct: 161 IEDYKWALCSIWSRAMDFAVSETASVRLVAPLAD-MLNHSPDVKQCHAYDPTSGD-LSIL 218
Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
+ Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K +
Sbjct: 219 AAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWA 278
Query: 209 RNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 265
G S + K+ + +++L YLR+ + D S + + L G V+ E
Sbjct: 279 LAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLRLVNGTDGKVNDGNE 334
Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQV 323
VL L D + L G+ L + EA L DY A + E+++L +
Sbjct: 335 IQVLQFLVDSIGSLLEGFGIPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVLTRAKKT 394
Query: 324 TADMI 328
D++
Sbjct: 395 AEDLL 399
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 90/213 (42%), Gaps = 30/213 (14%)
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+++V R + G +I + GP N LL++YGFV DNP+DR+ L + ++ R
Sbjct: 248 LKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENAR 303
Query: 205 MVA--QRNGK------LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVS 242
M+A R G S QV V G +E +L LR+ +
Sbjct: 304 MIAGLSRTGSPPFSSPASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAE 363
Query: 243 DTS--EMQSVIS--SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 298
E + ++S + G VS E VL L + T+ EDEA L+D +
Sbjct: 364 SQEPLERRELVSLQAWGVESMVSSDNEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKS 423
Query: 299 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
L R+A Q +K+++ L+ +M L
Sbjct: 424 LAETSRIAVQFRLTKKRLVVRVLESLKKRLMDL 456
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 103/217 (47%), Gaps = 35/217 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EK +GK SFW PYIR L + +P+ ++E+EL L+ S + E+A +
Sbjct: 244 LLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSPS-----LQEKARDL 291
Query: 61 KRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS--- 113
K+E +N+L+ F+ L + FTF+ F+ A+ +++ ++ +
Sbjct: 292 KKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVLKTRTLYQEDCRSPY 343
Query: 114 LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQP 167
L+ + LVP+ L+ +S KA ++ V PY+ + + + G +
Sbjct: 344 LSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPYRKYDQVFISYGFEE 402
Query: 168 NSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 203
N++L++ +GF +NP D + + + E PQ D+
Sbjct: 403 NTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 90/199 (45%), Gaps = 25/199 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E K G+ S W PYI L Q G + S + WSE EL + S E + + I++
Sbjct: 123 EWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMIQQSSVYQETINQKAQIQK 177
Query: 63 EYNELDTVW-FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC-VVHLQKVSLARRFAL 120
++ + V + +LF+ DI + F + +C +V + + +L
Sbjct: 178 DFLAIKPVLHHFSENLFK----DISLKEF-----------MHACALVGSRAWGSTKGLSL 222
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+P + + ++L +D +++ DR Y GE +++ G PN+ LL+++GF
Sbjct: 223 IPFA-DFVNHDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFT 281
Query: 179 DEDNPYDRLVVEAALNTED 197
N YD++ ++ + D
Sbjct: 282 LPYNIYDQVQIQVNIPHHD 300
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
E+ ++ + LF + + E F + V S + +++ LV
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603
Query: 123 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+ P+L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661
Query: 178 VDEDNPYDRLV 188
+N YD ++
Sbjct: 662 TVPENRYDEVM 672
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/357 (20%), Positives = 139/357 (38%), Gaps = 55/357 (15%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++ + + SFW PYI L + P + + A L GSPT ++ + +
Sbjct: 123 LLQENRRQNSFWKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTF 176
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 118
+ EY+ L A FQ+Y Y T+ F V + ++++R F
Sbjct: 177 QEEYDNLKE----AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPV 214
Query: 119 ---------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQP 167
LVPL + + + DA + R + GE + G
Sbjct: 215 YIGENEQQQLLVPLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQWS 273
Query: 168 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGR 224
N +NYGF NP ++ + L+ D ++ K + N G V R
Sbjct: 274 NKYFFMNYGFASLTNPMNQFDFDICLDRNDRMFKMKVELTGGNICWGNRLVNETDHDTFR 333
Query: 225 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYFKA 278
+ A + + ++ + D +++ + + P +P +E+A L D +
Sbjct: 334 QSLAT---VRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLLVS 390
Query: 279 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 335
LA + +T+ +D+ L D + +R L EK+++ ++V DM++ + D T
Sbjct: 391 ELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
DD + V + + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 42 DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 253
K V R G + VF +H E + +L +LR+ +++ + +I +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160
Query: 254 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 308
LG PVS E +L + +AR L Y T+ ED++ L +L +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216
Query: 309 LVRMEKKMLNACLQVTA 325
L EK++L ++ A
Sbjct: 217 LRLGEKEILEKAVKSAA 233
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 14/204 (6%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-D 68
SFW PYI L ++ PL W +L L GS E + + + R+Y L +
Sbjct: 118 SFWRPYIACL-------PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHN 170
Query: 69 TVWFMAGSLFQQYPY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVP 122
+ A +P + EAFTFE ++ A V + + + + AL+P
Sbjct: 171 KLSVRANPSCSCFPLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIP 230
Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
L + + + + ++ V P K G I +W G + N + L++ GF +
Sbjct: 231 LWDMINHANHPMSTQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGFFFAGH 290
Query: 183 PYDRLVVEAALNTEDPQYQDKRMV 206
D + V +L+ D Y+ K ++
Sbjct: 291 ANDYVNVPFSLDETDSLYKIKALL 314
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 77/189 (40%), Gaps = 18/189 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E +G+ SFW PYI L + A+ P W E ++AYL G+ I E +K
Sbjct: 112 EYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQANVKS 169
Query: 63 EYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHLQKVSL 114
E+ + L F A + Q Y FT F+ + V A Q L
Sbjct: 170 EFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELLPPGCQ 229
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQPNS 169
F+++ PL ++ A D A QLV Y+ GE + G + NS
Sbjct: 230 LDDFSILQ---PLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYGLKTNS 286
Query: 170 KLLINYGFV 178
+LL+ YGF+
Sbjct: 287 ELLLAYGFI 295
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/210 (21%), Positives = 78/210 (37%), Gaps = 36/210 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E+ G +S W PYI+ L + + + +LW EL + S E +E E
Sbjct: 110 IMEQHLGNESGWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQA 164
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K+E+ L LF + A + V + QK
Sbjct: 165 KKEFLALKPALDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK--------- 215
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
D +++ DR Y GE +++ G N+ L +N+GF
Sbjct: 216 ----------------------DVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFTLA 253
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
N YD+ ++ + +DP Y+ K + Q++
Sbjct: 254 RNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 86/197 (43%), Gaps = 23/197 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEG 59
+YE+ +G+ SFW PYI L ++ +L P+LWS+ +L GSP ++ ER
Sbjct: 161 LYERARGETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLA 216
Query: 60 IKREYNELDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL- 114
I RE E + AG L + P I A + + +S ++LQ L
Sbjct: 217 ICREDYEA-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLR 272
Query: 115 -----ARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQ 166
ALVP L SS + D L R Y GE + GP
Sbjct: 273 PGSFAEDTIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQVFDSYGPS 332
Query: 167 PN-SKLLINYGFVDEDN 182
+ S+LL++YGFVDE+N
Sbjct: 333 CSPSRLLLDYGFVDEEN 349
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/328 (23%), Positives = 130/328 (39%), Gaps = 49/328 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEILERAEG 59
M+E Q K SFW Y+ + G PL W E E G P +I+ E
Sbjct: 191 MWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDIIVDREN 244
Query: 60 IKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKV 112
I+ EY E ++ + GSL E ++ E FK+ AFV S +
Sbjct: 245 IETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFTEDEESP 294
Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
S+ VP+ +L + S A L +Q++ R K GE + G N++LL
Sbjct: 295 SM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKLGNTELL 347
Query: 173 INYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
YG+V+ N YD L++ + +DP K + R G V F +
Sbjct: 348 QMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAEVDAFFM 407
Query: 221 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQLADYFK 277
D++ +L++ + SD E++ ++ + P S + + L + + K
Sbjct: 408 FDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSKKTE--K 464
Query: 278 ARLAGYPATLSEDEAMLTDYNLHPKKRV 305
L ++ED+ + N + +K V
Sbjct: 465 NSLGMTVIDITEDDTEMDIENFNKRKNV 492
>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 699
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 7/199 (3%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G + FW PYIR L Q G ++ +P + +L +L G+ A +R E +K +Y
Sbjct: 121 RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 175
Query: 66 ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
+ T AG Y +D+ A + I + V S V +S + L+P+
Sbjct: 176 KGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGVFPETDLSEEKLSVLLPI- 234
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ + K A D + VV AG+ I GP+ N +L++NYGF NP
Sbjct: 235 IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 294
Query: 185 DRLVVEAALNTEDPQYQDK 203
D +V P Y K
Sbjct: 295 DHRIVSLRAPPGSPLYMAK 313
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 88/195 (45%), Gaps = 27/195 (13%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
QG+ FW PYIR L + L++ +PL + +L +L G+ +R E +K Y
Sbjct: 105 QGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRMELLKEAYE 159
Query: 66 ELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
+ V + + FQ +Y +D I + AF+ ++ +AF + + VS+
Sbjct: 160 --NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID---LPEDGVSV 214
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L+P L+ + K A V +V AG+ I GP+ N +L++N
Sbjct: 215 -----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPRNNEQLMMN 268
Query: 175 YGFVDEDNPYDRLVV 189
YGF DNP D +V
Sbjct: 269 YGFCLPDNPCDYRIV 283
>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
1015]
Length = 668
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 7/199 (3%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G + FW PYIR L Q G ++ +P + +L +L G+ A +R E +K +Y
Sbjct: 95 RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 149
Query: 66 ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
+ T AG Y +D+ A + I + V S V +S + L+P+
Sbjct: 150 KGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGVFPETDLSEEKLSVLLPI- 208
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ + K A D + VV AG+ I GP+ N +L++NYGF NP
Sbjct: 209 IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 268
Query: 185 DRLVVEAALNTEDPQYQDK 203
D +V P Y K
Sbjct: 269 DHRIVSLRAPPGSPLYMAK 287
>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
Length = 625
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 7/199 (3%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G + FW PYIR L Q G ++ +P + +L +L G+ A +R E +K +Y
Sbjct: 47 RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 101
Query: 66 ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
+ T AG Y +D+ A + I + V S V +S + L+P+
Sbjct: 102 KGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGVFPETDLSEEKLSVLLPI- 160
Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
+ + K A D + VV AG+ I GP+ N +L++NYGF NP
Sbjct: 161 IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 220
Query: 185 DRLVVEAALNTEDPQYQDK 203
D +V P Y K
Sbjct: 221 DHRIVSLRAPPGSPLYMAK 239
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 83/185 (44%), Gaps = 14/185 (7%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G++SFW PYI+ L + A+ P W E E L G+ + + + + +KRE+
Sbjct: 98 RGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIRDDLKREFR 155
Query: 66 ELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLAR 116
E + + L + Y+ F+ F+ + V ++ L + VS+
Sbjct: 156 EAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQALPEDVSVDD 215
Query: 117 RFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
L+PL G +A + + A A QL V R ++ G+ I P+ N++LL+
Sbjct: 216 FSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSPKTNAELLL 275
Query: 174 NYGFV 178
YGF+
Sbjct: 276 GYGFM 280
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 140/351 (39%), Gaps = 51/351 (14%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY 64
Q + S W PY+ L + ++S + WSE+EL L S +I AE + E+
Sbjct: 95 QQEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEH 147
Query: 65 -------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-----CVVHLQKV 112
N + S+ Y +DIP K+ +S +V +
Sbjct: 148 ISPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPESPEDGDDLVSDNEE 199
Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
+++PL L A + A L ++ +++ +P GE I+ G P S LL
Sbjct: 200 EENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDYGQLPRSDLL 259
Query: 173 INYGFV-DEDNPYDRLVVE-------AALNTEDP-------------QYQDKRMVAQRNG 211
YG++ D+ YD V E A+L+TE P + + + +AQR G
Sbjct: 260 RRYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLEQRVELAQREG 317
Query: 212 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 271
H G + +I D L L + D + ++ +S + S V
Sbjct: 318 VYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRSKLATSLVGQI 377
Query: 272 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
L ++R Y T+ D+A+L NL +KR+A ++ EK +L +Q
Sbjct: 378 LTKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAIQ 428
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 95/219 (43%), Gaps = 29/219 (13%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEILERAEGIKREYN 65
++ FW PYI+ L G + +PLL+ E +LA+L + A R + K Y
Sbjct: 149 EEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRERRLQIWKVNYE 203
Query: 66 ELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQSCVVHLQKVSLA 115
+ A S+ Q Y +D+ A T I +AF A + S + LQ
Sbjct: 204 K-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLASVIPKLQTAEEG 255
Query: 116 RRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
R ++ L P + A + K K A D++ LVV +AG+ + GP+ N +L++
Sbjct: 256 DRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNNYGPRNNEQLMM 313
Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
NYGF DNP + VV + P Q K Q K
Sbjct: 314 NYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 88 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 144
E F+ F+ A++ V S CV + L + F LVPL + +SS C D A
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 191
+L + RP K GE IV G + L YGF++ NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERIDLTA 227
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 78/352 (22%), Positives = 149/352 (42%), Gaps = 51/352 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M+E +G +S W PY+ L + E+P+ WS EL L S T+ ++ +A+
Sbjct: 550 MFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV-GKADAE 601
Query: 61 KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 107
+ + ++ V +F Q Y D + ++ F+ + +
Sbjct: 602 EMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEEDEEDEEE 661
Query: 108 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
++ +VP+ +L ++ A + DDA+ + R KAGE I+ + GP P
Sbjct: 662 WVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEILNYYGPHP 720
Query: 168 NSKLLINYGFVD-EDNPYD------RLV---VEAALNTEDPQYQDKRMVAQRNGKLSVQV 217
NS+LL YG+V + + YD +LV + A+L + Q + R + V
Sbjct: 721 NSELLRRYGYVTPKHSRYDVVELPWKLVENSLAASLGLSEQQLDNAREYLDMDEIEDTFV 780
Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSE-----MQSVISSLGPICPVSPCMER------ 266
RE + + + SD E ++ ++ ++ + P S +R
Sbjct: 781 L----DRESDEPNPDGTFTGSARFSDIPEDLRDQLKLLLKAVRKVDPSSVADKRKRDEIQ 836
Query: 267 -AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
+VL + D ++ YP T+ EDE +L+ ++ ++R A + EK+++
Sbjct: 837 HSVLIKALDALASQ---YPTTIPEDERILSGSDISERQRAAVTVRLGEKRLI 885
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 144/349 (41%), Gaps = 46/349 (13%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ +K + S W PY+ L + S + WSE ELA L G+ + L +
Sbjct: 128 LIREKLKEDSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYL 180
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+RE+ +++ + Q +P + T + F AF ++S + + L
Sbjct: 181 RREFLKVEEEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR---GQNLVL 230
Query: 121 VPLG-----------------PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 159
+PL P + Y K A L + D L KAGE +
Sbjct: 231 IPLADLCNFLHTWLLDQVNHSPDITIEDGVYEIKG-AGLFSRDLIFSLRSPISLKAGEQV 289
Query: 160 VVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
++ N++L ++YGF++ + + + ++ DP + DK +A+ NG + F
Sbjct: 290 LIQYNLNLSNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIADF 349
Query: 219 H-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLAD 274
V + L + LG +D+ ++S+ + G + PVS E + + D
Sbjct: 350 DIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVVRD 408
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 323
K+ L+GY T+ EDE L L+P+ +A + EKK+L ++
Sbjct: 409 ACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 91/205 (44%), Gaps = 31/205 (15%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E K G+ S W PYI L Q G + S + WSE EL + S E + + I++
Sbjct: 124 EWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMIQQSSVYQETINQKAQIQK 178
Query: 63 EYNELDTVWF-MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC-VVHLQKVSLARRFAL 120
++ + V + +LF+ DI + F + +C +V + + +L
Sbjct: 179 DFLAIKPVLHHFSENLFK----DISLKEF-----------MHACALVGSRAWGSTKGLSL 223
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--------AVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
+P + + ++L +D ++++ DR Y GE +++ G PN+ LL
Sbjct: 224 IPFAD-FVNHDGFSDSVLLGDEDKQLSESSSTLEVIADRNYAPGEQVLIRYGKFPNATLL 282
Query: 173 INYGFVDEDNPYDRLVVEAALNTED 197
+++GF N YD++ ++ + D
Sbjct: 283 LDFGFTLPYNIYDQVQIQVNIPHHD 307
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VPL L A S A L + + + + GE I PN+ LL YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319
Query: 179 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 220
D++NP+D RL E +L+ DPQ Q+ + K +++V +
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378
Query: 221 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
+ RE KE + D L + +S E Q+ S G + P M + R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435
Query: 280 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
L Y T+ D+ +L D +L ++++ + VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/200 (23%), Positives = 83/200 (41%), Gaps = 17/200 (8%)
Query: 5 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 64
++ KKSFW Y+ L + +P+ + +L L GSP L+ I R+Y
Sbjct: 174 RKDKKSFWHSYLTTL-------PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226
Query: 65 NELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFAL 120
++ ++ P I + FT+E ++ A ++ Q+ V S AL
Sbjct: 227 AYFKKLFQLSND-----PASVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSAL 280
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
+PL S + + D Y A E + ++ G + N+ L++ GFV
Sbjct: 281 IPLWDMFNHRSGRLSTDFVKSSNVCVCYADGDYAADEQVYIFYGVRTNADFLVHNGFVYP 340
Query: 181 DNPYDRLVVEAALNTEDPQY 200
DN +D + + ++ DP Y
Sbjct: 341 DNEHDAVKIRLGVSRSDPLY 360
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 91/218 (41%), Gaps = 35/218 (16%)
Query: 5 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 64
K KSFW PY+ L ++ + + +SE EL L S K + R +GI+R Y
Sbjct: 123 KLNDKSFWYPYVSVLPKE-------FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHY 175
Query: 65 NELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL- 120
N T G + + P T + +T E+F A SCV +R F+L
Sbjct: 176 NSTFTRLSNRG-IAEFSPTSTQTLQQKGYTLELFTWAL----SCV-------WSRAFSLS 223
Query: 121 ------VPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSK 170
VPL A SK K D + + GE I G P +S+
Sbjct: 224 DSDGGMVPLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSSQ 283
Query: 171 LLINYGFV-DEDNPYDRLVVEAAL-NTEDPQYQDKRMV 206
+L++YGFV D P D + + + + ++P Q K+ +
Sbjct: 284 MLMDYGFVFDHGTPSDNVAISVPIFHPDEPNIQVKQSI 321
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 80/198 (40%), Gaps = 32/198 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
+YE QG+ S W PY+ L ++P+ WS TEL+ L S A++ RAE
Sbjct: 114 IYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-GRAEAD 165
Query: 60 ---------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIFKQAFV 100
+ R + E V+F G L + I AF E
Sbjct: 166 RMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLENDDSDND 222
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
++ +VP+ +L ++ A + DDA+ RP +AG+ I+
Sbjct: 223 EADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRAGDEIL 281
Query: 161 VWCGPQPNSKLLINYGFV 178
+ GP PN +LL YG+V
Sbjct: 282 NYYGPLPNGELLRRYGYV 299
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
R AL+P+ L ++ C + +A + DR Y+AGE + G N LL YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650
Query: 177 FVDEDNPYDRLVVEAAL 193
FV DNP+D+L ++ L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 93/203 (45%), Gaps = 26/203 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ + K SFW YI+ L + + +P+ ++ E+ +L K ++ + E I
Sbjct: 116 ERSRQKDSFWYNYIKVLPK-------SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINT 167
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR- 116
Y EL ++ + S F + F + F+ A+ V + V++ + +S+ R
Sbjct: 168 AYRELKNLFSILESTFVSFK-----GIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERD 222
Query: 117 RFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
+AL P LL +++ + KA V ++ K + + ++ GP N KL I
Sbjct: 223 HYALAPF-LDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMFIYYGPHDNVKLFIE 281
Query: 175 YGFVDEDNPYDRLVVEAALNTED 197
YGFV N ++ VVE L+ ED
Sbjct: 282 YGFVLPQNQHN--VVE--LDFED 300
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 77/192 (40%), Gaps = 19/192 (9%)
Query: 1 MYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 51
MYE K +S W Y L L + W++ EL++LT G
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197
Query: 52 EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 107
+ R E+NE D + GS+ F P I +F EI
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256
Query: 108 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
++ +++PL L A + KC A L D+++++ +P K GE + G P
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTYGELP 316
Query: 168 NSKLLINYGFVD 179
NS+LL YG+V+
Sbjct: 317 NSELLRKYGYVE 328
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 21/186 (11%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + +SE EL G+ + + IK +Y +L A LF Q+P P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160
Query: 92 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 138
E +K A V S + + L P +L +SS +C A
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219
Query: 139 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
D + + + Y+ G+ + ++ GP PN++LL YGFV DNP D + + P
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLTTHPMAP 278
Query: 199 QYQDKR 204
Y+ K+
Sbjct: 279 FYEQKQ 284
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 74/345 (21%), Positives = 129/345 (37%), Gaps = 45/345 (13%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
S W PYI+ L +Q + W E GSP + G R+Y
Sbjct: 164 SEWGPYIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY----- 211
Query: 70 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLG 124
++ + + +P +FT++ F A VQS Q A AL+P
Sbjct: 212 -CYLYSMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFW 266
Query: 125 P--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
+ SK + + + + +K GE + ++ GP+ N++LL++ GF + N
Sbjct: 267 DFCNHSSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTN 326
Query: 183 PYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
+D E L + ++ ++ +R + V V E +++P+ R+ Y
Sbjct: 327 LHDSYPFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-Y 385
Query: 241 VSDTSEMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGY 283
E +++ L P+ ++ E L K +LA Y
Sbjct: 386 ALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASY 445
Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
P TL EDE L N +R EKK+L+ ++ +I
Sbjct: 446 PTTLEEDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 99/242 (40%), Gaps = 57/242 (23%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
MYE+ +GK S W Y++ L Q P LWSE E+ L G+ + E
Sbjct: 58 MYERSKGKGSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLL 110
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRF 118
+K ++ E L ++ P + P + FTFE +++A +S V ++ +
Sbjct: 111 MKEDWEE------NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGY 160
Query: 119 ALVPLGPPLLAYSSKCKA-----MLAA-------------VDDA---------------V 145
+VPL ++ K A ML A +DD +
Sbjct: 161 GMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVL 217
Query: 146 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDK 203
++V+ + AG I G N+ LL YGF + +NP+D L ++ L ++Q K
Sbjct: 218 EMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKK 277
Query: 204 RM 205
R+
Sbjct: 278 RV 279
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 82/210 (39%), Gaps = 44/210 (20%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
+YE G S W PY L + + + W+E ELA L S ++
Sbjct: 91 LYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKVGKESADE 143
Query: 55 ---------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFT 91
ERA+ +E L+ + M GSL Y +D+ T
Sbjct: 144 AFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKMGSLIMAYAFDVEPATPT 203
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
E+ ++ A + L K +VPL L A + +C A L D +++ +
Sbjct: 204 KEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCLEMKALK 255
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDED 181
P +AGE I GP P S LL YG+V ++
Sbjct: 256 PIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 69/342 (20%), Positives = 130/342 (38%), Gaps = 32/342 (9%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++ + K SFW PYI L + P + + A L GSPT ++ + +
Sbjct: 124 LLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVINQRKVF 177
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K EY L A FQ+Y Y T++ F + + S +Q ++ L
Sbjct: 178 KEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEKEQQQLL 226
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFV 178
VPL + + + DA + R + GE + G N +NYGF
Sbjct: 227 VPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFA 285
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAISDMLPY 235
NP ++ ++ LN D + K + + N G V R+ A + +
Sbjct: 286 SLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLAT---VRF 342
Query: 236 LRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGYPATLSE 289
++ + D +++ + + P + +E+A L L + +T+ +
Sbjct: 343 TQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNFASTIED 402
Query: 290 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
DE L D ++ L EK+++ + + M+ ++
Sbjct: 403 DERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 112/271 (41%), Gaps = 24/271 (8%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + +++ EL G+ + + I+ +Y +L + QYP P + FT
Sbjct: 107 SSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRDLVV------RVLVQYPDLFPLDKFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 148
+K A AV S + Q + L P +L +SS +C A+ D + ++
Sbjct: 161 LHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD-MLNHSSESKQCHVYDASSGD-LSVL 218
Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
+ Y+AG+ + + G PN +LL YGF+ NP D + A + P ++ K+ +
Sbjct: 219 AGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWA 278
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPC 263
G S + ++D LP YLR+ + D S++ S+ +S
Sbjct: 279 LAGLDSTCTISL-------TLTDPLPKNVIRYLRIQRL-DESDLASIALGQAADEKISNS 330
Query: 264 MERAVLDQLADYFKARLAGYPATLSEDEAML 294
E VL L + + L + L + E L
Sbjct: 331 NEVQVLQSLVESIASLLGSFGTRLEKLEEQL 361
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
KKS W PY+R L R + + + W EL + S E +ER E +E++ +
Sbjct: 38 KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
LF + E F A S +V + +R +L+P
Sbjct: 93 KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137
Query: 128 LAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
L + ++L D +++ DR Y GE ++V G N+ L +N+GF N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197
Query: 186 R 186
+
Sbjct: 198 Q 198
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 85/361 (23%), Positives = 138/361 (38%), Gaps = 51/361 (14%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA-EILERAEGIKREYNELD 68
S WLPY+ L + PL+W+ + L G + ++ER E I+ N
Sbjct: 146 SHWLPYLASLPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTK 195
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----------- 117
V G F PT A T + + A ++ S K SL +
Sbjct: 196 VVENACGHYF-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWI 248
Query: 118 -FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
+ + L P L ++ K + + V + G ++ GP+ N LL N
Sbjct: 249 GLSEICLFPILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSN 308
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ--RNGKLSVQVFHVHAGREKEAISDM 232
YGFV E+NP D V L EDP Y K+ V + LS VF + + ++M
Sbjct: 309 YGFVIENNPEDYFKVFLGLQQEDPLYTAKKAVLEVVSENDLSHLVF-----LKDDLPTNM 363
Query: 233 LPYLRLGYVSDTSEMQSVISSLGPICPVSP-----CMERAV--LDQLADYFKARLAGYPA 285
+ R+ V+++ E+ + + L VSP C+ + L L + ++L +
Sbjct: 364 ISISRV-LVANSWELAILEARLTSSTKVSPHSTPICLHNEILALSTLYNLLNSKLKVLKS 422
Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 345
T E ++ P A +LV + + + L+ L +T S C Y
Sbjct: 423 TTPNYEQDTIEF---PNDDEARKLVHIYRAGQQSILEHAIQNTRTLASMT-SSCHKQYET 478
Query: 346 L 346
L
Sbjct: 479 L 479
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 95/213 (44%), Gaps = 35/213 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++ K+QG +S P+I +L G PL WS+ +LA L A++ E+
Sbjct: 146 LWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQVKEQ---- 194
Query: 61 KREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 102
+RE+ L DT+ W+ G + + + P T + V
Sbjct: 195 QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSDRLRLAGLV 253
Query: 103 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIV 160
+ VV L + SL +++A+ PL L ++S ++ ++ D+ +V R +K GE +
Sbjct: 254 AALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRDFKKGEQVF 310
Query: 161 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
+ G Q N L+ YGF + DNP D V+ L
Sbjct: 311 ITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 130/327 (39%), Gaps = 54/327 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
+YE G S W PY L ++ + W+E EL L S +I
Sbjct: 91 LYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANE 143
Query: 55 ---------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFT 91
ERA+ + +E + + M GSL Y +D+ A +
Sbjct: 144 VFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AIS 202
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
+ + A + L K +VPL L A + +C A L D +++ +
Sbjct: 203 DKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALK 255
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDKRMV 206
P +AGE I GP P S LL YG++ E+ YD + + A L ++ D + +KR+
Sbjct: 256 PIQAGEEIFNDYGPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEKRIE 315
Query: 207 AQRNGKLSVQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 263
++ + + A +E++S L L + + E + + S G + P +
Sbjct: 316 YLDEQEIVDTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PKAEK 373
Query: 264 MERAVLDQLADYFKARLAGYPATLSED 290
M + L +AR+A YP TL +D
Sbjct: 374 MTGKAAEILYKIVQARIAQYPTTLEQD 400
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 83/186 (44%), Gaps = 20/186 (10%)
Query: 12 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
W P +R L + S L W+ +L + G + E +K +Y+ L
Sbjct: 77 WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLANRLREYRRSVKVQYDAL---- 125
Query: 72 FMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQK-VSLARRFALVP----LGP 125
F A L +Q P P AF + F++A+ S + +Q SL R +VP
Sbjct: 126 FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKVQDPDSLQIREVIVPGVFLCNH 183
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
L A+S + ++ A +L + R GE+I + G N+ LL+ YGF E+NPYD
Sbjct: 184 SLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYGRLDNADLLMFYGFSLENNPYD 242
Query: 186 RLVVEA 191
R+ + +
Sbjct: 243 RVSLHS 248
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L D +Q++ +P GE I G PNS LL YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119
Query: 179 DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
D +P D + + A E + YQD+ G V V +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174
Query: 229 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 288
+ D + L + + + + S P P ++ + L + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231
Query: 289 EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 322
E E ML N L+ K+R A + E+++L+ L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/355 (23%), Positives = 138/355 (38%), Gaps = 51/355 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE QG+ S W Y+ L + A E+P+ W+ EL L G+ E + + E
Sbjct: 114 LYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEGTSLTTEKIGKKESD 166
Query: 61 K--RE--------------------YNELD--TVWFMAGSLFQQYPYDIPTEAFTFEIFK 96
+ RE NE D ++ GS Y +D+ E E +
Sbjct: 167 RMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYAFDLENEEEQSEDEE 226
Query: 97 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 156
++ + SL +VP+ +L +++ A + D + AG
Sbjct: 227 DGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGDQLQVTSLRESIPAG 276
Query: 157 ESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTEDPQYQDKRMVAQRN 210
I+ + GP P+S+LL YG+V E + YD +V AL E +D +R
Sbjct: 277 SEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEELKLSEDTIADIERK 336
Query: 211 GKLSVQVFHVHAGREKEAIS----DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 266
+ ++ F V E S P LR + ++ + +L P E
Sbjct: 337 LESELEEFFVIERDAGEPSSYGTLTQPPVLREISTELEEQTKAFLKALKKRDPKRKRSET 396
Query: 267 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
L + RL YP + +DE++L+ L + R+A ++ EK++L L
Sbjct: 397 ICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRLGEKRLLQEAL 451
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/357 (22%), Positives = 147/357 (41%), Gaps = 38/357 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M+E+ + SF+ P+I L + P+ WSE+EL L G+ + +
Sbjct: 89 MHERSKRSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQL 141
Query: 61 KREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 117
+R++ + ++ + +P +PT T E + A + S + + R
Sbjct: 142 QRDFENIHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR 193
Query: 118 FALVPLGPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKL 171
L P + + + +L + D+ Q++ V + AG ++ + G N+KL
Sbjct: 194 -VLCP-AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKL 251
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAIS 230
L +YGFV ++N + + DP + K+ V N Q + E +
Sbjct: 252 LFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFENDVDE 311
Query: 231 DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 290
+L LR+ +++ E++ + +S E AV + L + + +LA + TL ED
Sbjct: 312 RLLATLRVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFATTLEED 369
Query: 291 EAMLTDYNLHPKKRVATQL-VRMEKK--------MLNACLQVTADMIMLLPDVTVSP 338
EA+L + R++ + VR+E K L QV A + + P T P
Sbjct: 370 EAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTTRP 426
>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L + +++V +P +AGE I G PNS LL YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316
Query: 179 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
D NP D + V A L ++ +Y + V + VF +
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374
Query: 225 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 284
+ E +++ + RL ++S+ +++ S P V P VL D ARL YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430
Query: 285 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 322
++ EDE +L+ +L+ K V +L EK++L L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 95/237 (40%), Gaps = 48/237 (20%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYE ++ SFW PY L R+ +P+ W++ +L L G+ ++I + E
Sbjct: 90 MYEIEK-PDSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESE 140
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR- 117
+ +NEL+ + ++YP + T E+F + + S LQK
Sbjct: 141 ELFHNELEPI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNK 193
Query: 118 -----------------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 154
++VP+ L + A L D++Q+ + K
Sbjct: 194 EEEKEEEEEEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIK 253
Query: 155 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 205
GE I G N+ LL YGFVDE N +D L+VE +D +++++
Sbjct: 254 EGEQIYNTYGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 122/305 (40%), Gaps = 34/305 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MY K +KSFW PY+ L + + L +S+ EL L S K + R +GI
Sbjct: 125 MYSK-LNEKSFWHPYVTVLPE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGI 176
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-F 118
+R Y S F + +P E ++ Q F SCV + SLA
Sbjct: 177 ERHYE----------STFSRLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDG 224
Query: 119 ALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLIN 174
+VPL A SK K + D + GE I G P +S++L++
Sbjct: 225 GMVPLADMFNAEDRSKSKVLPKVTDTTLDYYASDDIAEGEQIFTPYGVYKPLSSSQMLMD 284
Query: 175 YGFV-DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 232
YGF+ DE D + + + ++P K+ + + N ++ +VF + A D+
Sbjct: 285 YGFIFDEGTVSDNVAITVPVFHNDEPNLSTKQEILEENDIIN-EVFLLQKTDPLPA--DL 341
Query: 233 LPYLRLGYVSDTSEMQSVISSLGP---ICPVSPCMERAVLDQLADYFKARLAGYPATLSE 289
L Y R+ + Q+ L P P++ E+ L L + L Y L
Sbjct: 342 LLYARVKNLIAKECDQAKKHFLSPNTRNTPLNTRNEKVSLRFLENLIHRYLDSYGTNLES 401
Query: 290 DEAML 294
D+ +L
Sbjct: 402 DKNLL 406
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 77/347 (22%), Positives = 136/347 (39%), Gaps = 52/347 (14%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G++S W PY+ L + P+ E ++ L P + + E+ +
Sbjct: 116 EKHAGERSLWKPYLDVLPK-------TYSCPVC-LEQDVVQLLPEPLRKQAQEQRTAVHE 167
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
Y + SLF + I F + + A+ + + +++ K S F+L P
Sbjct: 168 LYMSSKAFFSSLQSLFAENTATI----FNYSALEWAWCTINTRTIYM-KHSQRECFSLEP 222
Query: 123 ----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L P LL +S + KA ++ + K E + + GP N +LL+
Sbjct: 223 DVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQCKKYEEVFICYGPHDNQRLLLE 282
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
YGFV DNP+ + V + + DK Q+N KLS+ H D+L
Sbjct: 283 YGFVAVDNPHSSVYVSSDTLLKYFPSLDK----QKNAKLSILKEH-----------DLLE 327
Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM- 293
L G+ + + + + L C RA+ L D AR +E +A+
Sbjct: 328 NLTFGWDGPSWRLLTALKVLSLGGDEFTCWRRAL---LGDVISAR--------NEQQALN 376
Query: 294 ----LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 336
+ + + + V Q+ +M++ N Q+T + L D+ +
Sbjct: 377 ITTKICHFLIEETQHVLLQISQMKRDKENLKEQLTLVEALRLEDLKI 423
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L + +++V +P KAGE I G PNS LL YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323
Query: 179 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 224
D NP D + V A L + K + +R + VF +
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381
Query: 225 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 284
+ E +++ ++RL + ++ S P P +++ VL D + RL YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437
Query: 285 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 322
TL EDEA+ L KR A + EK++L L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 93/225 (41%), Gaps = 25/225 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + P+ W E E+ L P +A+ E+ ++
Sbjct: 117 EKHAGDQSSWKPYLETLPK-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+ + LF + +I FT+ A+ V + V+L+ L R F+ P
Sbjct: 169 FFTSFRDFFSSLQPLFSEAVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEP 223
Query: 123 ----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L P LL +S + KA ++V + E + + GP N +LL+
Sbjct: 224 DTCALAPYLDLLNHSPDVQVKAAFNEKTRCYEIVAVSSCRKHEEVFICYGPHDNHRLLLE 283
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
YGFV NP+ + V + + DK+M N K+S+ H
Sbjct: 284 YGFVSTRNPHACVYVSRDILVKYLPSTDKQM----NKKISILKDH 324
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 84/193 (43%), Gaps = 20/193 (10%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+GK SFW PYI L + A+ P W+E ++ L G+ I E +K EY
Sbjct: 115 KGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYK 172
Query: 66 ELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARR 117
+ + G + Q Y+ FT F+ + V +S +++++ S
Sbjct: 173 QARKILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDD 232
Query: 118 FALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
F+++ PL ++ + L + A +L+ + Y G+ + G + NS+LL
Sbjct: 233 FSILQ---PLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELL 289
Query: 173 INYGFVDEDNPYD 185
+ YGF+ NP D
Sbjct: 290 LGYGFI--INPKD 300
>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 704
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 86/195 (44%), Gaps = 27/195 (13%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G + FW PYIR L + L++ +PL + +L +L G+ +R K +Y
Sbjct: 132 KGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRMGVWKEKYK 186
Query: 66 ELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
T AG FQ QY +D+ + AF+ ++ +A V+ + VS+
Sbjct: 187 NGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LPEDGVSV 241
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L+P L+ + K A V VV +G+ I GP+ N +L++N
Sbjct: 242 -----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPRNNEQLMMN 295
Query: 175 YGFVDEDNPYDRLVV 189
YGF DNP D +V
Sbjct: 296 YGFCLPDNPCDYRIV 310
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 82/193 (42%), Gaps = 13/193 (6%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M+E+ +G +S W PY+ L PL W E L G+ ++L RA+
Sbjct: 125 MWERWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHP 178
Query: 61 KREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
++ +W + G ++P +P +E+++ A AV S L +
Sbjct: 179 SDAPTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQ 235
Query: 119 ALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
A+VP+ L + L + +Q++ R AG +V G N++LL YG
Sbjct: 236 AMVPVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYG 295
Query: 177 FVDEDNPYDRLVV 189
FV+ N Y+ + V
Sbjct: 296 FVERANRYNHIPV 308
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 94/221 (42%), Gaps = 25/221 (11%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L D +++V +P K GE I G PN++LL YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304
Query: 179 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
D NP D + ++A L V K + + + G E I
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLII-SAVSSTPEAVKDDEAKERID-WWLEEGGEDVFIL 362
Query: 231 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
D M+ +++L ++ ++ S P +E + D L + RLA
Sbjct: 363 DYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEKRLAE 418
Query: 283 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 322
YP T+ D+A+LT D L+ K + +L EK++L+ LQ
Sbjct: 419 YPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 82/199 (41%), Gaps = 17/199 (8%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YEK + SFW P+ L + + ++ TEL L G+ AE L+ +
Sbjct: 820 IYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----V 867
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K + + F L +QYP P F++E F A S + L K+ L
Sbjct: 868 KEHLQSIRDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCL 924
Query: 121 VPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP+ + + + + D ++V I + G N +L + YGFV
Sbjct: 925 VPMADMINHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRELALYYGFV 984
Query: 179 DEDNPYDRLVVEAALNTED 197
+DNPYD +++ L ED
Sbjct: 985 IQDNPYDSMLIGFDLPDED 1003
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 126/308 (40%), Gaps = 40/308 (12%)
Query: 31 ESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
+S + WS+ EL L GS +I E AEG Y+ L +F+ P EA
Sbjct: 110 DSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSILVPYLSKHADIFK------PLEA 161
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSL---------------ARRFALVPLGPPLLAYSSKC 134
++ ++ + + S H+ +VP+ L A S
Sbjct: 162 YSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAAYHEVETVGMVPMADVLNAKSGSA 221
Query: 135 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL- 193
A L DA+ + + AGE I PN+ LL YG VDE N D + + A L
Sbjct: 222 NACLVYHPDALVMTTTKEIAAGEQIFNTYNDPPNADLLRRYGHVDEVNLNDNVEISADLI 281
Query: 194 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVIS 252
+D +R L ++ V+ + E + D + +++ + SE + +
Sbjct: 282 GCKD---------LERVDWLLDRLDDVYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQK 331
Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
+ P ++ A ++ + + RLA Y +T+ EDE++L + + A LVR+
Sbjct: 332 A--DDLP-DDVLDEATAMRVREILQMRLAQYSSTIEEDESLLASSTMLTSRSRAALLVRL 388
Query: 313 -EKKMLNA 319
EK++L A
Sbjct: 389 GEKRILAA 396
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 87/195 (44%), Gaps = 23/195 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++E+ +G+ S W PY+ L R + S L + +++L +L G+ +
Sbjct: 113 VHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRAYRNTV 167
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
K EY+ +A S+ + Y + E++T++IF A+ +A ++ + L+
Sbjct: 168 KEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIASRAFTSRVLDAYLSNHP 219
Query: 119 AL-----VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
+L + PL+ +S+ K A ++L V P GE + GP N +
Sbjct: 220 SLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEVHNNYGPLNNQQ 279
Query: 171 LLINYGFVDEDNPYD 185
L+ YGF DNP D
Sbjct: 280 LMTTYGFCIVDNPCD 294
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 121/302 (40%), Gaps = 18/302 (5%)
Query: 34 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 93
+ ++E EL GS + + ++ ++ +L L Q+ P + FT E
Sbjct: 993 IFFAEDELQVCEGSSLHTLTTQLEQRVQDDFRQLLV------QLLSQHRDLFPLDQFTIE 1046
Query: 94 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLVVD 150
+K A + S + + PL +L +S +C A D + ++
Sbjct: 1047 DYKWALCTIWSRAMDFAVSDTTSVRLVAPLAD-MLNHSLDVKQCHAYDPTSGD-LSILAA 1104
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
+ Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K +
Sbjct: 1105 KDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALA 1164
Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAV 268
G S + A + ++L YLR + D +++ + L G V+ E V
Sbjct: 1165 GLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVNDGNEIQV 1221
Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTAD 326
L L D + L G+ L + EA L Y A Q+ E+ +L + D
Sbjct: 1222 LQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRAKKTAED 1281
Query: 327 MI 328
M+
Sbjct: 1282 ML 1283
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 82/198 (41%), Gaps = 19/198 (9%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT--KAEILERAE 58
+ E+ +GK SFW PYI L + +P S T+ A PT ++ + + +
Sbjct: 127 LTERSKGKSSFWYPYINVLPKD-------FTTPAFGS-TKQADFDVLPTIARSRAINQLQ 178
Query: 59 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQSCVVHLQKVSL 114
I+ + ++ F QY ++F + F + ++ C K S
Sbjct: 179 DIRAAFESASCLFEDIERTFPQYRIFFSLDSFVWAWFVINSRSVYIEPSGCEAFDPKAS- 237
Query: 115 ARRFALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
FAL P LL +S ++ A V + ++ Y A + + + GP N LL
Sbjct: 238 -DDFALAPFLD-LLNHSPGAEVTAGFDPVSNCYRIKTLDSYHAYDQVFIHYGPHDNVNLL 295
Query: 173 INYGFVDEDNPYDRLVVE 190
+ YGFV NP+D + E
Sbjct: 296 LEYGFVIPSNPHDAVSFE 313
>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
Length = 472
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/329 (20%), Positives = 133/329 (40%), Gaps = 67/329 (20%)
Query: 31 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 90
++ L + E E+ YL GSP +I+ E + Y++L F + + +
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220
Query: 91 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 142
++ F+ A + + +++ S ++ L P+ PP++ Y + A +D
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280
Query: 143 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGF----VDEDNPYDRLVVE-- 190
+V + + K GE I V +CG S LL++YG+ +D+ + + L+ E
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYLPNQIDDKSCVNVLMEELL 336
Query: 191 AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
+N +DP DK +++ ++ KL +
Sbjct: 337 ETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------------- 367
Query: 246 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYN-LHPK 302
M S+ L I + ++L+ L ++ YP T+ +D+ L +YN L +
Sbjct: 368 -MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEYNQLSAR 426
Query: 303 KRVATQLVRMEKKMLNAC---LQVTADMI 328
++A L EK++L+ LQ D I
Sbjct: 427 SKLAFNLAFQEKQILSNVYTKLQENIDTI 455
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 82/177 (46%), Gaps = 13/177 (7%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S +L++E +L G+ + + I+ ++ L LF Q+P P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 147
E +K A V S + LA ++ L P +L ++S+ K + + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGNLSV 217
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+ + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 218 LAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 88/208 (42%), Gaps = 31/208 (14%)
Query: 15 YIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFM 73
Y++ L R+ G L W E ++ L GSP++ ER + E+
Sbjct: 93 YVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGAIEEIRA---- 141
Query: 74 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 133
++P P + AF + S ++ L + ALVP +L +
Sbjct: 142 ------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPWAD-MLNHKPG 186
Query: 134 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEA 191
C A + V L DR YK GE + G +P+++LLI+YGF E +NP D +
Sbjct: 187 CNAYIDDSGGKVCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGENPDDEYEITL 246
Query: 192 ALNTEDPQYQDKRMVA-QRNGKLSVQVF 218
++ D +Y D + A ++ G V+ F
Sbjct: 247 GIDPND-RYADAKAAALEKIGLRPVESF 273
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 150/378 (39%), Gaps = 70/378 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++ K+QG +S P+I +L G P+LW E ++A L A++ E+
Sbjct: 143 LWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQVKEQQREW 195
Query: 61 KREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ-----AFV 100
++ Y +L D W M S + + P T + + A +
Sbjct: 196 QQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLRTAGLVAVL 253
Query: 101 AVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAYSSKCKAML 138
A + V+ L QK A +++A+ PL L +SS ++ +
Sbjct: 254 AAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNHSSAVQSEV 312
Query: 139 AA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-- 194
A D+ +V R +K GE + + G Q N L+ YGF + +NP D V+ L
Sbjct: 313 AYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYVMTDMLRWL 372
Query: 195 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 254
T R+ A + L+ + V R S+ L +R +D SE + +SS
Sbjct: 373 TAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQAVRFLLAAD-SEAGADVSSF 430
Query: 255 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---------V 305
SP E + + +A+ + L ++L ED A+L+ R V
Sbjct: 431 SKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLSSTGASAGGRKGGTAAAAAV 488
Query: 306 ATQLVRMEKK-MLNACLQ 322
A R+EKK +L A LQ
Sbjct: 489 AAVAFRVEKKRLLTAVLQ 506
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 13/135 (9%)
Query: 77 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYS 131
+ QYP P + FT E +K A V S + + + L FA +L +S
Sbjct: 146 MLGQYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHS 199
Query: 132 SKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
S+ K + A + ++ + Y+AG+ + + GP PN++LL YGFV NP D +
Sbjct: 200 SEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVPGNPNDSYDL 259
Query: 190 EAALNTEDPQYQDKR 204
A + P ++ K+
Sbjct: 260 VLATHPMAPFFKQKQ 274
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 81/189 (42%), Gaps = 22/189 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+P L + +AML DD ++V DR + GE +++ G N+ L++++GF
Sbjct: 228 IPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFA 286
Query: 179 DEDNPYDRL 187
N +D++
Sbjct: 287 LPYNIHDQV 295
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 13/177 (7%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S +L++E +L G+ + + I+ ++ L LF Q+P P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 147
E +K A V S + LA ++ L P +L ++S+ K + + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSV 217
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+ Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 218 FAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 674
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 84/193 (43%), Gaps = 23/193 (11%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+G + FW PYIR L + L++ +PL + +L +L G+ +R K +Y
Sbjct: 102 RGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRMGVWKEKYE 156
Query: 66 ELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 116
T AG QY +D+ + AF+ ++ +A V+ + VS+
Sbjct: 157 NGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LPEDGVSV-- 211
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
L+P L+ + K A V VV +G+ I GP+ N +L++NYG
Sbjct: 212 ---LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNNEQLMMNYG 267
Query: 177 FVDEDNPYDRLVV 189
F DNP D +V
Sbjct: 268 FCLPDNPCDYRIV 280
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/199 (23%), Positives = 93/199 (46%), Gaps = 24/199 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++K G++S W PYI L Q G + S + WS++EL + S E +++ I
Sbjct: 98 LVDQKVGQESKWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQI 152
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
++++ + V + +P + + TF+ F A+ V+S + + +L
Sbjct: 153 EKDFLTIKPV-------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSL 198
Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+P L + +A++ +D ++ DR Y E +++ G N+ LL+++GF
Sbjct: 199 IPFAD-FLNHDGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFS 257
Query: 179 DEDNPYDRLVVEAALNTED 197
N +++ VE +N D
Sbjct: 258 LPYNIHEQ--VEIQINIPD 274
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 102/231 (44%), Gaps = 34/231 (14%)
Query: 119 ALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
A+VP+ L A Y S+ + +D ++++ +P GE I G PNS LL YG
Sbjct: 254 AMVPMADMLNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRRYGH 312
Query: 178 VDE--------DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSVQVF 218
VD NP D +VE A + + Q A+R G V +
Sbjct: 313 VDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDVFIL 370
Query: 219 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
+ E +++P+ RL S + ++ S P V + ++L +A+ +
Sbjct: 371 ETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANALER 422
Query: 279 RLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
RLA YP ++ ED+ +LT+ +L+ K V +L EK++L+ L + +
Sbjct: 423 RLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 80/195 (41%), Gaps = 23/195 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ +G+K+ ++ L R + ++ WSE EL LTG+ E + E K
Sbjct: 77 ERFRGEKATRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKN 129
Query: 63 EYNELDTVW--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+Y L G +++ D +E + A + S L R A+
Sbjct: 130 DYETLTKKMEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAM 182
Query: 121 VPLGPPLLAYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQPNSKLLIN 174
VP + +S+K L A + V + YKAGE I G NSKLL
Sbjct: 183 VPT-FDIFNHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAANSKLLTW 241
Query: 175 YGFVDEDNPYDRLVV 189
YGF +DNPY+ L V
Sbjct: 242 YGFCIDDNPYEELDV 256
>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
[Ostreococcus tauri]
Length = 446
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 137/346 (39%), Gaps = 51/346 (14%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
G++S + P++R L + ++ + W E EL L GS A + + EY+
Sbjct: 85 GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 118
LD F D P EAF TFE F A V S + L+ S +
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185
Query: 119 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
++ + P+L A S K A +AV++ +K + +P+ L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245
Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 233
YGF+ E NP + + V L+ D + K + +R+G + R + D+L
Sbjct: 246 QYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKVRGLD--YDLL 303
Query: 234 PYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 288
R+ D SE+ S S + + +AVL + L GY TL
Sbjct: 304 AAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLITSLDGYGTTLG 359
Query: 289 EDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 329
ED + + +N PKKR L+RM +K L +AD +
Sbjct: 360 EDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 136/343 (39%), Gaps = 43/343 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
E+ +G S W PY+ L V+ PL W E+E+ + L GSPT + + RA +
Sbjct: 145 ERAKGSLSDWAPYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAR 199
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
EY + + Y+ TE F A V + V L ++ +ALV
Sbjct: 200 EEYASIRAAIAADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANV---YALV 252
Query: 122 PL--------------GPPLLAYSSKCKAMLAAV-----DDAVQLV-VDRPYKAGESIVV 161
PL P A + + + AAV D A + V V A ++ V
Sbjct: 253 PLVDLLPVVGAPPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPV 312
Query: 162 WCGP---QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQV 217
C + L ++ G V+ + D L + D Y K+ + + G Q
Sbjct: 313 VCADALGRNAGDLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQA 372
Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 277
F V A R + Y+R V + SE+ +V I VSP E +L L +
Sbjct: 373 FPVFADRMP---LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAR 427
Query: 278 ARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 319
LA Y + E E + L + N+ ++ A +L EK+++NA
Sbjct: 428 EMLAEYENSSEEFELLQLKETNISERQMTAAKLRLGEKRLINA 470
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/275 (21%), Positives = 116/275 (42%), Gaps = 35/275 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK +G +S + P+I L PLLW++ E+ L G+ EIL E ++
Sbjct: 115 EKMKGSESSYKPFIDVL-------PWDSLHPLLWTDEEVDLLEGTYAHREILAFREQVEV 167
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQS----CVVHLQKVSL 114
+ V G ++Q+ I TE T E F + AF +V S + L
Sbjct: 168 ATELFEPVLNPKG--WKQFFQTIETEKMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGL 225
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY-----------KAGESIVVWC 163
R + P L ++ D A++ ++ + + GE + +
Sbjct: 226 EERVVI----PLLDIFNHGSYGPSITFDTALERDNEKGFPVRVADKGKSIEEGEELFGFY 281
Query: 164 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHA 222
G +PN +L YGFV + + +++ +DP + K + + G ++V Q+F +
Sbjct: 282 GDKPNWNMLTTYGFVSPNPKCQETTLSVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR- 340
Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 257
+ + + ++ Y R+ +S+ +++ V ++ G +
Sbjct: 341 -HDTDPMGPLINYFRIREISNEADLTKVQTNYGEM 374
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 79/192 (41%), Gaps = 20/192 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEK K SFW PYIR L +P ++ EL +L + + E+ I
Sbjct: 135 MYEKYLEKGSFWAPYIRTLPD-------TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQI 186
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR--- 116
K+ Y + + Q + + FE FK A+ V + V H + A+
Sbjct: 187 KQSYKSFAKAY---NDVLQDFDCNF-WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPID 242
Query: 117 -RFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
AL PL LL + K + + ++ V Y+ G + + GP N++L +
Sbjct: 243 GNCALAPL-LDLLNHCDKAEMCGRFNSSSKNYEINVITEYQKGTQVFINYGPHDNTRLFL 301
Query: 174 NYGFVDEDNPYD 185
YGFV N ++
Sbjct: 302 EYGFVLPRNVHN 313
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 79/191 (41%), Gaps = 28/191 (14%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+GK SFW PYI L + A+ P W+E ++ L G+ I E +K EY
Sbjct: 115 KGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYK 172
Query: 66 ELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
+ + G + Q Y+ FT F+ + V +S ++++ L+
Sbjct: 173 QARKILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER--------LL 224
Query: 122 PLGP---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
P G PL ++ + L + A +L+ + Y G+ + G +
Sbjct: 225 PEGTKIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKT 284
Query: 168 NSKLLINYGFV 178
NS+LL+ YGF+
Sbjct: 285 NSELLLGYGFI 295
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 90/223 (40%), Gaps = 38/223 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+Y K G + W PY L Q ++S ++W++ EL L GS +I G
Sbjct: 89 LYMMKAGSMTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGA 139
Query: 61 KREYNE---------------LDTV---WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 102
+ +Y E DT + G L Y +D P ++F+ +
Sbjct: 140 EEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDE 194
Query: 103 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVVDRPYKAGESIVV 161
L L + A+VPL L A++ C A L A DD + +P K GE +
Sbjct: 195 DIEHDDLYNEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYN 252
Query: 162 WCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 203
G PN L YG+V +E +D +VE +++ Y +K
Sbjct: 253 TYGELPNCDFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 12/104 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE G +S W YI+ L L+V+ P+LW++ +L L G+ + + E E +
Sbjct: 107 LYEASIGSQSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETV 159
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 104
YN+ ++ L +P F+ + FK+A V S
Sbjct: 160 DATYNK-----YIKSKLIANHPDVFNEHVFSLDNFKRASCLVSS 198
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 6/140 (4%)
Query: 54 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 113
LE E +++ E W + F+ D+P E +T+ + + K
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168
Query: 114 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
R AL+P+ CK +A D +V DR YK GE + + N +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226
Query: 174 NYGFVDEDNPYDRLVVEAAL 193
YGF+ +++ D + ++ A+
Sbjct: 227 EYGFIPDESLDDDVYIDDAV 246
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 112/285 (39%), Gaps = 39/285 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G+KS W PY+ L + P+ E ++ L P + + E+ +
Sbjct: 117 EKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVSLLPEPLRKKAQEQRTKVHE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
Y + LF + I F + + A+ + + +++ K S + F+L P
Sbjct: 169 LYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTINTRTIYM-KHSQRKCFSLEP 223
Query: 123 ----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L P LL +S + KA + ++ + K E + + GP N +LL+
Sbjct: 224 DVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEVFICYGPHDNQRLLLE 283
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
YGFV DNP+ + V +A + DK QRN K+S+ H D+L
Sbjct: 284 YGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSILKDH-----------DLLE 328
Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
L G+ + + + + L C R + L D AR
Sbjct: 329 NLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDVISAR 370
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 80/212 (37%), Gaps = 48/212 (22%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------- 53
+YE G S W PY L + + + WSE ELA L S A+I
Sbjct: 91 LYEHLNGDASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANE 143
Query: 54 ------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
++AE ++ E N T+ GSL Y +DI
Sbjct: 144 AFLGQLVPVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT 201
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
+ V + + +L + ++PL L A + +C A L +++
Sbjct: 202 ------PRKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKA 253
Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
+P KAGE I GP P S LL YG+V E+
Sbjct: 254 LKPIKAGEEIFNDYGPLPRSDLLRRYGYVTEN 285
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 9/175 (5%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + ++E EL G+ + + I+ +Y +L + P P FT
Sbjct: 107 SSIFFTEDELEVCAGTSLYTITKQLKQRIEDDYKDL------IARVLGPRPDLFPLNKFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA--MLAAVDDAVQLVV 149
+K A V S + + + L P +L +SS+ K + A + ++
Sbjct: 161 IHHYKWALCTVWSRAMDFELYDGSSMRLLAPFAD-MLNHSSESKQCHVYDASTGNLSILA 219
Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+ Y+AG+ + + G PNS+LL YGFV DNP D + A + P ++ K+
Sbjct: 220 GKDYEAGDQVYIHYGSIPNSRLLRLYGFVIPDNPNDSYDLVLATHPMAPFFEQKQ 274
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
+ S W PYI L G +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 74 QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
+ LF + + E F + V S + +++ LV + P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172
Query: 128 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229
Query: 183 PYDRLV 188
P +RL+
Sbjct: 230 PENRLI 235
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 81/193 (41%), Gaps = 13/193 (6%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYEK +G++S W PY+ + PL W E L G+ +++ + +
Sbjct: 192 MYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGTAAYDKMMGKVQCP 245
Query: 61 KREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
++ +W + Q++P ++P +++++ A AV S L
Sbjct: 246 ADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASYSFILGDDKYQ--- 302
Query: 119 ALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
A+VP+ L + + L A + ++ R GE +V G N++LL YG
Sbjct: 303 AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNYGELSNAELLRGYG 362
Query: 177 FVDEDNPYDRLVV 189
FV+ N + + V
Sbjct: 363 FVEARNRNNHVQV 375
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
A+VP+ L + A L D+ V +++ KAGE I G PNS LL YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330
Query: 178 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
+D NP D LVVEAA + QD+ V + VF V
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387
Query: 222 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
G + E +M+ RL + +E + + G + P M+ + D ++RL
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442
Query: 282 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 322
YP ++ EDE +L D + L +++A + EK++L L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484
>gi|426197159|gb|EKV47086.1| hypothetical protein AGABI2DRAFT_203917 [Agaricus bisporus var.
bisporus H97]
Length = 491
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 94/221 (42%), Gaps = 25/221 (11%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L D +++V +P ++GE I G PN++LL YG V
Sbjct: 244 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIRSGEQIWNTYGDLPNAELLRRYGHV 303
Query: 179 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
D NP D + ++A L V K + + + G E I
Sbjct: 304 DFLSLPSEGHGNPGDVVEIKADLII-SAVSSIPEAVKDDEAKERID-WWLEEGGEDIFIL 361
Query: 231 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
D M+ +++L + ++ S P +E + D L + RLA
Sbjct: 362 DYEYDLPPVMISFVKLLLLPQADWEKAREKS----KPPKSKLEGILYDILISTLEKRLAE 417
Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 322
YP T+ D+A+LT N P + +VR+ EK++L+ LQ
Sbjct: 418 YPTTIETDKALLT--NDTPLNKKNAIIVRLGEKEILHGILQ 456
>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 692
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 47/200 (23%), Positives = 78/200 (39%), Gaps = 35/200 (17%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
+E+ +G+ S W PY+ L R + S L + +++L +L G+ +K
Sbjct: 114 HEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQAYRNTVK 168
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
EY+ ++ G L E+++++IF A+ + S + +R
Sbjct: 169 EEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTSRVLDAY 214
Query: 122 PLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCGP 165
P L + + ML VD + + L V P GE I GP
Sbjct: 215 LSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEIHNNYGP 274
Query: 166 QPNSKLLINYGFVDEDNPYD 185
N +L+ YGF DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 37/72 (51%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
++PL S+K + V++ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 178 VDEDNPYDRLVV 189
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 44/192 (22%), Positives = 81/192 (42%), Gaps = 16/192 (8%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
+K G+ S W P++ L R +P+ L + E+ I+R
Sbjct: 114 QKLLGEASRWWPFVDSLPR-------TFTTPVFLRRKVFESLP-KDLREEVQTGITFIQR 165
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF---- 118
+ +L + + G + ++ + FT+ F A+ AV + + Q + + +
Sbjct: 166 TFLKLKVL--LGGHVEEEPEVQCLSTGFTWNNFVWAWTAVNTRCIFAQGSNSSSLWEDDH 223
Query: 119 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
AL P L + K A V + +++ + + A E + + GP N +L ++YGF
Sbjct: 224 CALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISYGPHSNRRLFLDYGF 282
Query: 178 VDEDNPYDRLVV 189
V DNP D +VV
Sbjct: 283 VLPDNPNDVVVV 294
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 71/324 (21%), Positives = 135/324 (41%), Gaps = 18/324 (5%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++ S W Y L ++ +S + WSE EL+ L G+ + +
Sbjct: 130 LIRERSRSNSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYV 182
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
+ EY L+ + P + + F I + +AF +++ +V +
Sbjct: 183 ENEYRRLEEEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHS 242
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
A V AY K A L + D L KAG+ + + + + N++L ++YGF
Sbjct: 243 ARVTTEDH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGF 300
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYL 236
++ + + + ++ DP + DK +A+ NG F + ++ + L +
Sbjct: 301 IEPNADRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLV 360
Query: 237 RLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
LG +D ++S+ S G + PVS E + + + K LAGY T+ ED+
Sbjct: 361 ALG-GTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK- 418
Query: 294 LTDYNLHPKKRVATQLVRMEKKML 317
L + L + +A + EK++L
Sbjct: 419 LKEAKLDSRHAIAVGIREGEKQLL 442
>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
Length = 692
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 47/200 (23%), Positives = 78/200 (39%), Gaps = 35/200 (17%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
+E+ +G+ S W PY+ L R + S L + +++L +L G+ +K
Sbjct: 114 HEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQAYRNTVK 168
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
EY+ ++ G L E+++++IF A+ + S + +R
Sbjct: 169 EEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTSRVLDAY 214
Query: 122 PLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCGP 165
P L + + ML VD + + L V P GE I GP
Sbjct: 215 FSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEIHNNYGP 274
Query: 166 QPNSKLLINYGFVDEDNPYD 185
N +L+ YGF DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 99/240 (41%), Gaps = 32/240 (13%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLW-SETELAYLTGSPTKAEIL 54
+ E+ G+ S W I L G G+ + +P+ W E A L G+P A+ +
Sbjct: 242 IAERALGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAI 300
Query: 55 ERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVS 113
+E + R++ L F A ++ D+ P A+T + F++A A S + +Q
Sbjct: 301 AASEKLARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASP 353
Query: 114 LARRFALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 164
+P L P ++ YS D ++L V R AGE + V G
Sbjct: 354 GEPAATCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSYG 406
Query: 165 PQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 223
+ N++LL+ YGF NPYD + + L E R A L++ V AG
Sbjct: 407 AKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLSPHAVRAG 466
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 65/151 (43%), Gaps = 19/151 (12%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 67
S W+ YI L RQ S L W+ EL AYL SP + ++R + YN+L
Sbjct: 162 SSRWISYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL 214
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
+F ++P P E + E F +F + S +V L S+ R ALVP +
Sbjct: 215 ------RDRIFSRHPDLFPEEVYNIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DM 265
Query: 128 LAYSSKCKAML--AAVDDAVQLVVDRPYKAG 156
L +S + + L + DR Y+ G
Sbjct: 266 LNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 21/169 (12%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + ++E EL TG+ A + I+ +Y L L Q+ P FT
Sbjct: 107 SSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKALVV------RLLIQHRDLFPLSKFT 160
Query: 92 FEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDA 144
E +K A V S + + + L FA +L +SS + A +
Sbjct: 161 IEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHSSDVRQCHAYDPLSGN 214
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEA 191
+ ++ + YKAG+ + ++ G PN++LL YGF+ NP D LV+E
Sbjct: 215 LSILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPNDNYELVLET 263
>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
Length = 468
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 75/189 (39%), Gaps = 29/189 (15%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE- 66
K S+W PYI L + A+ P +W E ++ L + + E IK+EY
Sbjct: 114 KDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQANIKKEYKHA 171
Query: 67 ---LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
L V F + Q Y FT F+ + + Q H+ L P
Sbjct: 172 RKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV--------LGLTPH 223
Query: 124 GP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
G PLL +S+ + L VD QL+ + Y+ G+ + G + NS
Sbjct: 224 GTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQVFNNYGLKSNS 282
Query: 170 KLLINYGFV 178
+LL+ YGF+
Sbjct: 283 ELLLGYGFI 291
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 86/211 (40%), Gaps = 19/211 (9%)
Query: 14 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 73
PY+ D R QL +PL ++ ELA L G+ A +R + E V
Sbjct: 78 PYV---DSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDREADWRSECERCRAVLGH 132
Query: 74 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL-LAYSS 132
G Y + + F ++ + ++ + + LVPL L A +
Sbjct: 133 WGEHLTWEHYLTASTHLSSRAFPSTLLSPEPALI----PTPSSHPVLVPLIDSLNHARAH 188
Query: 133 KCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
++ D+ +V P AG ++ GP+PN++L++ YGF DNP D LV+
Sbjct: 189 PVSWSVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELVLGYGFALPDNPDDTLVL 248
Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
+ + D+R + + G L +F V
Sbjct: 249 KVS------GAADRREIWRAGGGLQRILFDV 273
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 22/202 (10%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
PL W++ EL +L G+ I ER ++ ++ + V L ++ P P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298
Query: 93 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKC------------KAMLA 139
E + AF S L+ V+ +G P+ +C A +
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358
Query: 140 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 197
D +V+ + GE + GP+ N +LL+ YGF +N D + ++ + D
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQLTVGN-D 417
Query: 198 PQYQDKRMVAQRNGKLSVQVFH 219
P + K + + +G H
Sbjct: 418 PDGEAKLAILRWHGLSLTHFLH 439
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 81/193 (41%), Gaps = 29/193 (15%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERAE 58
G+KS W PY++ L ++ QL + W++ EL L G+ E+ E
Sbjct: 102 GEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENVV 156
Query: 59 GI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 108
I K E D+ W + S+ Y +D+ + + E K+ +
Sbjct: 157 DIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDNE 212
Query: 109 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
+ V + ++PL L + + KC A L + +++ + K GE I G PN
Sbjct: 213 SENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQIYNIYGDHPN 270
Query: 169 SKLLINYGFVDED 181
S++L YG+++ D
Sbjct: 271 SEILRRYGYIEPD 283
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 78/197 (39%), Gaps = 34/197 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEG 59
+ E+ +GK+SFW PYI L +L++ PLLW +ET L GSP I R +
Sbjct: 117 LVERSRGKESFWHPYISALPSVE---ELSISHPLLWPAETIQELLQGSPMLDTIATRLKL 173
Query: 60 IKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA- 115
+ ++ L T F+ G E + V S V+ + SL
Sbjct: 174 CQEDHEALLTAGIEKFLPGG----------------ETLSEGDVRWASAVLLSRAFSLEL 217
Query: 116 ------RRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQ 166
LVP L SS + D L + Y G+ + GP
Sbjct: 218 DVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPA 277
Query: 167 -PNSKLLINYGFVDEDN 182
S+L ++YGFVD++N
Sbjct: 278 LTGSQLFLDYGFVDDEN 294
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 147/377 (38%), Gaps = 71/377 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KA 51
+YE G++S W Y + L + ++ + WSE E + L S +
Sbjct: 100 IYEYSLGEQSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEE 152
Query: 52 EILERAEGIKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTF 92
+I E+ + R + +L DT + GSL Y +DI
Sbjct: 153 DIFEKVLPLVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEE 212
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 152
+ ++ L K +VPL L A + + A L + A+ + +P
Sbjct: 213 SEGEDGYLTDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKP 263
Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQD 202
KAG+ I G P S LL YG+V DN VVE L N ED +Y
Sbjct: 264 IKAGDEIFNDYGELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQ 322
Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPIC 258
+++ Q ++ + + ++ + D+LP L D+ E+Q ++S
Sbjct: 323 LKLLDQL--EILEDGYCILRPSPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPK 380
Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY-------NLHPKKRVATQLVR 311
P+ E + L D ++++ Y T+ ED+ +L + ++ +A Q+
Sbjct: 381 PILGAREARI---LLDAAQSKMGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRV 437
Query: 312 MEKKMLNACLQVTADMI 328
EK++L A L + D +
Sbjct: 438 GEKEILQALLMMLQDFL 454
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 78/339 (23%), Positives = 134/339 (39%), Gaps = 50/339 (14%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + +++ EL GS A I+ +Y L L Q+ P + FT
Sbjct: 125 SSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRALVV------RLLVQHQDLFPLDKFT 178
Query: 92 FEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDA 144
E +K A V S + + + L FA +L +SS+ + A +
Sbjct: 179 IEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA------DMLNHSSEVRQCHAYDPLSGN 232
Query: 145 VQLVVDRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 199
+ ++ + Y+AG+ + ++ G PN++LL YGFV NP D + + P
Sbjct: 233 LTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLYGFVMPGNPNDSYDLVLETHPMAPF 292
Query: 200 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV----SDTSEMQSVISSLG 255
++ KR + G S + ++D LP LGY+ SD S++ S+
Sbjct: 293 FEQKRKLWDLAGFDSTSTISI-------TLTDPLPKNVLGYLRIQRSDESDLASIARQ-- 343
Query: 256 PICP----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---VATQ 308
I P +S E VL L + F L + L E L + ++P + A
Sbjct: 344 RIDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE-GVYPSRGNAWAAAH 402
Query: 309 LVRMEKKMLNACLQVTADMIMLLPDVTVS-----PCPAP 342
+ E+++L + DM+ + + + P PAP
Sbjct: 403 VSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPAPAP 441
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 136/348 (39%), Gaps = 50/348 (14%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------------- 53
G +S W PY L ++ ++ + WS+ EL +L GS +I
Sbjct: 97 GSQSRWKPYFDVL-------PVSFDNLMFWSDRELRHLEGSTVVGKIGKEAADATFREQL 149
Query: 54 ---LERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 109
+ER K NE L + GS Y +D+ T + + + + L
Sbjct: 150 IPVIERISKAKAADNEELLRMCHRMGSTIMAYGFDLETSSDQAKNDGEEWEEDSDAGETL 209
Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
K +VPL L A + + A L DD V + +P KAGE + G P +
Sbjct: 210 PK-------GMVPLADMLNADADRNNAKLFYEDDKVVMKTIKPVKAGEELYNDFGSLPRA 262
Query: 170 KLLINYGFVDEDN---------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
LL YG++ DN P D + A L T+D ++ A+ G L
Sbjct: 263 DLLRRYGYL-TDNYAQYDVVEIPADLIKERAGLRTQD--VDERWQYAEEQGVLDDGYDVS 319
Query: 221 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 280
A E+ + L L +E + V + P + +L + Y R
Sbjct: 320 RASSEEGQFPEELCVLLNLLALPRAEFEKVKNKDKIPKPDLTTNAKKLLRTILVY---RY 376
Query: 281 AGYPATLSEDEAMLTDYNLHPKKR-VATQLVRMEKKMLNACLQVTADM 327
A YP + + M +D +L+ ++R +A +++ EK++L + +++
Sbjct: 377 AAYPGNVDQ---MHSDVSLNDRRRKMAIVVIQGEKQVLQEAVDAISEI 421
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 72/316 (22%), Positives = 130/316 (41%), Gaps = 20/316 (6%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
S W Y L ++ +S + WSE EL+ L G+ + ++ E+ L+
Sbjct: 139 SLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLEE 191
Query: 70 VWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGP 125
+ + +P I + F F I + +AF +++ +V + L A V
Sbjct: 192 EIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTDD 249
Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPY 184
AY K A L + D L KAG+ + + + + N++L ++YGF++ +
Sbjct: 250 H--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTDR 307
Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
+ + ++ DP + DK +A+ NG F + R L +D
Sbjct: 308 NAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPYLRLVALGGTDA 367
Query: 245 SEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 301
++S+ S G + PVS E + + + K LAGY T+ ED+ L + L
Sbjct: 368 FLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDS 426
Query: 302 KKRVATQLVRMEKKML 317
+ +A + EK +L
Sbjct: 427 RHAIAVGIREGEKNLL 442
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 64/310 (20%), Positives = 121/310 (39%), Gaps = 34/310 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK +G S W PYI L + + L ++ ++ L G+ + L + I R
Sbjct: 166 EKSRGADSIWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIAR 218
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----- 117
+Y +L + S + +E+++ A V + +L +A +
Sbjct: 219 KYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNS 277
Query: 118 --FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
AL+P K + + ++ KAG ++ G +PN+ LL++
Sbjct: 278 PISALIPCWDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPNADLLVHN 337
Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----EAIS 230
GFVD +N D + + L+ D +A++ +L ++ H G + E IS
Sbjct: 338 GFVDPNNNKDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPEYIS 390
Query: 231 -DMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
+L ++R+ +S S+++ + L C + +E FK L
Sbjct: 391 GQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLLGVL 450
Query: 284 PATLSEDEAM 293
ATL E + +
Sbjct: 451 EATLREADEL 460
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/193 (19%), Positives = 74/193 (38%), Gaps = 22/193 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ +G +SFW PY+R L + P+ ++ +E + + R +
Sbjct: 267 ERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQSRCRMLVN 320
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
++ +Q + P E + + C + + P
Sbjct: 321 ----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNIPGLGSTP 370
Query: 123 LGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
L P++ + + + D ++QLV R + G+++ + G N +LL++YG
Sbjct: 371 LMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNPQLLLDYG 430
Query: 177 FVDEDNPYDRLVV 189
F DNP+DR V
Sbjct: 431 FTLSDNPHDRFEV 443
>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
Length = 427
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 113/271 (41%), Gaps = 27/271 (9%)
Query: 57 AEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
AE + R Y+E D V W A F+ +P + FT F++A V++ + +
Sbjct: 134 AEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIVRANSFEVTRADG 192
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
R LVPL L+ +S + VDD + VD ++AG+ + G +++
Sbjct: 193 VRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCSHGEYSDAETFAR 251
Query: 175 YG----FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL--SVQVFHVHAGREKEA 228
+G + +E+N D + + D+ + + G + + G A
Sbjct: 252 FGTSAVYSEENNARDVITF---------TFPDEVHLKEEIGSCGPAEDIGFTRDG----A 298
Query: 229 ISDMLPYLRLGYVSDT--SEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYP 284
++++ LRL + T SEM+ L + P+S E AV D L L YP
Sbjct: 299 SAELMCALRLVSANATEWSEMRKPNFDLQSLKNRPLSEESEVAVYDALFATLTDLLNSYP 358
Query: 285 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
+ +DE +L L +R A ++ EK+
Sbjct: 359 YSDVDDEHLLRGDRLADDERRAVKIRLREKR 389
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 96/209 (45%), Gaps = 21/209 (10%)
Query: 12 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
W YIR L R + P +W+E E L G+ +A + + + +E++ L V
Sbjct: 223 WTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEAALEAKLLSLGKEFDTLREV- 274
Query: 72 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAY 130
+ +P+ E+ + +V V + + + L R A+VP G ++ +
Sbjct: 275 ------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRCLELPRSGTAMVP-GLDMVNH 326
Query: 131 SSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQPNSKLLINYGFVDEDNPYDR 186
SSK A D D V L++ P ++GE + + G +P S++L +YGF+D +N D+
Sbjct: 327 SSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAKPASEMLFSYGFIDPNNIVDK 386
Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
L + +DP + K ++ L++
Sbjct: 387 LTLRLDPFPDDPLARAKLRISNSGPTLTI 415
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 111/277 (40%), Gaps = 24/277 (8%)
Query: 34 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 93
+ +S+ E+ GS + I +Y +L T M ++ P F E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168
Query: 94 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 150
+K A V S + VS L+ +L +SS +C A D + ++
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
+ Y G+ + ++ GP PN++LL YGFV +NP+D + + P Y+ K + +
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERLWKLA 286
Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPICPVSPCM 264
G + + A +D LP L Y+ +S++ ++ G +S
Sbjct: 287 GLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADEKISDDS 339
Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 301
E +L L D A L G+ L A L +++P
Sbjct: 340 ETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 36/72 (50%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
++PL S+K + V+ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 178 VDEDNPYDRLVV 189
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
Length = 486
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 90
S W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369
Query: 91 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 143
+ E K + V S + +++ LV + P+L + +S K + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+ DR Y + I + G N++L ++YGF +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 80/210 (38%), Gaps = 44/210 (20%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
+YE G S W PY L + + + W+E ELA L S +I
Sbjct: 91 LYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKIGKESADE 143
Query: 55 ---------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFT 91
E+A+ +E + M GSL Y +D+ T
Sbjct: 144 AFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNLELMHKMGSLIMAYAFDVEPATPT 203
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
E+ ++ A + L K +VPL L A + +C A L D +++ +
Sbjct: 204 KEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCLEMKALK 255
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDED 181
P +AGE I GP P S LL YG+V ++
Sbjct: 256 PIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 36/74 (48%)
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
R AL+PL + YS M D A+ L V + G+ GP N LL YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274
Query: 177 FVDEDNPYDRLVVE 190
FV++DNP D V+E
Sbjct: 275 FVEQDNPSDTYVLE 288
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 52/220 (23%), Positives = 82/220 (37%), Gaps = 75/220 (34%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE +G S W PY+ L E+P+ W+ EL L T ++
Sbjct: 112 LYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGALQAGATAGKV------- 157
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
RE E DT + + + +P ++F+ + +V +LA R
Sbjct: 158 GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDEALV-----ALAHR--- 198
Query: 121 VPLGPPLLAYS---------------------SKCKAMLAAV------------------ 141
+G ++AY+ KAM+ V
Sbjct: 199 --MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPMADILNADAEFNAHVNH 256
Query: 142 -DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
D+ + + RP KAGE I+ + GP PNS+LL YG+V E
Sbjct: 257 GDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 77/345 (22%), Positives = 136/345 (39%), Gaps = 55/345 (15%)
Query: 9 KSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
KS W PY+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 163 KSDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFI 214
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVP 122
F Q D + FT+++F+ A V + V+L + + AL+P
Sbjct: 215 RR--------FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIP 266
Query: 123 L------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQPNSKLL 172
L P A ++C A A ++ ++ + R A I + G + +++ L
Sbjct: 267 LWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRTDAEFL 326
Query: 173 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 232
++ GFV NP+ + L P Y+++ + + G + F RE A +
Sbjct: 327 VHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAATTT 386
Query: 233 LP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVLDQLAD 274
P + L VS + + + + + P C ER LA
Sbjct: 387 TPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER----WLAT 442
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
K L YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 443 VMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 105/270 (38%), Gaps = 21/270 (7%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + ++E EL G+ + + I+ ++ L LF Q+ P + F+
Sbjct: 116 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 169
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 149
E +K A V S + Q L P +L +SS+ K + + ++
Sbjct: 170 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 228
Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
+ Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K +
Sbjct: 229 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 288
Query: 210 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCM 264
G S + ++D LP YLR+ + + + VS
Sbjct: 289 AGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSDSN 341
Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAML 294
E +L L + F L G+ L + E L
Sbjct: 342 EVEILQFLVESFGHLLGGFGTPLEKLEEQL 371
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 105/270 (38%), Gaps = 21/270 (7%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + ++E EL G+ + + I+ ++ L LF Q+ P + F+
Sbjct: 107 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 149
E +K A V S + Q L P +L +SS+ K + + ++
Sbjct: 161 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 219
Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
+ Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K +
Sbjct: 220 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 279
Query: 210 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCM 264
G S + ++D LP YLR+ + + + VS
Sbjct: 280 AGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSDSN 332
Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAML 294
E +L L + F L G+ L + E L
Sbjct: 333 EVEILQFLVESFGHLLGGFGTPLEKLEEQL 362
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 14/151 (9%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
P+ W EL +L GS +I ER IK +Y + +W P I + T
Sbjct: 141 PIFWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTL 188
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDR 151
E FK A + V S + V+ AR A+VP L + + K A + +
Sbjct: 189 EEFKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQ 247
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
G I G + N + L+NYGF EDN
Sbjct: 248 KISVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L +++++ +P GE + G PNS+LL YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272
Query: 179 D-EDNPYD 185
+ E + YD
Sbjct: 273 EWEGSKYD 280
>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
Length = 418
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 22/172 (12%)
Query: 36 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 92
W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F +
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 147
E K + V S + +++ LV + P+L + +S K + + +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 199
DR Y + I + G N++L ++YGF +NPYD E L T+ P+
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD----ETELLTQFPE 359
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 81/192 (42%), Gaps = 19/192 (9%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-E 58
++S W PY + ++ L + W + EL L G E+ ER +
Sbjct: 103 ERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERIIK 157
Query: 59 GIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
IK+ E T S F + Y I + +F E+ + + +++
Sbjct: 158 SIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELENE 217
Query: 116 RRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
R +++PL L A +SKC A L + +++V R + E + G PNS+LL
Sbjct: 218 RYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRR 277
Query: 175 YGFVDED-NPYD 185
YG+V+ D + YD
Sbjct: 278 YGYVEWDGSKYD 289
>gi|219125216|ref|XP_002182882.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405676|gb|EEC45618.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 488
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 63/161 (39%), Gaps = 17/161 (10%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
P WSE+EL L GSP + + EGI+++YN + W + + P+ F
Sbjct: 138 PRQWSESELKALRGSPLLIRVRKAREGIQKDYNLIREAW------KEIHSSHSPSTTANF 191
Query: 93 ---EIFKQAFVAVQSCVVH-----LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV--- 141
E F A V S K ++PL + +C +
Sbjct: 192 PKLEEFSSAMAVVSSRAFSGMAGFETKGGAVNDTTMIPLLDLCNHHRGRCVTKNVSYRFK 251
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
D V + G+++ + G Q N++L +NYGF DN
Sbjct: 252 DGTVMVKAVTDIAIGDTLKITYGAQGNAQLFLNYGFCVADN 292
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 45/194 (23%), Positives = 84/194 (43%), Gaps = 19/194 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ G++S W PY+ + P+ W E E+ +L +P + + LE+ K
Sbjct: 117 ERFAGERSQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KT 164
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA--- 119
E EL T + Q D + +T++ + A+ V + V+++ R A
Sbjct: 165 EVQELHTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQD 224
Query: 120 LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
+ L P LL +S + + A + ++ + + + + GP N +LL+ Y
Sbjct: 225 VCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEY 284
Query: 176 GFVDEDNPYDRLVV 189
GFV +NP+ + V
Sbjct: 285 GFVAANNPHRSVYV 298
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 78/197 (39%), Gaps = 34/197 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEG 59
+ E+ +GK+SFW PYI L +L++ PLLW +ET L GSP I R +
Sbjct: 117 LVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAETIQELLQGSPMLDTIATRLKL 173
Query: 60 IKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA- 115
+ ++ L T F+ G E + V S V+ + SL
Sbjct: 174 CQEDHEALLTAGIEKFLPGG----------------ETLSEGDVRWASAVLLSRAFSLEL 217
Query: 116 ------RRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQ 166
LVP L SS + D L + Y G+ + GP
Sbjct: 218 DVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPA 277
Query: 167 -PNSKLLINYGFVDEDN 182
S+L ++YGFVD++N
Sbjct: 278 LTGSQLFLDYGFVDDEN 294
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 27/212 (12%)
Query: 14 PYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE---- 66
PYIR L D+ R +PL +S+TEL L GS L+R + E+ +
Sbjct: 117 PYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSEWEQCQKT 168
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
+ TV G F Y + + F ++ +V S + ++ G
Sbjct: 169 IATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYPVLLPGID 223
Query: 127 LLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
L +S S ++ + D + + LV+ + AG ++ GP+PN++L++ YGF
Sbjct: 224 SLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELILGYGFSLP 283
Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
+NP D +V++ N+ Q K V RN +
Sbjct: 284 ENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314
>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
Length = 482
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 81/206 (39%), Gaps = 41/206 (19%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY- 64
G W PY+ L A +P+ WS EL L SP A++ E A+ + R
Sbjct: 121 GAACRWRPYLDVL-------PAAFATPMFWSPAELGALQASPAVAKVGRESADNMFRGIL 173
Query: 65 -------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 105
++ + GS Y +D+ E E + +V +
Sbjct: 174 LPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLDKEEDEDEDGEDGWVEDRDG 233
Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
+ V +A +L ++ + DD + + RP +AGE I+ + GP
Sbjct: 234 KALMGMVPMA----------DILNADAEFNVHVNHGDDDLTVTALRPIRAGEEILNYYGP 283
Query: 166 QPNSKLLINYGFVDEDNP-YDRLVVE 190
PNS+LL YG+V E + YD VVE
Sbjct: 284 HPNSELLRRYGYVTERHARYD--VVE 307
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 87/199 (43%), Gaps = 21/199 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EK + ++S W +IR + + ++ WSE +A L P A R +
Sbjct: 31 LVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKPFLAIAARRKRVV 83
Query: 61 KREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 112
E+++L + W + + + +F+ E + A V + H ++
Sbjct: 84 NDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAGTVSTRSCHYERK 143
Query: 113 S-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYKAGESIVVWCGPQ 166
S + R V P+L + + A +AA DA+ ++ R Y+ GE +++ G
Sbjct: 144 SGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYEEGEQVMIHYGNW 203
Query: 167 PNSKLLINYGFVDEDNPYD 185
N+ LL +YGFV EDNP D
Sbjct: 204 SNAGLLEHYGFVLEDNPLD 222
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 122/317 (38%), Gaps = 79/317 (24%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
K +K F PYI L PL +SE EL L G+ ++L E K
Sbjct: 93 NKTNEKCFHFPYINTLPT-------TFSCPLSYSENELKMLKGT----KLLVTVEKTKTF 141
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV-- 121
+L + +L QYP T F+ F Q V +V +R F ++
Sbjct: 142 LKKLSDYY---ETLTHQYP----TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYP 187
Query: 122 -PLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKA----GESIVVWCGPQ----- 166
P+G P +S+ + V V +R + V+ CG Q
Sbjct: 188 DPIGDVASLIPFADFSNH------NTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNY 241
Query: 167 ---PNSKLLINYGFVDEDNPYDRLVV-----EAALNTEDPQYQDKRM------------- 205
PN K+L+ YGFV +NPYD +++ E + + ++ +M
Sbjct: 242 RIRPNEKMLLGYGFVISENPYDEVLLRINFKERHFEKQVEESEESKMEVENKENERMEVE 301
Query: 206 -------VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGP 256
+ Q + V F + REKE +D+L LR+ +S + ++ + L
Sbjct: 302 EEDNEDEITQILKREGVDRFDYYLTREKELPTDLLRVLRIVNLSLVEANQYSQALLDLSY 361
Query: 257 ICPVSPC-MERAVLDQL 272
+ P++ R++++Q+
Sbjct: 362 VSPINEIKATRSLMEQI 378
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 81/214 (37%), Gaps = 46/214 (21%)
Query: 7 GKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 64
G KS+W PYI L Q E+ LLW L G+ KA A K Y
Sbjct: 138 GDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFAAEAARWKEMY 190
Query: 65 NELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQKVSLARRFAL 120
+ G L Q + A+T+E F+ A +S + +L AL
Sbjct: 191 --------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVLDATLPADKAL 242
Query: 121 VP------------LGP----------PLLAYSSK---CKAMLAAVDDAVQLVVDRPYKA 155
+ LG PL+ S+ K A V L V PY++
Sbjct: 243 LQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFVGLQVLEPYES 302
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
G+ I GP+ N LL+ YGF DNP+D +V+
Sbjct: 303 GQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
A+VPL L A + A L D +++ R K GE I G PNS LL YG
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404
Query: 178 VDEDNPYDRLVVEAALNT 195
VD+ N +D VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
FE F A+ AV + ++++ L P LL + K + V++ +
Sbjct: 165 LNFETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRS 219
Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
+ Y+AG + + G N L +NYGFV ++NP D + VE
Sbjct: 220 NVGYEAGSEVFIGYGSHDNRTLFLNYGFVLDENPNDCITVE 260
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 77/201 (38%), Gaps = 35/201 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++E+ +G+ S W PY+ L R S L + + +L +L G+ +
Sbjct: 113 VHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQAYRNAV 167
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ EY+ ++ G L E++ ++IF A+ + S + +R
Sbjct: 168 QEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTSRVLDA 213
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCG 164
P L + + ML VD + + L V P +GE + G
Sbjct: 214 YFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEVHNNYG 273
Query: 165 PQPNSKLLINYGFVDEDNPYD 185
P N +L+ YGF DNP D
Sbjct: 274 PLNNQQLMTTYGFCIVDNPCD 294
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 78/206 (37%), Gaps = 41/206 (19%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
S W PY L R P+L S +LA TG+ E++ + R+Y L
Sbjct: 652 SLWQPYFDLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQH 704
Query: 70 VWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL 123
+ F+ P IP FTF F A V + C + + + L+PL
Sbjct: 705 I-------FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPL 757
Query: 124 --------GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
G + +K C +A+ D GE I + G + N ++L+
Sbjct: 758 FDMFNHKPGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLL 807
Query: 174 NYGFVDEDN-PYDRLVVEAALNTEDP 198
GFVD N YD + + ++ DP
Sbjct: 808 YSGFVDPTNIEYDHIKLSVSIPQSDP 833
>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
Length = 452
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 77/192 (40%), Gaps = 36/192 (18%)
Query: 30 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
V+ PL W + + L G ++ R Y+++ F A + F + EA
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHG---ARFYSQVYKHLFGANNQF------VNAEA 193
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY---SSKCKAMLAAVDDAVQ 146
F + I S ++ + FAL+P S C+ L + D+ VQ
Sbjct: 194 FFWAI---------SILMSRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQ 243
Query: 147 ---------LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL-- 193
+ R Y+ GE + + G N +LL NYGF +NPYD L + AAL
Sbjct: 244 DFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQ 303
Query: 194 -NTEDPQYQDKR 204
N DP + KR
Sbjct: 304 PNEADPAFAQKR 315
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 51/275 (18%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K S WLPY+ L + +PL +S EL L+ SP E L + R+
Sbjct: 167 QKLSPDSSWLPYLDALPQ-------TFSTPLYFSALELRKLSPSPAYEESLIMYRNVARQ 219
Query: 64 YNELDTVWFMAGS----------------------LFQQYPYDIPTEAFTFEIFKQAFVA 101
+ V+F+A LF P+ + FTF++++ A
Sbjct: 220 F-----VYFLAAVQRSERSRSAKKDKNHAAVGMEPLFLNAPFTVSN--FTFDLYRWAVAC 272
Query: 102 VQSCVVHL----QKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRP 152
V + + + K S + A VP PLL + + D + +
Sbjct: 273 VTTRINFIPSQYAKDSNGQPVA-VPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKATKD 331
Query: 153 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMV---AQ 208
YKAG+ + ++ G + N + ++ GFV D +N D ++ D Q + + + A
Sbjct: 332 YKAGDEVTIFYGIRTNRQFFLHNGFVPDGENKNDTYKLKIGFPRGDKQVRARLKLMHDAG 391
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 243
N + V VF V+A +S +L + R+ V +
Sbjct: 392 FNAESRVFVFEVNASERPVPLS-LLDFARVFLVEN 425
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 54/331 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK---------- 50
+YE+++G KS W P+I L + L WSE ELA L GS
Sbjct: 106 LYERRKGAKSRWGPHIALLP------ATPPHALLRWSEAELAELAGSDALELANRWRSQV 159
Query: 51 ----AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 106
+EI++++ E + + A ++P+ + E F++ + +
Sbjct: 160 SSDFSEIVDKSRAAVEESDPGKQL-SAAVKASLRFPW-LDLEGFSWAV----------SM 207
Query: 107 VHLQKVSLARRFALVPLGPPLLAY--------SSKCKAMLAAVDDAVQLVVDR---PYKA 155
+ + VS++R+ A PP+ A+ DDA V R K
Sbjct: 208 IWSRCVSVSRKGA-----PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFVLRRTGNAKK 262
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
G+ + + PN+ LL+ YGF + + + A L+ E P Y+ KR + KL +
Sbjct: 263 GDELKLCYDGLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALE---KLGL 319
Query: 216 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 275
A + A D LP RL ++ ++ + L + S RA L
Sbjct: 320 GATADGAAPFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEATARAAAGDLVAA 376
Query: 276 FKARLAGYPATLSEDEAMLTDYNLHPKKRVA 306
A LA Y + ED A L D P+ R+A
Sbjct: 377 CDALLAAYRGSEDEDAAALADPATPPRLRLA 407
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
R AL+P+ S C + + + DR Y+A E + G N LL YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235
Query: 177 FVDEDNPYDRLVVEAAL 193
F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 8/160 (5%)
Query: 139 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTE 196
A A+ L DRPY AG+ + V GP+ + +LL++YGF NP+ + A++ +
Sbjct: 242 GATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDRQ 301
Query: 197 -DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSVISS 253
DP K R+G S F + E + L +L R +T E+ SV+
Sbjct: 302 GDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASVLFE 361
Query: 254 LGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 291
G P+ + V L L++ A L YP ++ D+
Sbjct: 362 SGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
++ L+R FA G L +S Q+V ++ +K G+S+ + G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241
Query: 171 LLINYGFVDEDNPYDRLVV 189
LL++YGF+ DNP D V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 131 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
S CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251
Query: 191 AAL 193
+
Sbjct: 252 EVI 254
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 68/296 (22%), Positives = 123/296 (41%), Gaps = 36/296 (12%)
Query: 8 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA-EILERAEGIKREYNE 66
+ FW PYIR L + ++PL +++ E+ L G+ A ++L R + E+
Sbjct: 125 QSQFW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLGAGDVLLRKQLWMEEWEA 176
Query: 67 LDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV--QSCVVHLQKVSLARRF 118
F+ G ++Y +D+ A +T F V + S + +S F
Sbjct: 177 GKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGITMDSSIEENTMLSDDNGF 234
Query: 119 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
L+PL +L + K + + L+ G + GP+ N +LL+ YGF
Sbjct: 235 PVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVFNNYGPKGNEELLMGYGF 293
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYL 236
V +NP D L ++ ++ P+ Q ++ QR K + +VFH+ + + +P L
Sbjct: 294 VIPENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFHLTKSADSGQKTSTVPAL 350
Query: 237 RLGY-----------VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
+ V++ +E+ + + P+S E AV L K +LA
Sbjct: 351 ESDWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAVALGLKAAIKQKLA 406
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L +++V +P +GE I G PNS LL YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314
Query: 179 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 222
D NP D + V A LN + + Q + + G V VF
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVF---- 370
Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
R+ E S+++ +RL + T ++ P V +L + RL
Sbjct: 371 TRDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427
Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
YP ++ +DEA+L L KR A + EK +L L
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466
>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
Length = 442
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 76/190 (40%), Gaps = 20/190 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E +GK SFW PY+ L + P W E ++AYL + I E +K
Sbjct: 118 EYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQANVKG 175
Query: 63 EYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVS 113
E+ + L F + + Y FT F+ + + + H+ Q V
Sbjct: 176 EFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLLPQSVE 235
Query: 114 LARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPN 168
L F+++ PLL ++ + + D DA LV Y G + G + N
Sbjct: 236 L-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNYGLKTN 291
Query: 169 SKLLINYGFV 178
S+LL+ YGF+
Sbjct: 292 SELLLGYGFI 301
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 45.1 bits (105), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 100/227 (44%), Gaps = 12/227 (5%)
Query: 76 SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK 135
+F ++ P + F+F+ +K A ++ S + +S L+ +L ++S K
Sbjct: 147 GVFMRHRDLFPLDKFSFQHYKWALSSIWSRGMDF-TISEGNSVRLMAPFADMLNHASDAK 205
Query: 136 AMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
A L V R Y+ G+ + ++ G NS+LL YGFV DNP D +
Sbjct: 206 QCHAYDPSTGSLTVLACRDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPNDNYELVLQT 265
Query: 194 NTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 251
++ P Y+ K QR KL+ ++ + + +L YLR+ + D S++ ++
Sbjct: 266 SSMAPLYEQK----QRLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMT 320
Query: 252 SSLG--PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ +S E +L L+ +A L G+ +L + E L +
Sbjct: 321 MQIATESYTKISDENESQILLFLSQSIEALLEGFEISLEKLETQLAE 367
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 30/202 (14%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M EKK GK S W Y+ L + PL W +L L G+ ++E+ G
Sbjct: 88 MTEKKLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGC 137
Query: 61 KR----------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
K ++N + + F+ + + P++ + + ++ A V+ S +
Sbjct: 138 KAMPDRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGED 195
Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
+ A+VP+ L + L A A++++ GE ++ G PN
Sbjct: 196 RFQ-----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSYGDLPN 250
Query: 169 SKLLINYGFVDED-NPYDRLVV 189
S+LL YGFV+ D NP+D L V
Sbjct: 251 SELLRRYGFVETDPNPHDCLEV 272
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 132/322 (40%), Gaps = 40/322 (12%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
PL WSE EL+ L G+ + + + + +Y EL+ LF + P ++
Sbjct: 155 PLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQL 208
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAVQ 146
+ F A V+S V ALVPL L+ + ++ + LA A
Sbjct: 209 DDFLWAVATVRSRV---HSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQA 264
Query: 147 LVVD--RPYKAGESIVVWCGP--------QPNSKLLINYGFVDEDNPY-------DRLVV 189
LVV+ R Y GE + + G + +S++L++YG +D D P ++
Sbjct: 265 LVVEAQRDYAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFIL 324
Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 249
AL +D Y DK + + NG F + A E L D ++
Sbjct: 325 SLALPEDDKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEP 384
Query: 250 VISSLG---PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 306
+ + + PVS ERAV + + + +A L GY ++ +D L D P R+
Sbjct: 385 LFRNEAWGHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRLE 442
Query: 307 TQ-LVRM-EKKMLNACLQVTAD 326
LVR+ EK+ L+A L D
Sbjct: 443 KAILVRLGEKETLDATLAFFED 464
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
L + CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D
Sbjct: 187 LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDE 244
Query: 187 LVVEAAL 193
+ ++ +
Sbjct: 245 VYLDEVI 251
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 45.1 bits (105), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L + ++KC A L +D++++ KAGE + G PN+++L YG+V
Sbjct: 229 SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQVYNIYGNHPNAEILRRYGYV 288
Query: 179 D-EDNPYD--RLVVEAALNTEDPQY 200
+ E + YD L +E + T QY
Sbjct: 289 EWEGSKYDFGELPLEVIIETLHEQY 313
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 45.1 bits (105), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 155/377 (41%), Gaps = 51/377 (13%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEG 59
MYE K SFW PY+ L +++P+ W++ +L L G+ T + + E AE
Sbjct: 106 MYEI-TNKDSFWRPYLDLLPE-------TLDTPMFWNDDDLELLEGTSTLSHLGKEDAET 157
Query: 60 IKRE----YNELDTVWF----MAGSLFQQYPYDIPTEAFTFEIFKQAFV----------- 100
I E + +L F +L+ + I +F+ + +
Sbjct: 158 IFTEQIVPFMKLHPTHFDLKVHNMALYHRVASVIMAYSFSEDDDEDDDDEDDDEEEDCCD 217
Query: 101 --AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGE 157
A C ++ + + A+VPL +L + + C A L + + P AG
Sbjct: 218 GDANNECCSQKRQKRM-EKIAMVPLAD-MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGH 275
Query: 158 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLS 214
+ G NS+LL YGF+D+ N ++ + + + E + ++ M A
Sbjct: 276 ELYNTYGDLSNSELLRKYGFIDDVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCW 335
Query: 215 VQVFHVHAG----REKEAISDML-----PYLRLGYVSDTSEMQSVISSLGPICPVSPCME 265
+ FH+ A +E EA +L L + D E++S +++L V+ C
Sbjct: 336 LPEFHIPADALPPQELEASIALLFQSPKQVRALRALDDEDEIRSFLATL-----VNKC-R 389
Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
R V + L + + R Y T EDE L + +L ++++A ++ E+ +L+ +
Sbjct: 390 RKVSETLLAFGQKRAEEYTTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLK 449
Query: 326 DMIMLLPDVTVSPCPAP 342
+ + P + PAP
Sbjct: 450 ERLETTPPDQETKEPAP 466
>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 44.7 bits (104), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 92/227 (40%), Gaps = 37/227 (16%)
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
+VP+ L A A L + +++V +P KAGE I G PNS LL YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317
Query: 180 ----------EDNPYD------RLVVEAALNTEDPQYQDK----RMVAQRNGKLSVQVFH 219
NP D L V AA + QDK VA + F
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDD------TFV 371
Query: 220 VHAGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
+ G + + +++ + RL ++ D E S L P ++ VL D
Sbjct: 372 I--GTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSR 424
Query: 279 RLAGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 322
R+ Y T+ +DEA+L N P K+ A + EK++L+ LQ
Sbjct: 425 RINEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 54/124 (43%), Gaps = 16/124 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYE +G +S W YI+ + ++ P+LW + + YL G+ + ++E E +
Sbjct: 92 MYETSKGVQSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETL 144
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ +Y E + + +P FT E FK A V S ++ + +L
Sbjct: 145 EEQYRE------DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SL 195
Query: 121 VPLG 124
VPL
Sbjct: 196 VPLA 199
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 44.7 bits (104), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 147/349 (42%), Gaps = 51/349 (14%)
Query: 1 MYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 59
+YEK +G KS W +I L R + L + EL L GS + E
Sbjct: 102 LYEKFVRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEK 154
Query: 60 IKREYNELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
+ +Y L + LF+ P D+ E F+ E +K A + S V +A
Sbjct: 155 VAHDYARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVA 209
Query: 116 RRF--ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSK 170
++ A+VP+ +L + + + M D + Q LV + + AG + + GP N K
Sbjct: 210 KQSFKAMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHK 267
Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKE 227
LL YGFV NP+D VE L ++ +Q+K + NG HA E
Sbjct: 268 LLALYGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHATNPFE 318
Query: 228 AISD-----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 280
++D +L R+ + E + + + +S E+ L +L + L
Sbjct: 319 LVADESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYTLEKML 378
Query: 281 AGYPATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 323
+P ++ ED+ +L TD NL+ +R+A + R +K +L+ + +
Sbjct: 379 ESFPTSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 143/377 (37%), Gaps = 71/377 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
+YE G S W PY L ++ + W+E ELA L S +I
Sbjct: 91 LYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANE 143
Query: 55 ---------------------ERAEGIKREYNELDTVWFMA--GSLFQQYPYDIPTEAFT 91
ERA+ +E + + M GSL Y +D+ A +
Sbjct: 144 VFIEQLLPVIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AIS 202
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
+ + A + L K +VPL L A +C A L D +++ +
Sbjct: 203 DKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALK 255
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTEDPQ 199
P +AG+ I GP P S LL YG++ D YD L + + E +
Sbjct: 256 PIQAGDEIFNDYGPLPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEERIE 315
Query: 200 YQDKRMVAQRNGKLSVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 258
Y D++ + ++ + F + +E++S L L + E + + S G +
Sbjct: 316 YLDEQEIVDTGYDIAASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKGRL- 368
Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQLVRM 312
P + M L +AR+A YP TL +D E ++RVA + VR+
Sbjct: 369 PKAEKMTGKAAKFLYKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARAVRI 428
Query: 313 -EKKMLNACLQVTADMI 328
EKK+L + AD I
Sbjct: 429 GEKKLLVQTEERLADKI 445
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
+D + LV + + +G + GP PNS+LL+ +GF DNP++ + + A + P +
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288
Query: 202 DKRMVAQRNG 211
+K + NG
Sbjct: 289 EKNKIMLDNG 298
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 36/171 (21%), Positives = 72/171 (42%), Gaps = 24/171 (14%)
Query: 28 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 87
V P++W EL +L + + +R + + ++++ F + DI
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149
Query: 88 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 142
E +T+ A++ V + + + K R AL+P+ + C+ +
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
+ +V DR YK GE + + N L+ YGFV ++NP D + ++ +
Sbjct: 203 EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYIDDVI 253
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 74/188 (39%), Gaps = 18/188 (9%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MY K K+SFW PY + + SP+ WSE EL L GS + +
Sbjct: 143 MYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNMARIEKKRL 193
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-A 119
+ Y+EL +F+ +AFT + + A S V+ L + +
Sbjct: 194 RAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQTGHGNVPT 247
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+PL + S+ K + A + DR G + G + N + L+ GF
Sbjct: 248 FIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESYGNKSNYEYLLYNGF 307
Query: 178 VDEDNPYD 185
V EDNP D
Sbjct: 308 VMEDNPND 315
>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 691
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 83/185 (44%), Gaps = 15/185 (8%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-- 63
+G + FW PYI L + G L +PL + +L +L G+ + A E+ E + +E
Sbjct: 123 RGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQKESLLKEKY 176
Query: 64 ---YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
+ EL F ++Y +++ A T + + V + VV ++ L
Sbjct: 177 QSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAELPEENVSVL 233
Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
+P +L + K A + V VV AGE + GP+ N +L++NYGF +
Sbjct: 234 LPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLMMNYGFCLQ 292
Query: 181 DNPYD 185
+NP D
Sbjct: 293 NNPCD 297
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 44.7 bits (104), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 72/350 (20%), Positives = 133/350 (38%), Gaps = 44/350 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EK +G S W P+I L + + L ++ ++ L G+ + + I
Sbjct: 252 IMEKLRGADSPWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVI 304
Query: 61 KREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHL 109
R Y + D+V +LF Y Y++ A + +Q V Q
Sbjct: 305 ARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDA 364
Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
+ ++ AL+P S K + ++ YK+GE ++ G + N+
Sbjct: 365 ANLPIS---ALIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSNA 421
Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGR 224
L++ GFVD NP D + + L+ D + + ++ +R +L V H
Sbjct: 422 DRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHISG 481
Query: 225 EKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
E +L ++R+ +S S+++ + L C + +E L K
Sbjct: 482 E------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKL 535
Query: 279 RLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
L ATL E EA+ + + Q R+E+++L+ LQ
Sbjct: 536 LLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 44.3 bits (103), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 82/194 (42%), Gaps = 19/194 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ G S W PY+ + P+ W E E+ +L +P + + LE+ K
Sbjct: 117 ERFAGDCSQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KT 164
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA--- 119
E EL T S Q D + +T++ + A+ V + V+++ R A
Sbjct: 165 EVQELHTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQD 224
Query: 120 LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
+ L P LL +S + + A + ++ + + + + GP N +LL+ Y
Sbjct: 225 VCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEY 284
Query: 176 GFVDEDNPYDRLVV 189
GFV +NP+ + V
Sbjct: 285 GFVAANNPHRSVYV 298
>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
Y34]
Length = 373
Score = 44.3 bits (103), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 24/163 (14%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 87
P +W + EL L PT A + E + +YN E +TV S+ FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161
Query: 88 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 147
F +E V+ C S R ALVPL C+ ++ + + +
Sbjct: 162 RTFLYE------VSETECY------SWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
DR Y+AGE + + G N LL YGF+ N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 44.3 bits (103), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 142/359 (39%), Gaps = 64/359 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
++EK G S W YIR L R A+++PL WS ELA L G+ ++L A G
Sbjct: 163 LHEKSLGDDSRWAAYIRCLPRVE-----ALDAPLFWSSEELAELAGT----QLLANAAGY 213
Query: 60 ---IKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
++ + L F +LF D AF+ F AF ++S L V
Sbjct: 214 DSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWAFGVLRSRA--LPPVDQG 271
Query: 116 RRFALVPLGPPLLAYSSKC-------KAMLAAV---------DDAVQLVVDRP----YKA 155
AL+P G + + C +AAV +V L V++ K
Sbjct: 272 ESIALIP-GIDMANHDGLCSQTWQLNNGGIAAVFGGRGGADGGGSVLLRVEKTKAGGAKR 330
Query: 156 GESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKL 213
GE I GP +S+ ++YGFVD V+ ++ +D DK V G
Sbjct: 331 GEEIRCNYGPANIDSQFALDYGFVDAFCSRPGYVLGPLSIPEDDVNAFDKMDVLSVAGLK 390
Query: 214 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTS----------EMQSVISSLGPICPVSPC 263
F + A ++ +M+ ++RL + + E ++IS PVSP
Sbjct: 391 ESPAFTIRA--FEDPPPEMVVFMRLLNLKNDDAFLLEAIFRQECWALISD-----PVSP- 442
Query: 264 MERAVLDQLADY-FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
D AD + L Y + +D + D + P+ R+A ++ EK+ L L
Sbjct: 443 ------DNEADAGCEEALGAYATKIEDDRGVADDADASPRLRLAARVRMGEKQALEEVL 495
>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 518
Score = 44.3 bits (103), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 50/214 (23%), Positives = 92/214 (42%), Gaps = 25/214 (11%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
D+ + +P K G+ I G + N LL+ YGF N YD + +N Q
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309
Query: 202 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 242
+ +++V Q + K+++ + +K I+ D++ YLRL +
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369
Query: 243 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
+++ V+ SL PVSP E VL L+ + T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425
Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 334
K R A +K++L + + ++LL +
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILLNQL 459
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 44.3 bits (103), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 22/130 (16%)
Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
+ P +L Y K V +++ + RP KAGE + G P S L+ YGF+ DN
Sbjct: 302 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 356
Query: 183 PYDR--LVVEAALNTEDP----------QYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
PYD L ++ +++ ED + +RM+ R + +Q + ++ +
Sbjct: 357 PYDVIPLDLDTSVDEEDSSSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVH 411
Query: 231 DMLPYLRLGY 240
YLRLG+
Sbjct: 412 CYFVYLRLGH 421
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 44.3 bits (103), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 84/221 (38%), Gaps = 22/221 (9%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
K K F PYI L + PL ++ EL + G+ A + E+
Sbjct: 47 NKTNPKCFHWPYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-- 96
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
+ V + L QQ+P ++F ++FK+ A QS V + F V
Sbjct: 97 ---MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGS 150
Query: 124 GPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
P +S+ C Q + K GE I + N KLL+ YGF
Sbjct: 151 LIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGF 210
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
V+E+NP D L++ +D QY + + ++ S F
Sbjct: 211 VEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 44.3 bits (103), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 88/222 (39%), Gaps = 24/222 (10%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
K K F PYI L + + PL ++ EL + G+ + E+
Sbjct: 89 NKTNPKCFHWPYINVLPK-------TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-- 138
Query: 64 YNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+ V + L QQ+P Y P + ++FK+ A QS V + F V
Sbjct: 139 ---MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVG 191
Query: 123 LGPPLLAYSSKC-KAMLAAVDDAVQLVV-----DRPYKAGESIVVWCGPQPNSKLLINYG 176
P +S+ C +A + + + + K GE I + N KLL+ YG
Sbjct: 192 SLIPFCDFSNHCTQAKVTYISNTRTETFSFQTNEEVVKPGEQIFNNYRIRSNEKLLLGYG 251
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
FV+E+NP D L++ +D QY + + ++ S F
Sbjct: 252 FVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 74/348 (21%), Positives = 136/348 (39%), Gaps = 52/348 (14%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN---- 65
S+W Y+ L +Q + + ++ E+ L SP R G+ R Y+
Sbjct: 302 SYWYEYVSILPKQ-------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTIS 354
Query: 66 ---------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 116
E D+ S + + +T + FK A V S L +
Sbjct: 355 RLRFGYEGGEDDSTKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSLSE----E 409
Query: 117 RFALVPLGPPLLAYS----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNS 169
+VPL A + SK ++A ++ + +AGE I G +
Sbjct: 410 DGGMVPLADMFNADTVISRSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSG 469
Query: 170 KLLINYGFVDEDNPY--DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 227
++L++YGF+ ED +V A + +P Y KR + Q NG + + F + + +
Sbjct: 470 QMLMDYGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNK 525
Query: 228 AISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
++ + R+ ++ ++ + + S ++P E+A L L++ L Y
Sbjct: 526 LAKELFLFARIKSINKKESDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQT 585
Query: 286 TLSEDEAML-------TDYNLHPKKRVAT----QLVRMEKKMLNACLQ 322
T+ +D +L T+ N H T +L MEK +LN+ L+
Sbjct: 586 TIDQDNQILKEIEKDKTNTN-HSSVTFNTINAIKLRLMEKNILNSFLK 632
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 84/222 (37%), Gaps = 24/222 (10%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
K K F PYI L + PL ++ EL + G+ A + E+
Sbjct: 89 NKTNPKCFHWPYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-- 138
Query: 64 YNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+ V + L QQ+P Y P + ++FK+ A QS V + F V
Sbjct: 139 ---MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVG 191
Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKLLINYG 176
P +S+ C Q + K GE I + N KLL+ YG
Sbjct: 192 SLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEALVKPGEQIFNNYRIRSNEKLLLGYG 251
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
FV+E+NP D L++ +D QY + + ++ S F
Sbjct: 252 FVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 84/221 (38%), Gaps = 22/221 (9%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
K K F PYI L + PL ++ EL + G+ A + E+
Sbjct: 89 NKTNPKCFHWPYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-- 138
Query: 64 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
+ V + L QQ+P ++F ++FK+ A QS V + F V
Sbjct: 139 ---MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGS 192
Query: 124 GPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
P +S+ C Q + K GE I + N KLL+ YGF
Sbjct: 193 LIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGF 252
Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
V+E+NP D L++ +D QY + + ++ S F
Sbjct: 253 VEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
Length = 417
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 69/299 (23%), Positives = 127/299 (42%), Gaps = 36/299 (12%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
K S+W Y L +Q + + + + E+ L S + R GI+R YN
Sbjct: 102 NKDSYWHQYTSILPKQ-------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYN- 153
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
++ SL ++ + +TFE+FK A + S L + +VPL
Sbjct: 154 --VIFKKLSSLNDEFK----KKEYTFELFKWALSCIWSRAFSLS----SDDGGMVPLADM 203
Query: 127 LLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQP---NSKLLINYGFVDED 181
A K K+ + A QL+ + + GE + G N+++L++YGF D
Sbjct: 204 FNAIE-KAKSKVRPDSRADQLIYYASKDIERGEQVFTPYGVYKTIGNAQMLMDYGFA-FD 261
Query: 182 NPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
+P + ++ L+ ++D Y D ++ + V+ F++ + + ++L Y R+
Sbjct: 262 DPSEGDTIQLTLDNFSDDELYIDTKIDLLEQLDI-VREFNL---KRNQLPQELLIYARVK 317
Query: 240 YVSDTSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
+ + +E+Q + PVS E+ L L++Y L Y TLS+D +L
Sbjct: 318 NLKE-NELQLAKEHYRNDDNRNKPVSRRNEKTALRYLSNYLSRYLDSYETTLSDDLELL 375
>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
Length = 504
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 135/356 (37%), Gaps = 66/356 (18%)
Query: 9 KSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
KS W PY+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 163 KSDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFI 214
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCVVHLQKV------- 112
F Q D + FT+++F+ AV + + KV
Sbjct: 215 RR--------FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEF 266
Query: 113 -SLARRFALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVV 161
+ AL+PL P A ++C A A ++ ++ + R A I +
Sbjct: 267 DGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFI 326
Query: 162 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
G + +++ L++ GFV NP+ + L P Y+++ + + G + F
Sbjct: 327 VYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFG 386
Query: 222 AGREKEAISDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC-- 263
RE A + P + L VS + + + + + P C
Sbjct: 387 PAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCAR 446
Query: 264 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
ER LA K L YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 447 TER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 497
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 107/251 (42%), Gaps = 12/251 (4%)
Query: 82 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---GPPLLAYSSKCKAML 138
P ++ E FT+E F+ A+ V + + + R VPL GP ++ +
Sbjct: 185 PQNVTREDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVK 244
Query: 139 AAVD-DAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 195
+ D + V +K ++ I + G N +LL N GF+ ++N + ++++ L+
Sbjct: 245 ISYDAQEHRCVFSALHKVAKNSPISIGVGMSSNMELLANRGFMMQNNGNNFVLMKFQLDR 304
Query: 196 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 255
+ R + LS + +V R E +L LR+ +S E S +L
Sbjct: 305 NSDMHASARESMMKQLNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGKALA 361
Query: 256 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
PV+ E L + LA YP T+ EDE +LT R A L+R E+K
Sbjct: 362 --TPVTLENEWRAYRLLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRREEK 418
Query: 316 MLNACLQVTAD 326
++ ++ A+
Sbjct: 419 LIYESIKTWAN 429
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 140 EWDGSKYD 147
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 81/181 (44%), Gaps = 17/181 (9%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S +L++E +L G+ + + I+ ++ L LF Q+P P + FT
Sbjct: 115 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 168
Query: 92 FE----IFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDD 143
E +K A V S + LA ++ L P +L ++S+ K +
Sbjct: 169 VEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSG 225
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
+ + + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K
Sbjct: 226 TLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQK 285
Query: 204 R 204
+
Sbjct: 286 Q 286
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 84/192 (43%), Gaps = 23/192 (11%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREYNEL 67
+SFW PY+ EL AV + W++ ELA + + E++E + + I +Y +
Sbjct: 61 QSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEYKKKKIAADYAAI 113
Query: 68 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
+ + Q + IP+E E +++A V S + L +P
Sbjct: 114 LLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LIGEHVFIPF-VDF 164
Query: 128 LAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
L +S + C D +L+ Y GE + + G + +S+LL +YGF+ E+
Sbjct: 165 LNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSSQLLASYGFMYEN 224
Query: 182 NPYDRLVVEAAL 193
N D + + A+L
Sbjct: 225 NAEDTVDITASL 236
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 208
G I + GP N +LL YGFV E N +DRL EAA+ E + +R +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
L+ + H GR A ++ LR+ ++++ E ++ + GP + E V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473
Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTD 296
+AD + L + +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 54/233 (23%), Positives = 96/233 (41%), Gaps = 41/233 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+Y+K G+ SFW PY+ L + P+ + E + YL S +A I + +
Sbjct: 112 IYQKHIGETSFWKPYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQEC 163
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
+ Y EL + L + FTF+ ++ A+ V +S V + ++ R
Sbjct: 164 IKSYEELKPFFPSLEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNL 218
Query: 119 --------ALVPLGPPLLAYSSKCKAML------AAVDDAVQLVVDRPYKAGESIVVWCG 164
+LVP+ LL +S + L D + + YK G+ ++
Sbjct: 219 DKSGLGVTSLVPM-VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYR 277
Query: 165 PQPNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQDKRMV 206
N LL+NYGFV DN D + ++E EDP+++ ++++
Sbjct: 278 RADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVL 330
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
+S A + ++ V+LV R KAG+ I + G N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 10/120 (8%)
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
++FK + V S H + F L P + S + ++ +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 202
Y AGE ++ G PN KLL++YGF+ D+D D +++ NT Q QD
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSNTTRDQLQD 283
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 94/223 (42%), Gaps = 26/223 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIK 61
E+ +G+ S W PYI L + P +++T +A L +G +AE E+ EG++
Sbjct: 117 ERHRGEASDWFPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQ 167
Query: 62 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR 116
Y + +FM+ +P P E T+E + A+ ++ + V + + +S
Sbjct: 168 HLY-AVHQDFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPD 223
Query: 117 RFALVPLGPPLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
+AL P LL + + KA ++ + + G N +LL+
Sbjct: 224 NYALAPF-LDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINYGSHDNQRLLLE 282
Query: 175 YGFVDEDNPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKL 213
YGFV NP+ + VE L E D +K + NG L
Sbjct: 283 YGFVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 107/275 (38%), Gaps = 36/275 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G+KS W PY+ L + A P E ++ L P + + E+ I+
Sbjct: 134 EKHAGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQE 185
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+ + LF + +I F F + A+ V + +++ K F+L P
Sbjct: 186 LFQSSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEP 240
Query: 123 ----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
L P LL +S + KA + ++ D K + +++ GP N +LL+
Sbjct: 241 DVYALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICYGPHDNQRLLLE 300
Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
YGFV DNP+ + V A + DK QR K+S+ H D L
Sbjct: 301 YGFVATDNPHSSVYVSADTLLKYFSSLDK----QREAKVSILKDH-----------DFLE 345
Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 269
L G+ + + + + L C R +L
Sbjct: 346 NLTFGWEGPSWRLLTALKVLSLAADEFACWRRILL 380
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)
Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 185
LL + + K +D V V K GE + G + N LL++YGFV + NPYD
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288
Query: 186 --------RLVVEAALNTE 196
+ +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307
>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
Length = 254
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 68/162 (41%), Gaps = 25/162 (15%)
Query: 135 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 194
K+ L V+ VQL+ P KAGE I ++ G + L +GF D DNP D + E L
Sbjct: 74 KSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELDL- 128
Query: 195 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 254
+E + Q K M +V+ ++ D LP RL + +
Sbjct: 129 SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAMLRI 168
Query: 255 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
+ +S E+ V + + A AGYP L ED + L +
Sbjct: 169 LHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEEDISRLEE 210
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/182 (20%), Positives = 83/182 (45%), Gaps = 12/182 (6%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
R YK E + ++ G + N++ +++ GFV ++N +D L ++ L+ D ++ KR + ++
Sbjct: 96 REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155
Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--------SDTSEMQSVISSLGPICPVSP 262
+ VF + + + + ++P + L V SD + + + + P P
Sbjct: 156 KIPTSDVFELKKAPDGDGV--LVPKVLLHLVHILQWKAPSDGTTSGTDVGA-DPSDATDP 212
Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML-NACL 321
+ L + + P ++ E +L D + ++A + E++ML NAC
Sbjct: 213 VRTKKAKTFLHVRCQLLMKALPRSVEELTEILNDPTTSLESKLAIRYRLSEQRMLTNACN 272
Query: 322 QV 323
++
Sbjct: 273 KI 274
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 23/215 (10%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + +++ EL GS + + I+ +Y L LF QY FT
Sbjct: 107 SSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL------VERLFGQYLDIFSLGKFT 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 148
E +K A V S + + L P +L +SS KC + D + ++
Sbjct: 161 IEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-MLNHSSDVKKCHVYDTSSGD-LSIL 218
Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
+ Y+ G+ + + G PN++LL YGFV +NP D + E P ++ K+ +
Sbjct: 219 AGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWV 278
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 238
G SV + +++D LP YLR+
Sbjct: 279 SAGLDSVSTISL-------SLNDPLPKSVLQYLRI 306
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
R A++P+ C+A A+ + + DR Y+AGE + + G LL YG
Sbjct: 179 RLAILPVADLFNHADVGCEAQFAS--ENYSFIADRTYRAGEELYISYGTHSTDFLLAEYG 236
Query: 177 FVDEDNPYDRLVVEAAL 193
FV +N +D + ++ A+
Sbjct: 237 FVPAENRWDVVCLDEAI 253
>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 66/166 (39%), Gaps = 27/166 (16%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 205
+ Y+ GE + + G N +LL NYGF +NPYD + + + N DP + KR
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306
Query: 206 VAQR-NGKLSVQVFHVHAGREKE----------------AISDMLPYLRLGYVSDTSEMQ 248
+ Q G S + + + R A + L + S +
Sbjct: 307 LLQSATGSHSTDIPALRSLRFNHDGQLAPNAEHWLEILLATPEELSEIITQAASQSGAAD 366
Query: 249 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
S IS P+ ++ V ++ ARL + +TL ED+A L
Sbjct: 367 STISLALPMS-----LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 45/98 (45%), Gaps = 11/98 (11%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
G SFW PYI L R + + L +S EL LTGS E L+ I R+Y
Sbjct: 94 GSASFWHPYISILPR-------SFNTVLYFSVDELQLLTGSSVLDEALKLHRSIARQYAY 146
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 104
+ F L + PY + FT+++++ A AV +
Sbjct: 147 FHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMT 180
>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
Length = 553
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
D+ V +++ KAGE I G NS LL YGF EDNP+D + + L E Q
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHDVVDLSEEL-VEYGQK 377
Query: 201 QDKRM 205
KR+
Sbjct: 378 NSKRI 382
>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
Length = 523
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 57/261 (21%), Positives = 102/261 (39%), Gaps = 62/261 (23%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP----TKAEI--- 53
+ E+ +G+ SFW PY+ L G +PL + + EL L G+ TKA++
Sbjct: 104 IIERARGRASFWAPYLEMLPSGFG-------TPLWFEDEELMELDGTTLFEATKAQVFFP 156
Query: 54 -----------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA------------- 89
L R + + E+ E +W A +F +IP A
Sbjct: 157 STFVSTCMSLYLFRPDDRELEFQEF--LW--ANCIFWTRALNIPCPASFVTSSSPEVAKD 212
Query: 90 ----FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM-------- 137
+ F++ + + VS LVP G ++ + +
Sbjct: 213 DGNRLVIYVLPHPFISCSA-----KDVSTIWIEGLVP-GIDFCNHTRRASGLWEIDGSDG 266
Query: 138 -LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNT 195
+ V ++ L+ D + G +++ G + N +LL YGFV+EDN D ++V +
Sbjct: 267 STSGVPHSMYLIADVVFPPGSEVLINYGDKGNEELLFLYGFVEEDNSNDYVMVHFPKMFL 326
Query: 196 EDPQYQDKRMVAQRNGKLSVQ 216
++ D ++ R LS+Q
Sbjct: 327 DEDNTMDFKLQLLRELDLSLQ 347
>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 421
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 16/172 (9%)
Query: 87 TEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVD 142
+E FT+ +F A+ AV + C+ K F L P L + KA +
Sbjct: 185 SENFTWHLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTV 242
Query: 143 DA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
+ ++V + Y+ + + + G N KLL+ YGFV DNP D + + T+ Y
Sbjct: 243 EGSYFEIVTNNNYEPNDQVFISYGSHDNKKLLLEYGFVLADNPNDVVAI-----TKGHLY 297
Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 252
++ +Q+N + + EK+ ISD + G + + V+S
Sbjct: 298 ---KLNSQQNDTVLYFATKLSFLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346
>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
Length = 1233
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 114/293 (38%), Gaps = 58/293 (19%)
Query: 29 AVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 87
A +P+LW + E L GSP E R + +++E+ ++ + + YP +
Sbjct: 929 ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQDIAAIAAQTSGGPEAYPAVVYN 988
Query: 88 EAFTFEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDA 144
E QAF+ S V+ H + A+ FAL+PL L S A+L +A
Sbjct: 989 E--------QAFLEAMSVVLAHAAYLPKAQCFALLPLVGGLCRTGSSSGALLDYDLEREA 1040
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
V +V R G+ + ++C L + A+L D Y KR
Sbjct: 1041 VTVVAQR--TPGQEVALYC-----------------------LFMAASLVAADRLYTTKR 1075
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 264
+ + G F + ++ A ++ + E ++I SP
Sbjct: 1076 EILEELGLGVKAEFPIF--EDRLATQQLINF----------EQDTII---------SPEN 1114
Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
E +L L + R+ Y +D L +L P++R+A QL EK++L
Sbjct: 1115 EYEILQLLMGDLRDRIQAYATEFDDDIKDLQRTDLTPRQRLAAQLRLGEKRIL 1167
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 22/171 (12%)
Query: 28 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 87
LA PL WS L P +A L RA+ K + W + F
Sbjct: 104 LATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP-------- 148
Query: 88 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVD 142
A + F+ A++ + + + AR R L P+ L +++ +A
Sbjct: 149 -ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAFTP 206
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
+ + DR Y GE +++ G N LL+ YGFV E N +D + ++ A+
Sbjct: 207 ASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257
>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 356
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 69/168 (41%), Gaps = 22/168 (13%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
P W L G T +LE+ + R++ L + YPY +P+E +
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150
Query: 92 FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 148
F V+ ++ Q+ L R A++P+ S CK A ++ +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205
Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
DR Y G+ + G N LL YGF+ ++N DR + +++E
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 88/234 (37%), Gaps = 44/234 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE++ G S PY+ L + +PL WSE +L L E+ + EG+
Sbjct: 157 LYERQLGPASKLAPYVAALP-------VDFSTPLSWSEAQLQALCYPQLIREVATQREGL 209
Query: 61 KREYNEL------------DTVWFMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCV 106
KR + EL D +W + + + PY PT + F + +
Sbjct: 210 KRLHAELAVSTPGTPITEQDLIWALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASI 269
Query: 107 VHLQKVSLARRFALVPL-GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKA----- 155
++ A AL L +L+ K AM VD VQ V+ Y A
Sbjct: 270 TVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSV 329
Query: 156 --------GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV----EAALNTED 197
GE + + G Q N LL YGFV+ P+D + AAL D
Sbjct: 330 RCQSYFSKGEQVFISYGKQSNDSLLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 83/191 (43%), Gaps = 31/191 (16%)
Query: 12 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
W PYI L R L ++ LL L Y +I E A K + ++LDT +
Sbjct: 107 WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLDTDY 152
Query: 72 FMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RFALVPL 123
+ Y+ IP + IFK A++ V + + + ++++ + ++ L
Sbjct: 153 AFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPIITL 207
Query: 124 GPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
P L + A ++A D V+ + PYK G + + GP N+ LL YGF
Sbjct: 208 APFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINYGPHDNNFLLAEYGFAI 267
Query: 180 EDNPYDRLVVE 190
NP++ +V++
Sbjct: 268 LKNPFNHVVLD 278
>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
Length = 195
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTED 197
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 14 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 80/187 (42%), Gaps = 16/187 (8%)
Query: 41 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
LA L G+ EI++ + ++++Y+EL + L +P + T++ F A
Sbjct: 235 LAALEGTLLFDEIIQARQHLRQQYDELFPL------LCTNFPEIFRKDVCTWDDFLWACE 288
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGE 157
S + + S LVP+ L S VD+A + + + RP AGE
Sbjct: 289 LWYSNSMMIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGE 348
Query: 158 SIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 216
+ G P S LL YGF+ DNPYD + ++ D D+ + AQ + S Q
Sbjct: 349 QCFLSYGKHPGSHLLTFYGFLPRGDNPYDVIPLDL-----DTSADDEDITAQSSATTS-Q 402
Query: 217 VFHVHAG 223
H+ G
Sbjct: 403 TTHMVRG 409
>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
Length = 504
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 85/193 (44%), Gaps = 20/193 (10%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+VPL L A + + A L D A+ + +P K G+ I G P S LL YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSDLLRRYGYV 289
Query: 179 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
DN VVE L N E +Y +++ + ++ + + +++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILEDGYCILRPSAEDS 346
Query: 229 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 284
++D+LP L + E+Q + S P P+ E + L D K++L+ Y
Sbjct: 347 LTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI---LLDSVKSKLSQYG 403
Query: 285 ATLSEDEAMLTDY 297
T+ +D+A+L +
Sbjct: 404 TTVEQDKAILQQF 416
>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3-like [Glycine max]
Length = 449
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 193
D +++V + K + +++ G N L++YGFV NPYD L + AL
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291
Query: 194 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 246
N P + +++Q N V G ++ +L LR+ ++
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351
Query: 247 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
MQ S++ SL P+ E AV L L +P + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403
>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
Y34]
Length = 464
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 3/129 (2%)
Query: 77 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 136
L Q+ P E FT E +K A V S + L P +L +S K
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204
Query: 137 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 194
A + + ++ + Y+AG+ + ++ GP NS+LL YGFV N D + A +
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSNDNYDLVLATH 264
Query: 195 TEDPQYQDK 203
E P + K
Sbjct: 265 PEAPFFARK 273
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 119 ALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
A+VPL +L +S S+C A+ + + +++V RP + GE I + G N L I YG
Sbjct: 186 AIVPL-IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYG 244
Query: 177 FVDEDNPYDRLVVEAALNT 195
F +DN D+ VE +L +
Sbjct: 245 FYLKDNICDK--VEISLGS 261
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 70/164 (42%), Gaps = 40/164 (24%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M + G + W Y L AV+S ++WS+ EL L GS + + R +
Sbjct: 54 MQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRAVFRRDLC 106
Query: 61 KREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
KREY+ L F A L + P EA++F++F+ A+ V +AR F
Sbjct: 107 KREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV-----------MARAFV 149
Query: 120 LVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 157
L L +C A+L +D DA + VV+R A E
Sbjct: 150 LPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 200
+A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 257 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 316
Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 260
+ ++GKLS ++ LRL + + +SV+ + +
Sbjct: 317 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 361
Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
S E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 362 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 57/263 (21%), Positives = 108/263 (41%), Gaps = 26/263 (9%)
Query: 31 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 90
S + +++ EL GS A I+ +Y +L LF ++ P++ F
Sbjct: 106 SSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARLFGRHRDVFPSDKF 159
Query: 91 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQL 147
T + +K A V S + K+ L+ +L +S +C + + +
Sbjct: 160 TIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQCHVYDPQSGN-LSI 217
Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 207
+ + Y+ G+ + + GP PN++L YGFV NP D + + + P ++ K +
Sbjct: 218 LAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPNDSYDLVLSTHPMAPFFEQKHKLW 277
Query: 208 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPIC--PV 260
G S + ++D LP YLR+ +++T ++ +V + + +
Sbjct: 278 IAAGLDSTSTVSL-------TLTDPLPRSVLRYLRIQRLNET-DLAAVGTRQSDVAFEKI 329
Query: 261 SPCMERAVLDQLADYFKARLAGY 283
S E VL L + A L G+
Sbjct: 330 SDSNETEVLTFLVESISALLDGF 352
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 94/236 (39%), Gaps = 30/236 (12%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
A+VP+ L A A L + A+++ R + G+ I G PNS LL YG V
Sbjct: 261 AMVPMADLLNARYGSSNAKLFYEEHALEMRTTRFIRRGDQIWNTYGDPPNSDLLRRYGHV 320
Query: 179 DE--------DNPYDRLVVEAALNTEDPQYQDKRM-VAQR------NGKLSVQVFHVHAG 223
D NP D + V A L T+ + VA+R G V V
Sbjct: 321 DLVPLAQGGLGNPADVVEVRADLVTDVVSASGSSIPVAERIDWYLEMGGDDVFVLETDLD 380
Query: 224 REKEAIS--DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
+ I +L L + T E + P P M+ AV+ L + RL
Sbjct: 381 IPEPLIVLVRLLQLLEPDW-EKTREKEK---------PPKPKMDGAVIGVLVEVLHRRLK 430
Query: 282 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 337
YP T+ EDEA+L + + +VRM +K++ LQ T + + L +V
Sbjct: 431 EYPTTIDEDEALLYKEDTLSINKKNAIIVRMGEKII---LQRTLESLQRLASQSVG 483
>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
Length = 319
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
+ P +L Y K V +++ + RP KAGE + G P S L+ YGF+ DN
Sbjct: 166 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 220
Query: 183 PYDR--LVVEAALNTED 197
PYD L ++ +++ ED
Sbjct: 221 PYDVIPLDLDTSVDEED 237
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 64/143 (44%), Gaps = 14/143 (9%)
Query: 81 YPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPL-LAYSSKCKA 136
+P D+ + F ++ FK A+ + + V H Q+ +R L P L L +
Sbjct: 184 FPKDVAS-VFNYQAFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQ 242
Query: 137 MLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 192
+ A ++ + V + + E + + GP N +LL+ YGFV DNP+ + V
Sbjct: 243 VKAGFNEKTKCYEITTVTQCHHYNEVFICY-GPHDNQRLLLEYGFVSRDNPHSSVYVGTD 301
Query: 193 LNTEDPQYQDKRMVAQRNGKLSV 215
++ +DK QR KLS+
Sbjct: 302 TLLKNVFPEDK----QRPKKLSI 320
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 28/46 (60%)
Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 192
LV D P AG+ + GP+ N + L+ YGF+ +NP D +V++ A
Sbjct: 225 LVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 35/206 (16%)
Query: 3 EKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-EGI 60
++ G KS W ++ L DR+ G ++ PL WS+ + LT P +L++ +
Sbjct: 92 QQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--PEGIVVLKKTLDTF 145
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 119
+ +Y++ T F+A +Y D P +A+ + A++ V S C+ ++ ++ A
Sbjct: 146 EADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCLYFDLTLTTGKKDA 192
Query: 120 L-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
+ L P L+ +S + C+ +++ + L R Y A E I + GP+
Sbjct: 193 QEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYTADEEIFLCYGPRS 251
Query: 168 NSKLLINYGFVDEDNPYDRLVVEAAL 193
NS LL YGF +NP+D + + AL
Sbjct: 252 NSVLLCEYGFTVPENPWDDVDISDAL 277
>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
Length = 712
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
RP +AG+ + + GP PN KLL YGFV NP+D +V L + + +++ A
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501
Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 251
L ++ H+ ++ +L LRL V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 86/425 (20%), Positives = 154/425 (36%), Gaps = 128/425 (30%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YE G+KS W YI L L V+ P+LW + L G+ + I + I
Sbjct: 130 IYEASIGEKSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILI 182
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 118
Y ++ + L + +P E F+ EIF + + +V + +
Sbjct: 183 NHAYADI-----VESLLIKNHP-----EYFSKEIFSFENFKIANSIVSSRAFCIDSYHGD 232
Query: 119 ALVPLG-----------------------------------PPLLAYSSK---------- 133
+LVPL PL+ S+K
Sbjct: 233 SLVPLADIFNHKTGRENVHIESNGDVCNKCGSIKTCKHRKVTPLITKSAKSYKKLTNKKK 292
Query: 134 -----------------CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
C + D+ + + V + KA + + G N+ LL YG
Sbjct: 293 MELIEKQKQQQINDEENCGDIAEEDDEHLYIKVVKAVKANQEVYNTYGDHSNATLLSKYG 352
Query: 177 FVDEDNPYDRLVVEAAL-----------NTEDPQYQDKRM--------VAQRNGKLSVQV 217
F++ DNP D L VE +L N D KR+ + RN S+++
Sbjct: 353 FIEMDNPCDNLPVEKSLVDTNLISLCKENGFDSNELSKRISFYASLFDIDSRNTH-SIEI 411
Query: 218 FHVHAGREKEA-----------ISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCM 264
+GR +A +S+ +L++ +++ L I + +
Sbjct: 412 ----SGRLDDALVCSVGIALAPLSEFEGWLKMS----EHKLEKYFEKLEAEDIVKQNAQV 463
Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDY--NLHPKKRVATQLVRMEKKMLNACLQ 322
++A++ L + +L+ YP TL +D+ L + N +K ++T L EKK++ ++
Sbjct: 464 KKAIVQILNN----KLSNYPTTLEQDQNKLKELKENEENRKIISTSLNICEKKLIYKSIK 519
Query: 323 VTADM 327
D+
Sbjct: 520 YYEDL 524
>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
Length = 437
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 197
+++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355
>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
rotundata]
Length = 277
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---DPQYQDKRMVA 207
R +K G+ I + GP+PNS ++ GFV D+ +D L E DP ++R +
Sbjct: 80 RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139
Query: 208 QRNGKLSVQVFHVHAGRE 225
++ F V++GRE
Sbjct: 140 KKLHLQPWSEFVVNSGRE 157
>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
Length = 510
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 34/66 (51%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
++VPL L A + KC A L ++++ R +GE + G NS+LL YG+V
Sbjct: 238 SMVPLADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYV 297
Query: 179 DEDNPY 184
+ D Y
Sbjct: 298 EWDGSY 303
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 7/188 (3%)
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+ ++ + Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSLGPICPVSP 262
+ G S + A + ++L YLR + +D ++M + + G V+
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRLDAADVADMTLQLLN-GTDGKVND 355
Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 320
E VL L D + L G+ L + EA L Y A Q+ E+ +L
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415
Query: 321 LQVTADMI 328
+ DM+
Sbjct: 416 KKTAEDML 423
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 79/168 (47%), Gaps = 15/168 (8%)
Query: 164 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 223
G + +++ L+ GFV NP++ + + ++ D + + + S + H+
Sbjct: 63 GKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLILHITG- 121
Query: 224 REKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVLDQLAD 274
+ ++SD + + R+ +V D+ ++ + +S P+CP P ++A+ L
Sbjct: 122 -DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA-FLIM 178
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
F+ ++ Y +SEDE NL P +R +L E ++L +C++
Sbjct: 179 RFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 73/185 (39%), Gaps = 23/185 (12%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNE 66
KS W +I L + ++S + WS E L P K +ILER +++ YN
Sbjct: 265 KSMWREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA 315
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGP 125
L +P+ +++E+FK A++ V S + + L PL
Sbjct: 316 ------TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVD 369
Query: 126 PLLAYSSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
L + L + V L R K GE +V G PN +LL+ +G
Sbjct: 370 LLHHDPVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGELPNHQLLLRFGLAMP 429
Query: 181 DNPYD 185
NPY+
Sbjct: 430 RNPYE 434
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 77/196 (39%), Gaps = 24/196 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M EK + +KSFW PYI L + P ++E + L S + + +
Sbjct: 103 MCEKYRREKSFWRPYIDILPEE-------YSCPTFFTEDDFRLLPNS-LRGKAKAKKYEC 154
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV----------HLQ 110
+EY EL + M LF +AF F+ FK A+ A+++ + HL+
Sbjct: 155 HKEYKELAPFFKMLADLFPD-----QEDAFNFKDFKWAWSAIKTRALDVPIGRESCRHLR 209
Query: 111 KVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
+ PL + A +K + ++ + Y+ ++ G N
Sbjct: 210 DAEDTPTPTMFPLVDSINHAAQAKIRHRYNEKSRCLESRTETVYRRHAEVMNSYGRADND 269
Query: 170 KLLINYGFVDEDNPYD 185
LL+ +GFV NP D
Sbjct: 270 NLLLEFGFVVPGNPED 285
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 108/266 (40%), Gaps = 46/266 (17%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+ A G
Sbjct: 133 LKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAGY 181
Query: 60 ---IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHLQKVSL 114
++ Y EL +F +Y + AF F+ AF ++S + L ++
Sbjct: 182 DAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGANV 235
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGESI 159
A LVP G L+ +SS A A + DR Y G I
Sbjct: 236 A----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEI 290
Query: 160 VVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
V P+ +SK ++YGF+D NP + ++ +D DK V + G F
Sbjct: 291 FVNYDPEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTF 350
Query: 219 HVHAGREKEAISDMLPYLRLGYVSDT 244
+ + + ++ +LRL + DT
Sbjct: 351 TLRPYSDPD--RELRTFLRLLHCKDT 374
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 83/194 (42%), Gaps = 31/194 (15%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+ K+S+W PY L + S LL ++L S K E +++ + I +Y
Sbjct: 51 RDKQSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY- 102
Query: 66 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 125
V + +Q P DI T E FK A++ V + +H+ + + L P
Sbjct: 103 ----VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAP 153
Query: 126 PL--LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYG 176
L L ++++ K ++ + R YK GE + + GP N +L YG
Sbjct: 154 MLDFLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYG 208
Query: 177 FVDEDNPYDRLVVE 190
FV +N Y+ ++++
Sbjct: 209 FVLNENIYNFVLLD 222
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 37/67 (55%)
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+ ++ + Y+A + + ++ GP PNS+LL YGFV DNP D + + + P Y+ K+
Sbjct: 216 LSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHPLAPFYEQKQ 275
Query: 205 MVAQRNG 211
+ G
Sbjct: 276 KLWASAG 282
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 94/235 (40%), Gaps = 46/235 (19%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW----SETELAYLTGSPTKAEILER 56
M E+++G SFW Y+R L V++PL W +E E L G T +L R
Sbjct: 90 MAERRKGDGSFWKQYLRTLPDD-------VDTPLRWLVEQAEEEFRLLDG--TMVGLLSR 140
Query: 57 --AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK--- 111
+++++ E L + +P + TFE + A ++ S Q+
Sbjct: 141 MMHSQVRKDWEEFHL------PLVEAHPEILG--GVTFEDYLWAMSSIWSRSFDYQEPGP 192
Query: 112 -VSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDD------AVQLVVDR 151
S R A+VP+ ++ + ++ + + + +++ R
Sbjct: 193 DDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGGLSMGIGEPGRARGTLRVSAGR 252
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
Y A E + G N+KLL +YGFV NPY L + DP + K+ +
Sbjct: 253 DYAAREQFFILYGRYSNAKLLYSYGFVLASNPYGGLDYWVRVPQTDPGFAWKQAL 307
>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
Length = 494
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A +SKC A L +++++ R + E + G PNS++L YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279
Query: 179 DED-NPYD 185
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
Length = 559
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
A + L + + L G+ E+++ E ++++YNEL F A L +P P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 140
+++E F A S + + R LVP L P +L Y + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 197
D+++ + RP +AGE + G S L+ YGF+ E DN D + ++ +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425
>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
Length = 559
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
A + L + + L G+ E+++ E ++++YNEL F A L +P P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 140
+++E F A S + + R LVP L P +L Y + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 197
D+++ + RP +AGE + G S L+ YGF+ E DN D + ++ +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425
>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
Length = 447
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 200
+A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272
Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 260
+ ++GKLS ++ LRL + + +SV+ + +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317
Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
S E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371
>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 465
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 77/204 (37%), Gaps = 47/204 (23%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ G KS + Y R L R A W++ E +YL G+ + + + A K
Sbjct: 106 ERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLAGTEVETSLRDEAAAAKN 159
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
E+ + +F+++ + +FE F +A V S R F L P
Sbjct: 160 EWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSS-----------RAFTLSP 197
Query: 123 -----LGPPLLAYSSKCKAMLAAVDDA--------------VQLVVDRPYKAGESIVVWC 163
L P A++ V D V++ ++ + G+ I
Sbjct: 198 NAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVRRGDEIFNTY 257
Query: 164 GPQPNSKLLINYGFVDEDNPYDRL 187
G N+KLL +YGF DNP D +
Sbjct: 258 GFHGNAKLLNSYGFTQNDNPADEV 281
>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
Length = 471
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)
Query: 139 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 188
A + V++V + K + +++ G N L++YGFV E NPYD +
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311
Query: 189 VEAALNTEDPQYQD-----KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 243
A P++ ++++Q N + V G + +L +R+ +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371
Query: 244 TSEMQS----VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
E++ + SL I P+ E AV + L+ +P + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426
>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
Length = 573
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%)
Query: 132 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
SK + L DD V +V R GE I + GP PN+ LL GF DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388
>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 514
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
G KS + P++R L R V+S + W+E EL+ L+GS + EY+
Sbjct: 144 GSKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGSNVVVFTRAIKAQWRMEYDA 196
Query: 67 LDTVWFMAGSLFQQYPYDIPTE---AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
LD +L +++P E +TF+ F A + S + L S + +
Sbjct: 197 LDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIRV 249
Query: 124 GPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
PLL A K + A +AV++ ++ + +P+ L+ YGF+
Sbjct: 250 LVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGFI 309
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE------AIS-D 231
E NP + VEA + D + R+ K + H +++ +I D
Sbjct: 310 PETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLDPKKRNFEWKPRSIDYD 360
Query: 232 MLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-LDQLADYFKARLAGYPA 285
+L R+ D +EM S S + + +AV L LA + L Y
Sbjct: 361 LLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLLKSLASF----LESYTT 415
Query: 286 TLSEDEAMLT----DYNLHP----KKRVATQLVRMEKKMLNACLQVTADMIML-LPD 333
TL+ED + + N P +KR A L+RM +K + L +AD + LPD
Sbjct: 416 TLAEDNEYVARVDDESNDEPLPGKRKRFAV-LLRMREKQI---LLASADALFKELPD 468
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 67/154 (43%), Gaps = 6/154 (3%)
Query: 37 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 96
S+ ++A L G+P + + RA E E + F SL YP E F++E +
Sbjct: 276 SQEDVALLEGTPLHGDAV-RARQHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYL 332
Query: 97 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPY 153
A S + +Q S R L P + + + VD +++ RP
Sbjct: 333 WAAELWYSYGIQVQFASGDIRTCLAPYLGLMNHHPLPHVVHFSKVDPETGCLRVRAFRPC 392
Query: 154 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
+AG + + GP N+KLL+ YGF DNP D +
Sbjct: 393 EAGNQLFLSYGPYSNAKLLLFYGFAVRDNPADEV 426
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 80/184 (43%), Gaps = 31/184 (16%)
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 208
G I + GP N +LL YGFV E N +DRL E + E + +R +VA
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
L+ + + H GR A ++ LR+ ++++ E + + GP + E V
Sbjct: 419 YGLHLAGRCWICHDGRPPPA---LVALLRV-HLAEAEEFDT-MERKGPFASLGAGTEARV 473
Query: 269 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 313
+ +AD + L + +L EDE +L D N P KR+A L+RM
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532
Query: 314 KKML 317
K +
Sbjct: 533 MKRI 536
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)
Query: 164 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 219
GP N +LL YGFV E+N +DRL ++ +R +VA L+ + +
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429
Query: 220 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
H GR A ++ LR+ ++++ E ++ + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484
Query: 280 LAGYPATLSEDEAMLTD 296
L + +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 42/205 (20%), Positives = 81/205 (39%), Gaps = 33/205 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ ++ + +FW PY+R L + R + ++ +L G+ + E+ + I
Sbjct: 90 LIHQRAAQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQI 138
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVVHLQKVSL 114
+ ++ + +L +P P + FT++ F F A Q+ V +
Sbjct: 139 RESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQQPTATTS 192
Query: 115 ARR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 164
A + L+ + LL Y K L ++ V + + G G
Sbjct: 193 AHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGTEAFNNYG 252
Query: 165 PQPNSKLLINYGFVDEDNPYDRLVV 189
P+ N +LL+ YGF EDN D +++
Sbjct: 253 PKGNEELLMGYGFCIEDNEQDYVMI 277
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 41.2 bits (95), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L + + C A L + +++ +P +AGE + G PNS++L YG+V
Sbjct: 221 SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEHPNSEILRRYGYV 280
Query: 179 D 179
+
Sbjct: 281 E 281
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 143/369 (38%), Gaps = 67/369 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKA 51
MYE G +S W Y + L R+ ++ + W+ +EL L GS
Sbjct: 100 MYEYLLGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADE 152
Query: 52 EILERAEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAF 90
ILE I R + L + +A GSL Y +DI P +
Sbjct: 153 SILEMIAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDED 212
Query: 91 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 150
+ ++ + ++ L++ +VPL L A + + A L ++A+ +
Sbjct: 213 EEGDGEGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAI 264
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQD 202
+P GE I G P + LL YG+V D PYD + V A L+ DP+ Q
Sbjct: 265 KPIAKGEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGLSDSDPEKQP 324
Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVISSLGPICPVS 261
L +E + ++D+LP L + + T + + P
Sbjct: 325 PLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQQRSKNKPPK 384
Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLT------------DYNLHPKKRVATQL 309
P A LA + + A Y T+++D+ +L+ D + H ++++A Q+
Sbjct: 385 PAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLNSSEVSSGFVDESTH-RRKMAVQV 443
Query: 310 VRMEKKMLN 318
EK++L+
Sbjct: 444 RIGEKEILH 452
>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 377
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 2/76 (2%)
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
A R ALVP+ + C A D + DR Y G+ I + G N LL
Sbjct: 177 ADRLALVPIADLFNHADTGCGASFTP--DGFVVSTDRKYHVGQEIYISYGTHTNDLLLAE 234
Query: 175 YGFVDEDNPYDRLVVE 190
YGFV N +D+ ++
Sbjct: 235 YGFVPMANRWDKTCLD 250
>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
Length = 177
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
L S CK + AV +++++ R KAG + G PN LL YGFV E+NP+D
Sbjct: 54 FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111
Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
+ VE E P+ K + +R +S F + R
Sbjct: 112 IPVE----LEVPESPAKVALMERYNLVSHISFELRGFR 145
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 12/111 (10%)
Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
++++ + AL P+ L C+ + + DR YK GE + + G N
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251
Query: 170 KLLINYGFVDEDNPYDRL----VVEAALNTEDPQYQDKR------MVAQRN 210
L++ YGF E+N +D + VV ++T ++ D R ++ +RN
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEEMSTARKKWLDGRDFLGKYLIDERN 302
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)
Query: 42 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 101
A L GSP AE + + + + SL + YP F++E + A
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAFE---SLLKAYPDYFQPHWFSWESYLWAAEL 220
Query: 102 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 158
S + +Q + R LVP + + + VD A +++ RP G
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280
Query: 159 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
+ + GP PNSKLL+ YGF DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 102/276 (36%), Gaps = 46/276 (16%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER-AEG 59
+YE +G+ S W PY + L + + W++ ELA L S +I R AE
Sbjct: 103 IYEYLRGESSPWHPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGRRQAEE 155
Query: 60 ------------------IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 101
+ +L + MAGSL Y +DI + + A
Sbjct: 156 EWQNTIIPTMADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDNDKDGA 215
Query: 102 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 161
+ + +VP L A + K A L D + + +P AGE I
Sbjct: 216 DSADDEFEEDDEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAGEQIFN 275
Query: 162 WCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
GP P S LL YG+V D YD + L E VA ++ K QV+
Sbjct: 276 DYGPLPRSDLLRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKDQVW-- 324
Query: 221 HAGREKEAISDMLPYLRLGYV-----SDTSEMQSVI 251
RE+E D L L GY DT +Q V+
Sbjct: 325 ---REREQQLDELGVLDDGYAITRPEYDTQGLQDVL 357
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 88/206 (42%), Gaps = 19/206 (9%)
Query: 155 AGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
A + + G ++KLL +YGFV +E N ++ + + + DP ++ K+ + + N
Sbjct: 238 ANDPFYISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAA 297
Query: 213 LSVQVFHVHAGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVL 269
Q + + + + L R+ +S + E + S VS E AV
Sbjct: 298 TRDQTYDFRGTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVRNELAVY 353
Query: 270 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACLQVTADM 327
+ D + RLA +P +L +D L + ++ R + VRME KK+L + ++ +
Sbjct: 354 AAIHDVCEKRLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEW 413
Query: 328 IMLL--------PDVTVSPCPAPYAP 345
LL PDVT P P
Sbjct: 414 RNLLENDSNIYPPDVTRQQQPQLSMP 439
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 79/347 (22%), Positives = 142/347 (40%), Gaps = 46/347 (13%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
++E+ G+KS W Y+ L Q +++PL WS E+A LTG+ ++L+ A G
Sbjct: 140 LHERSIGEKSRWAAYVNALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY 188
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 119
+ T + S F P P++AF F AF ++S C V A
Sbjct: 189 --DSYARGTWARLKESAFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVDQGADIA 243
Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV------------------ 161
LVP +A S + +++ V K+G S+++
Sbjct: 244 LVP--GLDMANHSGLSSQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAKGAEVFM 301
Query: 162 -WCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
+ + +++L ++YGF D V+ A+ DP DK V + G F
Sbjct: 302 NYGQRKIDNQLALDYGFTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFV 361
Query: 220 VHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLAD 274
+ A + E ++ ++RL + D ++++ + G I PVS E+ + +
Sbjct: 362 LRAFEDPE--PELRVFMRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMIN 419
Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
+ L GY + +D + D + + R+A ++ EK+ L L
Sbjct: 420 GCEEALRGYATRVEDDRRVAEDPGVGHRLRLAARVRMGEKQALADAL 466
>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
Length = 486
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 142 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298
>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
Length = 491
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 50/236 (21%), Positives = 88/236 (37%), Gaps = 58/236 (24%)
Query: 1 MYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAE 58
MYE+ +G S W Y++ L DR+ PL+W E L G+ + + E
Sbjct: 88 MYERARGTDSPWDAYLQLLPDRE--------SVPLVWPADEAECLLAGTELDKIVKQDRE 139
Query: 59 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
+ ++ E ++G L D+ + F+ E + A V S + F
Sbjct: 140 FLCEDWKECIEPLLLSGEL------DVDPDDFSLEKYFSAKTLVSSRSFQIDSY---HGF 190
Query: 119 ALVPLGPPLLAYSSKCKAML--------------------------------------AA 140
+VPL L + + C+ + +
Sbjct: 191 GMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEDADDDQSDASADDESTIENPTSSSPGS 249
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 250 KDEDLEMIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 305
>gi|145354661|ref|XP_001421597.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581835|gb|ABO99890.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 341
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 126/320 (39%), Gaps = 30/320 (9%)
Query: 21 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG-----IKREYN-ELDTVWFMA 74
R+R +G ++ +P + S E T R EG REY + + W A
Sbjct: 2 RERAKGGVSAYAPFVESLYEHTPARAVETSRAARARLEGHAAAETMREYERDAEDGWRAA 61
Query: 75 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 134
F+ +P FT F++A V++ + R ALVP+ LL +
Sbjct: 62 RRTFETFPSIFSVHEFTRAAFEEALAIVRANSFEARSEDGTRARALVPMAHLLLHDTGSE 121
Query: 135 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG----FVDEDNPYDRLVVE 190
+ VD + VD ++ G+ + G +++ +G + E N ++ ++
Sbjct: 122 VPCVKIVDGVFVINVD-EHEEGDELSCSHGDYSDAETFARFGVSAFYSAEKNARNK--IK 178
Query: 191 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 250
A +E Y K + R G + F AG A + + LRL ++T E ++
Sbjct: 179 FAFPSE--IYSMKSL--DRCGSVENIAF-TDAG----ATEEFMCALRLASANET-EWAAI 228
Query: 251 ISSLGPI-----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKK 303
S + P+S E AV + L L YP++ +EDE +L L P +
Sbjct: 229 SKSKASVRALRKKPLSEESEIAVYEALFATLTELLNSYPSSDNEDERLLQSRTLQSAPDE 288
Query: 304 RVATQLVRMEKKMLNACLQV 323
A + EK++ + L
Sbjct: 289 ERAVTIRLREKRLALSSLNA 308
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 44/194 (22%), Positives = 87/194 (44%), Gaps = 19/194 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ ++S W PY+ L + P+ W E+E+ L +P + + LE+ +K
Sbjct: 117 ERVARERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFA 119
+ E W SL + +I T+ +T+ + A+ V + V H ++ L+ +
Sbjct: 169 LHTE---SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQD 224
Query: 120 LVPLGPPL-LAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
+ + P L L S + AA ++ ++ + + + + GP N +LL+ Y
Sbjct: 225 VYAMAPYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEY 284
Query: 176 GFVDEDNPYDRLVV 189
GF+ +NP+ + V
Sbjct: 285 GFIAANNPHRSVYV 298
>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
[Brachypodium distachyon]
Length = 495
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
D+ +++++ R AGE + G N+ LL YGF + DNPYD + ++ L T+
Sbjct: 251 DEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFTELDNPYDIVNIDLTLVTK 305
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
K++ A +A+ P S C+ A ++ DR Y+AGE + V GP N
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247
Query: 171 LLINYGFVDEDNPYD 185
LL+ YGF+ + N D
Sbjct: 248 LLVEYGFLLDANSND 262
>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
Length = 619
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 79/193 (40%), Gaps = 18/193 (9%)
Query: 6 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
+ +KS PY+ L R+ + L + E E+A L + + + +K+ Y
Sbjct: 120 EKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYKSVQSIRQNLKQIY- 171
Query: 66 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-VSLARRFALVP-L 123
E + M +YP + F++E F AF AV S V ++ +VP L
Sbjct: 172 ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEYPAENGEGVEIVPTL 226
Query: 124 GPPLLAYSSKCKAMLA---AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
P + + K A + D L K+G+ + G + N L++YGFV
Sbjct: 227 LPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAKSNDSFLLSYGFVIP 286
Query: 181 DNPYDRLVVEAAL 193
+N D L V+ +
Sbjct: 287 NNSEDTLYVQFGI 299
>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 498
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 85/186 (45%), Gaps = 11/186 (5%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
++ P ++E EL L G+ + + + +++E+ L Q+Y +D T
Sbjct: 144 SIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS-TEGIPWCQKYWWDEDTG 202
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLLAYSSKCKAMLAAVDDAV 145
TF+ +K +S VV L + A+VP + S K + ++ +DA+
Sbjct: 203 RLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHASEDSVKARYDESSTEDAL 258
Query: 146 -QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQD 202
QL R +GE + + G + P S+++ +YGFV +E ++ ++ + +DP
Sbjct: 259 LQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTDAKQIFLDLEIPDDDPLRMA 318
Query: 203 KRMVAQ 208
K+M +
Sbjct: 319 KQMFCK 324
>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
strain 10D]
Length = 576
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 127 LLAYSSKCKAMLAA--VDDAVQLVV-DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 183
L +SS+ ++ +A DA L + +R AG+ + + G N +LL YGFV+EDNP
Sbjct: 411 LFNHSSRVQSKVAYEYFYDAFSLSISNRDTHAGDQVFISYGTLTNDELLALYGFVEEDNP 470
Query: 184 YD 185
+D
Sbjct: 471 HD 472
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)
Query: 146 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 199
L D Y+AGE I + G N+KLL+ YGF NP D VE + T DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333
Query: 200 YQDKRMVAQR-----NGKLSVQVFHV 220
+ R+ ++ + +L Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 88/199 (44%), Gaps = 21/199 (10%)
Query: 151 RPYKAGESIVV-WCGPQ--PNSKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQ 201
RP G+ ++ + G + N +L+++YG +NP D + + E A+ +
Sbjct: 282 RPVAEGQELLTPYGGAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRL 341
Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPIC 258
M R +L + V KE +L + R+ YVS D S+++ V+ +
Sbjct: 342 LMAMSLDRFDRLQLPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHR 396
Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EK 314
++P ER L+ L + Y T+ EDE ML + + P +V R+ EK
Sbjct: 397 AINPSNERRALELLLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEK 456
Query: 315 KMLNACLQVTADMIMLLPD 333
++L++ Q+ I LP+
Sbjct: 457 RILSSLWQLLDSAIEALPE 475
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
R A++P+ C+A A+ + + DR Y+ GE + + G LL YG
Sbjct: 179 RLAILPVADLFNHADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYG 236
Query: 177 FVDEDNPYDRLVVEAAL 193
FV +N +D + ++ A+
Sbjct: 237 FVPTENCWDVVCLDEAI 253
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 40.8 bits (94), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 80/187 (42%), Gaps = 16/187 (8%)
Query: 41 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
LA L G+ EI++ + ++++Y+EL + L +P + T++ F A
Sbjct: 266 LAALEGTLLFDEIIQARQHLRQQYDELFPL------LCTNFPEMFRKDVCTWDDFLWACE 319
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGE 157
S + + S LVP+ L S VD+A + + + RP AGE
Sbjct: 320 LWYSNSMMIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGE 379
Query: 158 SIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 216
+ G P S L+ YGF+ DNPYD + ++ D D+ + AQ + S Q
Sbjct: 380 QCFLSYGKHPGSHLVTFYGFLPRGDNPYDVIPLDL-----DTSVDDEDIAAQSSATTS-Q 433
Query: 217 VFHVHAG 223
H+ G
Sbjct: 434 TTHMVRG 440
>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
Length = 699
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 142 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450
>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
Length = 577
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 87/222 (39%), Gaps = 58/222 (26%)
Query: 14 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 72
P+I L R G SP W+E E + + T A++ I N+L W+
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157
Query: 73 MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 116
+ L FQ Y Y + F FE F V+ S +L ++ +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217
Query: 117 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 155
F P L++ + +C+ ML + D + ++ K
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 197
G+ I GP+ N +LL YGFV ++N YD+ V+ AL +D
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL--ALRLQD 311
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
+ YK GE +++ G PN +LL+ YGFVD+ N D E L
Sbjct: 350 KSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLL 392
>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
Length = 685
Score = 40.8 bits (94), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
++ P+ L A A L +++ +P AGE I PNS LL YG
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427
Query: 178 VDEDNPYD------RLVVEAALN 194
VDE N D +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450
>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
Length = 524
Score = 40.8 bits (94), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 98/228 (42%), Gaps = 32/228 (14%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
A+VP+ L A A L ++ ++++ RP K GE I G PN++LL YG
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335
Query: 178 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 218
VD + NP D L+V A L+T+D + + + + F
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGDDVFDLYF 395
Query: 219 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
+ E ++ +RL + D E + + P P M+ L L + +
Sbjct: 396 DL------EIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445
Query: 279 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
RL YP ++ +DE +L T +L+ + + +L EKK+L+ L TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/168 (23%), Positives = 69/168 (41%), Gaps = 29/168 (17%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SF+ PY+ L V++P+ WS+ E L GSP ++ + R + E
Sbjct: 87 SFFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE--- 136
Query: 70 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 129
M +F +YP P F++E ++ A+ ++S + L+PL +
Sbjct: 137 ---MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG--------NYTLMPLIDLMNH 185
Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINY 175
+ A D + L+ R Y VW G + ++ LL+NY
Sbjct: 186 HPDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLLLNY 227
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
DDA+ + V + KAGE + G N+KLL +YGF DNP D++ +
Sbjct: 232 DDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNPADKVTI 279
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/142 (21%), Positives = 60/142 (42%), Gaps = 15/142 (10%)
Query: 54 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ--- 110
LE ++R E W F++ D+P + +T+ A++ V + + +
Sbjct: 120 LESQVTLERREKEFQDNW----DDFKEAFPDVPRDDYTY-----AWLVVNTRTFYHETPE 170
Query: 111 --KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
K R AL+P+ + C+ + + +V DR YK GE + + N
Sbjct: 171 TLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYHVVADRAYKKGEELFISYSSHSN 229
Query: 169 SKLLINYGFVDEDNPYDRLVVE 190
L+ YGF+ ++N D + ++
Sbjct: 230 DYNLLEYGFIPDENSLDDVYID 251
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 61/269 (22%), Positives = 108/269 (40%), Gaps = 41/269 (15%)
Query: 32 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
S + ++E EL G+ + E I+ +Y + +F Q+P +P +
Sbjct: 107 SSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR------VLVMRVFTQHPDLLPLAKIS 160
Query: 92 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---------LAYSSKCKAMLAAVD 142
+ +K A V S + ++P G PL + +S + K A
Sbjct: 161 IQDYKWALCTVWSRAMDF----------VLPNGKPLRVLAPFADMINHSPEVKQCHAYDP 210
Query: 143 DAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
+ L V + Y+ G+ I + G PN++LL YGFV +NP D + + + P Y
Sbjct: 211 SSGNLSVLAGKDYEIGDQIYISYGSIPNNRLLRLYGFVIPENPNDSYDLVLSTHPMAPFY 270
Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSL 254
+ K+ + G S + + D LP YLR+ + D S++ ++ + L
Sbjct: 271 EQKQKLWASAGLDSASTIPL-------TLIDPLPKSVLRYLRIQRL-DASDLAAIALQKL 322
Query: 255 GPICPVSPCMERAVLDQLADYFKARLAGY 283
+S E +L L + A L G+
Sbjct: 323 DTNEKISNSKEVEILQFLVESISALLDGF 351
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 51/221 (23%), Positives = 90/221 (40%), Gaps = 36/221 (16%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYN 65
G++SFW PYI+ L + A+ PLLW E++L +L G+ + + ++ + +KR
Sbjct: 121 GEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQKVDHVKR--- 175
Query: 66 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-----------------H 108
W A Q+Y +D P++ FT E+ A+ S H
Sbjct: 176 -----WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPDVANIKPEFQH 228
Query: 109 LQKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 166
L K + L P L L ++ + D + + + K G+ + +
Sbjct: 229 LVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKNIELKPGDPFYIAYDKE 288
Query: 167 P----NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
N+ LL ++GF+ DN LV+ + + P + D
Sbjct: 289 TQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHLDH 329
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 48/246 (19%)
Query: 7 GKKSF----WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
G ++F W YIR L R + P +W+ E L G+ +A + + +
Sbjct: 111 GTRAFASTPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSS 163
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
EY++L A +L + + +E+ T E + A +S + L + A+VP
Sbjct: 164 EYDKLCE---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP 214
Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFV 178
G + +S A D +++ RP AG I + G +P +++L +YGF+
Sbjct: 215 -GLDMANHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFI 273
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
D+D+ L + +DP L FH++ G P +RL
Sbjct: 274 DKDSTVKELTLHLEALPDDP--------------LGRAKFHIYKGP---------PTVRL 310
Query: 239 GYVSDT 244
++D
Sbjct: 311 SIINDN 316
>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
Length = 263
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 33/63 (52%)
Query: 134 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
C + D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL
Sbjct: 15 CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74
Query: 194 NTE 196
T+
Sbjct: 75 VTK 77
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
R +P CK +A+ +VQ DR Y GE + V GP N LL YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235
Query: 177 FVDEDNPYDRLVVEAAL 193
F+ + N +D + ++ +
Sbjct: 236 FILDTNRWDEVYLDEVI 252
>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 684
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 80/188 (42%), Gaps = 11/188 (5%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
+++ + K S W PY+ L R G+L S L + +L +L + + +K
Sbjct: 117 HQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQMYHDAVK 171
Query: 62 REYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
EY+ ++ G L + Y ++I A+T I +AF + + ++ K R+
Sbjct: 172 TEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPALRQDDE 229
Query: 121 VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+ PL+ S+ K A + L V P A E I GP N +L+ YGF
Sbjct: 230 FQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQLMATYGF 289
Query: 178 VDEDNPYD 185
DNP D
Sbjct: 290 CIVDNPCD 297
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
+ LV+ P G ++ GP+PN++L++ YGF +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L + + +C A L D +++ +P K GE + G PN+++L YG+V
Sbjct: 215 SMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNAEILRRYGYV 274
Query: 179 D 179
+
Sbjct: 275 E 275
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 229 ALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 288
Query: 179 DEDNPYDRLVVEAALNTED 197
D++N D + + L+ D
Sbjct: 289 DDNNLKDYVNIRVGLSLTD 307
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
V+L+ R +GE I + G N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317
>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
CCMP526]
Length = 265
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)
Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 211
YK GE + G + N++LL+ YGF DN ++ + + P +Q G
Sbjct: 18 YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77
Query: 212 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 261
+ GR + D+L L + T SE+ + +L C ++
Sbjct: 78 SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129
Query: 262 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 294
PC ER L + A LA +P ++ EDE L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172
>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
Length = 494
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 37/63 (58%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
++VPL L A + + A L +++++V +P K G+ + + G PN+++L YG+V
Sbjct: 226 SMVPLADTLNADTKQFNAHLVYDKESLKMVSVKPIKMGQQVYNFYGEHPNAEILRRYGYV 285
Query: 179 DED 181
+ D
Sbjct: 286 EWD 288
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
++ L+ G+ + GP+PNS+L+++YGF +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 63/263 (23%), Positives = 112/263 (42%), Gaps = 49/263 (18%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
+ E+++G++S + Y++ L + SPL W+ EL+ + G+ ++L+ A G
Sbjct: 147 LKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNAAGY 195
Query: 60 ---IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
++ Y L T +F ++ +D+ AF+ + F+ AF ++S
Sbjct: 196 DGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMAPCD 245
Query: 115 ARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPYKAG 156
ALVP G L+ +SS +A + AV DD V V DR
Sbjct: 246 GANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRALNVN 304
Query: 157 ESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
E + V P+ ++ +++GFVD P + ++ +DP DK V G
Sbjct: 305 EPLYVNYNPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKLDVLDVCGLGET 364
Query: 216 QVFHVHAGREKEAISDMLPYLRL 238
F + A + + D+ +LRL
Sbjct: 365 PTFTLRAYSDPD--PDLRTFLRL 385
>gi|453088140|gb|EMF16181.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 307
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 70/185 (37%), Gaps = 24/185 (12%)
Query: 33 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
PL WS +L S + LER RE E D + ++ + + +
Sbjct: 112 PLFWSPELCHHLPES--VKQPLERM----REDYEKDLTYMLS--------LNCDDQTWKE 157
Query: 93 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
E FK + V S H + L F L P + + ++ ++V DR
Sbjct: 158 EDFKYYWAIVNSRCFHFKPPGLKPGFMVLCPFIDYMNHGPTGTGVKVSQSPKGYEVVADR 217
Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 202
Y+ I+ G PN KLL++YGF D+D D +++ A Q QD
Sbjct: 218 DYEPNTEILATYGSHPNDKLLVHYGFCLSYKPNEPSDDDIRLDHILLPAMSANTKSQLQD 277
Query: 203 KRMVA 207
M+
Sbjct: 278 VGMLG 282
>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
Length = 440
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 52/232 (22%), Positives = 92/232 (39%), Gaps = 32/232 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + + P+ E E+ L P KA+ E+ +++
Sbjct: 117 EKHAGDQSVWKPYLDILPK-------SYTCPVCL-EPEVVNLLPEPLKAKAEEQRMSVQQ 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
+ + LF++ + F++ A+ V + V+L+ RR +
Sbjct: 169 FFASSRDFFSSLQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLS 220
Query: 123 LGPP---------LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
L P LL +S + KA ++ Y+ + + + GP N +L
Sbjct: 221 LEPDTCALAPYLDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFICYGPHDNHRL 280
Query: 172 LINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
L+ YGFV NP+ R ++ L + D Q K + + +G L F
Sbjct: 281 LLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLENLTF 332
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 115/297 (38%), Gaps = 25/297 (8%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YEK + SFW P+ L + + ++ TEL L G+ + E I
Sbjct: 1266 IYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQI 1313
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K + + F L QYP P FT E F A S + L K+ L
Sbjct: 1314 KEHLESIRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCL 1370
Query: 121 VPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP+ + + + +D +++ A I + G + +L + YGFV
Sbjct: 1371 VPMADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWELALYYGFV 1430
Query: 179 DEDNPYDRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
+N YD + + + ED P+ ++++ L+V ++H S +L LR
Sbjct: 1431 ISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN---IPSKLLASLR 1487
Query: 238 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
+ +++ E + PI S E VL L L + +T ED+ +L
Sbjct: 1488 VALLAE-DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSSTCDEDQQLL 1540
>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
Length = 1916
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
DA + R Y AG+ + G + N++L+ NYGF++ NP+D V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 47/97 (48%), Gaps = 13/97 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EKK G+KS W+PYI L + + S + W E EL+ + S E +++ I++
Sbjct: 124 EKKMGQKSRWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEK 178
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
+++ F+A + Q P I TE E F A+
Sbjct: 179 DFS------FVAQAFKQHCP--IVTERPDLEDFMYAY 207
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/95 (24%), Positives = 44/95 (46%), Gaps = 3/95 (3%)
Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKL 171
AR + L P + +C + + D A Q +KAGE ++ G + N+
Sbjct: 339 ARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECTAQEAFKAGEQFFIYYGDRSNADR 398
Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
L+++GF+D N D + + L+ DP + + ++
Sbjct: 399 LVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 79/180 (43%), Gaps = 23/180 (12%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQ 208
+ YK GE +++ G N +LL +YGF+ ++NP D+ + E + +D + Q
Sbjct: 257 KNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQ 316
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
NGK S +L LRL + + S+ +SV + +S E V
Sbjct: 317 -NGKPSFA---------------LLSALRL-WATPASQRRSVGHLVYSGTQLSSENEIFV 359
Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
++ +A L P ++ ED +L+ + ++A Q K++L C+ ++I
Sbjct: 360 MEWIAKSCHVVLENLPTSVEEDSLLLS----MERWKLAVQWRLRHKRILVDCISRCTEII 415
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 69/151 (45%), Gaps = 14/151 (9%)
Query: 41 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
+ L G+ EI++ + + +Y+EL F A L +P P E +T+E F A
Sbjct: 260 ITMLDGTLLLEEIMQARQHLHAQYDEL----FPA--LCNNFPDIFPPELYTWEKFLWACE 313
Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVD---DAVQLVVDRPYKA 155
S + + R L+PL L S C ++ VD ++++ + RP ++
Sbjct: 314 LWYSNSMKIMYSDGKLRTCLIPLAGFL--NHSLCPHVMHYGKVDPATNSLKFCLSRPCRS 371
Query: 156 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
GE + G +S L+ YGF+ + DN YD
Sbjct: 372 GEECCLSYGNFSSSHLITFYGFLPQGDNSYD 402
>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
Y34]
gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
P131]
Length = 390
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
+++ R L P+ + C +A D+ + DR Y AGE +++ G N
Sbjct: 173 ERLGKEDRMVLQPVADLFNHADAGC--AVAFNDEDFTIRADRDYDAGEEVLICYGNHSND 230
Query: 170 KLLINYGFVDEDNPYDRLVVEAAL 193
LL YGFV N +D + ++ A+
Sbjct: 231 FLLAEYGFVLAANRWDEVCIDDAI 254
>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
Length = 424
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 59/262 (22%), Positives = 102/262 (38%), Gaps = 40/262 (15%)
Query: 31 ESPLLWSETELAYLTGSPTKAEILER-AEGIKREYNELDTVWFMAGSLFQ-QYPYDIPTE 88
ES W++ EL L ER + R +E+D + Q +YP + P
Sbjct: 118 ESVQDWTQAELGELHD--------ERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEA 169
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAV 145
+TF+ FK A+ +Q+ + RR ALVP L + K D+ +
Sbjct: 170 LYTFDRFKFAWKTIQA-------RTFGRRLPWTALVPFADCLNHTNVATKYDFDVNDNGL 222
Query: 146 QLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 202
+ + G + G + N +LL++YGF DN +D + VE + P
Sbjct: 223 FRLYPSGATSFAQGAEVFNSYGRRSNFQLLLDYGFALPDNEWDYVDVEIGKDRAGP---- 278
Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD------TSEMQSVISSLGP 256
R KL V R+ ++ ++ P L ++D SE + +S
Sbjct: 279 ------RGRKLRFMKRVVRIDRQS-SLDELFPPSFLAGLADPVPDEEQSEAAAELSERTA 331
Query: 257 ICPVSPCMERAVLDQLADYFKA 278
+C + +++ +AD+ A
Sbjct: 332 LCDALEWLRSILIETIADWGTA 353
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 27/184 (14%)
Query: 32 SPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAGS--LFQQYPYDIPT 87
S + ++E EL G+ T + L+R+ I+ +Y L V +A S LF P
Sbjct: 194 SSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYRTL-VVRVLAQSRDLF-------PL 243
Query: 88 EAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCK--AMLAA 140
+ F+ E +K A V S + + L FA +L +SS+ + + A
Sbjct: 244 DKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPFA------DMLNHSSEVEPCHIYDA 297
Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
+ ++ + Y+AG+ ++ G PNS+LL YGFV NP D + + + P +
Sbjct: 298 SSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYGFVMPGNPNDSYDLVISTHPSAPFF 357
Query: 201 QDKR 204
+ K+
Sbjct: 358 ERKQ 361
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 46/194 (23%), Positives = 81/194 (41%), Gaps = 19/194 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G KS W PY+ L + + L+ E E+ L P + + E+ +++
Sbjct: 118 EKHAGNKSPWKPYLDVLPK--------AYTCLVCLEPEVVRLLPRPLQMKAEEQRMQVQK 169
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFA 119
+ + SLF + D+ F + F A+ + + V H QK L+
Sbjct: 170 LFISSRGFFSSLQSLFTE---DV-KHVFHYHAFLWAWCTINTRTVYMKHAQKQCLSAEPD 225
Query: 120 LVPLGP--PLLAYSSKCKAMLAAVDDAV--QLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
+ L P LL +S + A ++ ++ K E + + GP N +LL+ Y
Sbjct: 226 VYALAPYLDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFICYGPHDNHRLLLEY 285
Query: 176 GFVDEDNPYDRLVV 189
GFV +NP+ + +
Sbjct: 286 GFVASNNPHSAVYI 299
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
+Q P K G + + GP N++LL+ YG+ ++DNPY +E L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P K + AAV ++ +AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 179 DEDNPYDRLVVEAALNTED 197
D +N D + + L+ D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+VPL L A + + A L +D++ + +P AG+ I G P S LL YG+V
Sbjct: 234 GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGDEIFNDYGELPRSDLLRRYGYV 293
Query: 179 -DEDNPYDRLVVEAALN 194
D PYD V+EA+L+
Sbjct: 294 TDNYAPYD--VIEASLD 308
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 39.7 bits (91), Expect = 1.9, Method: Composition-based stats.
Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 19/192 (9%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YEK + +FW P+ L + + +S TEL L G+ AE L +
Sbjct: 885 IYEK-ENPNTFWRPFYDTLPSY-------FTTSIHYSSTELLELEGTNLFAETL----AV 932
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K++ F L QYP P F++E F A + S + L K+ + L
Sbjct: 933 KQQLQAFRDYLF--PELSNQYPDIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCL 989
Query: 121 VPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
VP+ ++ + + + D + ++V A I + G NS L + YGF
Sbjct: 990 VPMAD-MINHHTNAQISERHFDQDSNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGF 1048
Query: 178 VDEDNPYDRLVV 189
V +N YD V
Sbjct: 1049 VIPNNIYDSFHV 1060
>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
Length = 1017
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/88 (25%), Positives = 38/88 (43%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P S + + V AGE ++ G + N++ L+N GFV
Sbjct: 802 ALIPFWDMANHGSGRITTFFDSTAGEVSCNAQAACSAGEQFFIYYGDRTNTEFLVNNGFV 861
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMV 206
D DN D + + L+ DP + + ++
Sbjct: 862 DPDNRNDYVNIRLGLSPTDPLAEKRAII 889
>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
gorilla]
Length = 440
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 90/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G++S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGRRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|392587357|gb|EIW76691.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 430
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 30/51 (58%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
+ + + +V + G+ I GP+PNS+L++ YGF +NP D +V++
Sbjct: 230 STETTISIVSHDAFNEGQEIYNNYGPKPNSELILGYGFSLPNNPDDTIVLQ 280
>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
Length = 539
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 62/311 (19%), Positives = 118/311 (37%), Gaps = 36/311 (11%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ EK +G S W P+I L + + L ++ ++ L G+ + + I
Sbjct: 196 IIEKVRGAASNWQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVI 248
Query: 61 KREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHL 109
R Y + D+V +LF +Y Y++ A + +Q V Q
Sbjct: 249 ARIYASMYKCAYMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATDS 307
Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
V + AL+P K + ++ + +KAGE ++ G + N+
Sbjct: 308 DGVRNSPMSALIPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCNA 367
Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGR 224
L+++GF+D +N D + + L+ D + + ++ +R +L V H
Sbjct: 368 DRLVHHGFLDMNNLKDYVHIRLGLSPTDALAEQRALLLSELNIERKAELRVLPAPEHISG 427
Query: 225 EKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
E +L ++R+ +S S+++ + L C + +E L K
Sbjct: 428 E------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKL 481
Query: 279 RLAGYPATLSE 289
L ATL E
Sbjct: 482 LLGVLDATLKE 492
>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
Length = 492
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 134/337 (39%), Gaps = 48/337 (14%)
Query: 9 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKR----- 62
+S W PY L + ++S + WS ELA L S ++ ++AE I
Sbjct: 104 ESKWAPYFNVLPTK-------LDSLVFWSPEELAELQASAVLKKVGKDKAEEIFHQSISK 156
Query: 63 ---EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH-LQKVSLARRF 118
E ++D ++ S Y +DIP E + V QK SLA
Sbjct: 157 VTPEGTDVD-IFHRVASTIMAYAFDIPD----IEQEDEEGANEDDLVDDDEQKTSLA--- 208
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
++PL +L + A L + +++ P K GE I+ G P S LL YG+V
Sbjct: 209 -MIPLAD-MLNADADNNARLHYDGEELEMRTINPIKTGEEILNDYGQLPRSDLLRRYGYV 266
Query: 179 -DEDNPYDRLVVEAALNT-EDPQYQD---KRMVAQRNGKLSVQVFHV------------- 220
D+ +D V E + +T D YQD + V R G++ ++
Sbjct: 267 TDKYATFD--VAEISTSTITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAHDVG 324
Query: 221 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 280
HA E ISD L L + + ++ SS S V L + R
Sbjct: 325 HATEEWPCISDELVALVYLLLVGEETLAAIQSSKMSFPSRSKMETELVGKALQRILERRE 384
Query: 281 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
Y T+ EDE +L N + ++A Q VRM +K++
Sbjct: 385 REYATTVVEDENLLQSGNHSNRVKMAIQ-VRMGEKVV 420
>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
DL-1]
Length = 499
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 34/61 (55%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
++VPL L + ++ A L+ +DA+ + + K GE I G PNS++L YG+V
Sbjct: 219 SMVPLADTLNSNTNLVNANLSYENDALVMTATKDIKKGEQIYNIYGELPNSEILRKYGYV 278
Query: 179 D 179
+
Sbjct: 279 E 279
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 179 DEDNPYDRLVVEAALNTED 197
D+ N D + + L+ D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397
>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 693
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 79/188 (42%), Gaps = 11/188 (5%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
+++ + K S W PY+ L R + S L + +L +L G+ +K
Sbjct: 114 HQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQAYMNAVK 168
Query: 62 REYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
EY+ ++ G L + Y +D+ A+T I +AF + + V+L + ++
Sbjct: 169 EEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPALKQDEE 226
Query: 121 VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
+ PL+ S+ K A + L V P + E I GP N +L+ YGF
Sbjct: 227 FQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQLMTTYGF 286
Query: 178 VDEDNPYD 185
DNP D
Sbjct: 287 CIVDNPCD 294
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
GE +V+ G + N +LL +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233
>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
Length = 537
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 86/215 (40%), Gaps = 26/215 (12%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ +G+KS W YI L + ++PL +++ EL L + +R K
Sbjct: 86 ERLKGEKSQWAKYIEYLPK-------TFDTPLYFTDDELKSLEHTNIFYGCNDRKRIWKE 138
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
E+ A L D P + F++ ++ A S + P
Sbjct: 139 EHA-------TAAKLL-----DNP-DNFSWNMYLWAATVFSSRCFSSALLGEEDTDDAAP 185
Query: 123 LGPPL---LAYSSKCKAMLAAV---DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
+ PL L + +C + V AVQLV +P +G + GP+ N +LL+ YG
Sbjct: 186 ILIPLVDSLNHKPRCPIIWNKVTKESHAVQLVSVKPISSGGQVYNNYGPKGNEELLMGYG 245
Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
F +N ++ + +L+ + KR + +G
Sbjct: 246 FCLPNNEFETFALRLSLDKAVYNSEKKRSILASHG 280
>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 467
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 43/181 (23%), Positives = 72/181 (39%), Gaps = 13/181 (7%)
Query: 9 KSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
KSFW PYIR L + Q + Q A+ W + E L G+ + I + ++R+ E
Sbjct: 106 KSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDKIRNDVRRDLQE 163
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFE--IFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
+ + G + + TE + + IF + Q+ SL R +
Sbjct: 164 AQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRSLPRGVTMDDFS 223
Query: 125 PPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
L + M + DD Q L V R + G+ + + N++LL+ YGF
Sbjct: 224 VLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMKTNAELLLGYGF 283
Query: 178 V 178
+
Sbjct: 284 M 284
>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 79/201 (39%), Gaps = 25/201 (12%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------PTKAEI 53
M E GK S W Y+ + G S L W + ++ L G+ E
Sbjct: 115 MCEMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDET 170
Query: 54 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKV 112
L+ + + + T++F LF + + EA A VA S + ++
Sbjct: 171 LDLPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEI 227
Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPN 168
A+VP +L ++ C+A + D +Q++ R K GE + GP N
Sbjct: 228 Q-----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTYGPLRN 281
Query: 169 SKLLINYGFVDEDNPYDRLVV 189
++LL YGFV NP+ V
Sbjct: 282 AELLRRYGFVLPRNPHGGTTV 302
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 74/327 (22%), Positives = 123/327 (37%), Gaps = 74/327 (22%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEG 59
M E+++G++S W P++ L A SP+LW E + L SPT E R
Sbjct: 41 MAERQKGEESRWAPFLECLPE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAA 93
Query: 60 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
+++E++ +A + ++ S
Sbjct: 94 LQQEWD---------------------------------VIAQRIATGDARRFSGGDELK 120
Query: 120 L-VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
L + LG P + +L A+ D +PN +L + G V
Sbjct: 121 LWITLGSP--GWGGTSDKLLMAIYDG---------------------RPNGELAMATGRV 157
Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
++DN D L V L D + K+ + + G VQ F + R + +L YLRL
Sbjct: 158 EDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR---MPTQLLAYLRL 214
Query: 239 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP----ATLSEDEAML 294
++D + + V S ++P E VL L + RL Y + ED +L
Sbjct: 215 ARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGMHMGSAEEDVKLL 272
Query: 295 TDYNLHPKKRVATQLVRMEKKMLNACL 321
L ++R+A +L + EK +L L
Sbjct: 273 QRPGLTAQERLAARLRKAEKAILQGTL 299
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 323 ALIPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV 382
Query: 179 DEDNPYDRLVVEAALNTED 197
D +N D + + L+ D
Sbjct: 383 DVNNLKDYVNIRVGLSPTD 401
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)
Query: 142 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 193
D+A Q +VV RPY+ GE +++ G N +LL YGFV E N +D ++ AL
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371
Query: 194 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 220
P NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 20/149 (13%)
Query: 46 GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 105
G+ EI++ E ++ +Y+EL +L YP P E +T+E F A S
Sbjct: 277 GTLLFDEIMQAKEHLRVQYDEL------VPALCNNYPDVFPPELYTWEQFLWACELWYSN 330
Query: 106 VVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 157
+ ++ + R L+P L P ++ Y + ++ + ++ + RP + GE
Sbjct: 331 SMKIKFLDGKLRTCLIPIAGFLNHSLHPHIIHYGK-----VDSITNTLKFPLSRPCRVGE 385
Query: 158 SIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
+ G + L+ YGF+ + DN YD
Sbjct: 386 QCCLSYGNFSGAHLITFYGFLPQGDNRYD 414
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
D AV L++ P G ++ GP+PN++L++ YGF NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 85/228 (37%), Gaps = 49/228 (21%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------- 53
+YE G S W PY L + + + WSE ELA L S +I
Sbjct: 91 LYEYINGDASNWAPYFSVLPTE-------FNTLMFWSEDELAELQASAVLNKIGKEGANE 143
Query: 54 ------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
+RAE ++ E N L + GSL Y +D+ A
Sbjct: 144 AFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNVL--LMHKMGSLIMAYAFDVEP-A 200
Query: 90 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
+ + + A + L K ++PL L A + A L + +++
Sbjct: 201 TSRKDVDEEGFAEEEEDEALPK-------GMIPLADMLNADADCNNARLFYEEKYLEMKA 253
Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTE 196
+P +AGE + GP P S LL YG+V D YD + + L TE
Sbjct: 254 LKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYDVVEINMDLVTE 301
>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
Length = 538
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
++VP L A ++ C A L + + + +P K GE + G PNS++L YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313
Query: 179 D 179
+
Sbjct: 314 E 314
>gi|242210759|ref|XP_002471221.1| predicted protein [Postia placenta Mad-698-R]
gi|220729780|gb|EED83649.1| predicted protein [Postia placenta Mad-698-R]
Length = 256
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 33/64 (51%)
Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
++ LV+ + AG + GP+PN++L++ YGF NP D +V++ + Q
Sbjct: 77 SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLKLGGASAAQHAQHN 136
Query: 204 RMVA 207
VA
Sbjct: 137 NAVA 140
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 31/48 (64%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
++D +V+++ ++ K E+I + G PN L++YGFV NPYD++
Sbjct: 136 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 183
>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
Length = 533
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 89/211 (42%), Gaps = 35/211 (16%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
SFW P+ + R+ ++ W + L L + + I + I+ EY++LD
Sbjct: 163 SFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRDAIRQLEALIEYEYDQLDL 215
Query: 70 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------ 123
+L ++P + F+++ FK AF+ + S + + V+ A ++P
Sbjct: 216 -----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-SVNNAPCTVMIPFVDFFNH 269
Query: 124 -GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRPYKAGESIVVWCGPQPNSK 170
G +A+S +A A+ DD+V+ + + + GE + + N
Sbjct: 270 NGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHSNEV 329
Query: 171 LLINYGFVDEDNPYDRLVVEAALN---TEDP 198
LL++YGF N +D +V + T DP
Sbjct: 330 LLLHYGFALPHNEHDTFLVRLHFDREKTNDP 360
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 28/44 (63%)
Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
LV+ P G ++ GP+PNS+L++ YGF +NP D +V++
Sbjct: 245 LVIHTPTPRGSELLNNYGPKPNSELILGYGFSLPNNPDDTIVLK 288
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 31/48 (64%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
++D +V+++ ++ K E+I + G PN L++YGFV NPYD++
Sbjct: 275 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 322
>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
Length = 416
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 93 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 144
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 145 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 200
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 201 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 260
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 261 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308
>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
troglodytes]
gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
Length = 307
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 46/199 (23%), Positives = 78/199 (39%), Gaps = 19/199 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPYDRLVVEAALN 194
GFV NP+ + V N
Sbjct: 285 GFVSVHNPHACVYVSRGWN 303
>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 1 [Galdieria
sulphuraria]
Length = 487
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 129/325 (39%), Gaps = 35/325 (10%)
Query: 10 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LD 68
S W PYI L G + WS +ELA L P E+ I + Y E L
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEV-----KINQYYREALY 214
Query: 69 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 127
T F + S P + + +F A VQS + V + +AL+P+ L
Sbjct: 215 TRVFESLS----SPVRVWLQNEKENVFFWALDMVQSRAFGIPDVG-NKTYALLPMMDMLN 269
Query: 128 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
+S+ + ++ + ++ G I + GP N LL YGF+ +NP D
Sbjct: 270 HRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYF 329
Query: 188 VVEAALNTEDPQYQDKRMVAQRNG----KLSV----------QVFHVHAGREKEAISDML 233
V+ Y+ + AQ + KLS+ + FH++ + I D++
Sbjct: 330 QVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGKTFHLYHDHYDDEI-DII 388
Query: 234 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
+ + +D ++Q + +S + V + K L ++ EDE +
Sbjct: 389 LRVFMASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSVEEDEQL 448
Query: 294 LTDYN-LHPKKRVATQLVRMEKKML 317
L + + L K ++A Q R+EKK +
Sbjct: 449 LKNKDQLSTKLQLAIQF-RLEKKYI 472
>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
Length = 562
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 29/59 (49%)
Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
L + + K D + V K+GE + G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282
>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
Length = 307
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 46/199 (23%), Positives = 78/199 (39%), Gaps = 19/199 (9%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPYDRLVVEAALN 194
GFV NP+ + V N
Sbjct: 285 GFVSVHNPHACVYVSRGWN 303
>gi|403331500|gb|EJY64700.1| hypothetical protein OXYTRI_15263 [Oxytricha trifallax]
Length = 741
Score = 38.9 bits (89), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 36/64 (56%), Gaps = 4/64 (6%)
Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN----TEDPQYQDKRMVAQ 208
YK G+ + G + N LL+NYGF +N Y+ L +N +E + ++K++ A+
Sbjct: 440 YKKGQQVFHCYGRRTNRFLLLNYGFCMNNNKYNSLSFRVWINFDWQSERQKDEEKKLKAE 499
Query: 209 RNGK 212
++GK
Sbjct: 500 KDGK 503
>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
Length = 440
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 85/200 (42%), Gaps = 30/200 (15%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E+ +G SFW PY+ L + L W+ E+ L TK +
Sbjct: 62 LCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKA 113
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA-- 115
+ +N L + L +Q P AFT+++FK A+ +V + V++ Q L+
Sbjct: 114 EESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPD 169
Query: 116 --RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQP 167
+ AL P LL ++ + A DD+ L +PY + + + GP
Sbjct: 170 EEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHS 224
Query: 168 NSKLLINYGFVDEDNPYDRL 187
N KLL+ YGF NP++ +
Sbjct: 225 NEKLLLEYGFTLPCNPHNNI 244
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 54/113 (47%), Gaps = 15/113 (13%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E ++ ++SFW PYIR L + + ++ L ++ E L GSP +LE E I
Sbjct: 153 ETQKQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE--ESINT 199
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
N L F+ F + P + T FT+E F+Q +A +V L K LA
Sbjct: 200 YRNTLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 38.9 bits (89), Expect = 3.7, Method: Composition-based stats.
Identities = 45/189 (23%), Positives = 77/189 (40%), Gaps = 17/189 (8%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+YEK SFW P+ L + + +S TEL L G+ + E
Sbjct: 839 IYEKGNAN-SFWRPFYDTLPSY-------FTTSIHYSATELLELEGT----NLFEETLHT 886
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
K++ N F L +QYP P F++E F A + S + L K+ + + L
Sbjct: 887 KQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIKSCL 943
Query: 121 VPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
VP+ + +++ + + +++ A I + G N +L + YGF+
Sbjct: 944 VPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQNWELALYYGFI 1003
Query: 179 DEDNPYDRL 187
+N YD L
Sbjct: 1004 IPNNIYDSL 1012
>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
Length = 440
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 42/98 (42%), Gaps = 10/98 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M E+ G +S W Y L RG L P+ W+E L L G+ + E AE +
Sbjct: 106 MVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEARLEALKGTDLATHVREDAENL 160
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 98
+ +Y+E + L +P E TFE + +A
Sbjct: 161 RADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193
>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
Length = 440
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
troglodytes]
gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
Length = 440
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 38/87 (43%), Gaps = 14/87 (16%)
Query: 7 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
G S W P+ L SPLL+ TEL L G+ E E +G++ +
Sbjct: 481 GSASRWAPFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHES 533
Query: 67 LDTVWFMAGSLFQQYPYDIPTEAFTFE 93
L G L + YP PT+AFT+E
Sbjct: 534 L-------GPLAEAYPALFPTDAFTYE 553
>gi|242210761|ref|XP_002471222.1| predicted protein [Postia placenta Mad-698-R]
gi|220729781|gb|EED83650.1| predicted protein [Postia placenta Mad-698-R]
Length = 264
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 29/48 (60%)
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
++ LV+ + AG + GP+PN++L++ YGF NP D +V++
Sbjct: 215 SSISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLK 262
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 79/189 (41%), Gaps = 22/189 (11%)
Query: 7 GKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTGSPTKAEILERAEGIK-- 61
G +SFW PYI+ L D Q R + L W E T L L + + G++
Sbjct: 137 GDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLREKLLERLKAKYETGLRLL 196
Query: 62 REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 116
+E+ +T W F+ S I + AF+ E+ K + V L+ S+
Sbjct: 197 KEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYIKGTPTRVKPLEDFSV-- 248
Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
LVPL + + + A + + L+V + GE + GP+ N +L++NYG
Sbjct: 249 ---LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEVPNNYGPRSNERLMMNYG 304
Query: 177 FVDEDNPYD 185
F N D
Sbjct: 305 FCIRGNVCD 313
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 83/202 (41%), Gaps = 19/202 (9%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS------ETELAYLTGSPTKAEILE 55
+EK++G SFW P+I L + LA PL+W EL L + TK + +
Sbjct: 155 FEKQRGSSSFWKPFIDMLP-ETSDFDLA---PLVWKVLKVDHYEELLKLLPNSTKRHMDK 210
Query: 56 RAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFT----FEIFKQAFVAVQSCVVHL- 109
+ + +YN + D + + + T+A E++ +++ + S +++
Sbjct: 211 IYDRFQTDYNVVKDLISIKLKEISDNERSNDLTDAIRHLVPIELYLWSWMCINSRCLYME 270
Query: 110 --QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
Q + A F + P L +S + L Q+ Y E + + GP
Sbjct: 271 IPQSKNAADNFTMAPY-VDFLNHSCDDQCGLKIDGTGFQVYTTCSYNPDEQLFLSYGPHS 329
Query: 168 NSKLLINYGFVDEDNPYDRLVV 189
N LL YGF +N ++ L V
Sbjct: 330 NEFLLCEYGFTLPENKWNDLDV 351
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 189
+ DR YKAGE + V G N LL+ YGF+ + N P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250
>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 429
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 209
+ Y+ GE +++ G N +L+ YGFVD DN D E ++ Y KR +
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305
Query: 210 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 258
+LS + H A E ISD +L LR V T E + +
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362
Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 318
ER D + ++RLA YP T+ EDE L N +R A Q +K++L
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420
Query: 319 ACLQV 323
++
Sbjct: 421 DASRI 425
>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
Length = 495
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
Length = 495
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 13/98 (13%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
MYE GK+SFW Y+ EL ++ PL W+E E + L G+ + E E
Sbjct: 52 MYELSIGKESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELT 104
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 98
+ ++ L +Q+ + ++FT E F+ A
Sbjct: 105 QEDFET------HVPPLVEQHADRLRADSFTLESFRVA 136
>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
Length = 495
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 32/63 (50%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
+++PL L A + +C A L +++ +P KA E I G N +LL YG+V
Sbjct: 220 SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQIYNTYGNHANFELLRRYGYV 279
Query: 179 DED 181
+ D
Sbjct: 280 EVD 282
>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
Length = 440
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 50/228 (21%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFA 119
+ + LF + I F++ A+ V + V+L+ + L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELD 224
Query: 120 LVPLGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
+A L D+ +G+ + + GP+ N +LL YGFV+ +NP D V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407
>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
Length = 339
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 99 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 94/229 (41%), Gaps = 26/229 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ G+ S W PYI L + PL + + + L S K + ++ E +
Sbjct: 118 ERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQE 169
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV---HLQKVSLARRF 118
++ T + LF Q PTE F+ + + A+ +V + V H Q L+R
Sbjct: 170 LFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREK 224
Query: 119 ALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
+ L P LL + + + +A ++ K + + GP N +LL+
Sbjct: 225 DVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLE 284
Query: 175 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
YGFV NP+ D ++ L+ +D Q ++K + + N L F
Sbjct: 285 YGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 94/229 (41%), Gaps = 26/229 (11%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
E+ G+ S W PYI L + PL + + + L S K + ++ E +
Sbjct: 118 ERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQE 169
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV---HLQKVSLARRF 118
++ T + LF Q PTE F+ + + A+ +V + V H Q L+R
Sbjct: 170 LFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREK 224
Query: 119 ALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
+ L P LL + + + +A ++ K + + GP N +LL+
Sbjct: 225 DVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLE 284
Query: 175 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
YGFV NP+ D ++ L+ +D Q ++K + + N L F
Sbjct: 285 YGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333
>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
Length = 480
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 125 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 183
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 144
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 184 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 239
Query: 145 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 180
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 240 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 276
>gi|403366800|gb|EJY83208.1| hypothetical protein OXYTRI_19172 [Oxytricha trifallax]
Length = 869
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 75/170 (44%), Gaps = 18/170 (10%)
Query: 138 LAAVDDAVQLVVDRPYKAG---ESIVVWC-GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
++ V D ++ R Y G S V C G N ++L YGF N Y+ + ++ L
Sbjct: 454 VSQVPDDFNFII-RTYNDGFPKGSQVFLCYGRMSNREMLKRYGFCLTYNKYNYIFIKLRL 512
Query: 194 NTEDPQYQDKRMVAQR-------NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 246
+DP + ++ V ++ K+ + H +K + +L ++++ Y + +
Sbjct: 513 EQQDPDFIYRKYVLRKFFSIEPETDKMDISSRHFRIYFQK-LNTKVLKFIKILYFNVQED 571
Query: 247 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
S I + S +E +L D ++ L +P T+ ED+ +L++
Sbjct: 572 DISCI-----VETRSLSLEYLAFQRLRDVYETFLKSFPTTIGEDKKILSE 616
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 85/200 (42%), Gaps = 30/200 (15%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
+ E+ +G SFW PY+ L + L W+ E+ L TK +
Sbjct: 6 LCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKA 57
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA-- 115
+ +N L + L +Q P AFT+++FK A+ +V + V++ Q L+
Sbjct: 58 EESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPD 113
Query: 116 --RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQP 167
+ AL P LL ++ + A DD+ L +PY + + + GP
Sbjct: 114 EEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHS 168
Query: 168 NSKLLINYGFVDEDNPYDRL 187
N KLL+ YGF NP++ +
Sbjct: 169 NEKLLLEYGFTLPCNPHNNI 188
>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
Length = 471
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 25/48 (52%)
Query: 138 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
L AV L R GE +++ G N L ++YGF+ EDNPYD
Sbjct: 202 LEGGGGAVGLFARRAITEGEPLLLSYGQLSNDFLFMDYGFIVEDNPYD 249
>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
boliviensis]
Length = 440
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 88/228 (38%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ V + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPD 224
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
10762]
Length = 381
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 183
+ +++ +R Y AGE ++ G N KLL++YGF+ E P
Sbjct: 204 NGYEMLANRDYDAGEEVLFTYGAHSNDKLLVHYGFICESPP 244
>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
Length = 498
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 144
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257
Query: 145 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 180
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294
>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
Length = 440
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 50/228 (21%), Positives = 89/228 (39%), Gaps = 24/228 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E E+ L KA+ E+ ++
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
+ + LF + I F++ A+ + + V+L Q+ L+
Sbjct: 169 FFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPD 224
Query: 120 LVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA + ++ ++ E + + GP N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284
Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
GFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
Length = 429
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 30/59 (50%)
Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
R YK GE I + GP PN L + YGF E N D + ++ + + + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322
>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 473
Score = 38.1 bits (87), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 29 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201
Query: 89 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 144
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257
Query: 145 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 180
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294
>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
Length = 289
Score = 38.1 bits (87), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 47 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103
>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
7435]
Length = 846
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
++VP L A ++ C A L + + + +P K GE + G PNS++L YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313
Query: 179 D 179
+
Sbjct: 314 E 314
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 38.1 bits (87), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 45/188 (23%), Positives = 81/188 (43%), Gaps = 27/188 (14%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K G + W YIR + +V P +S E L G+ + + + +++E
Sbjct: 120 QKIGISNPWTEYIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKE 172
Query: 64 YNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
++ L + W Q++ +D T FTF+ +K +S VV L +
Sbjct: 173 FDHLRQATEEIPWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----H 222
Query: 119 ALVP---LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLI 173
A+VP + S K K +AV QL + + GE + + G + P S+++
Sbjct: 223 AIVPCVDMANHACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVF 282
Query: 174 NYGFVDED 181
+YGFV+ +
Sbjct: 283 SYGFVENE 290
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 38.1 bits (87), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 67/151 (44%), Gaps = 14/151 (9%)
Query: 41 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA-- 98
LA L G+ EI++ + ++++Y+EL + L +P + T++ F A
Sbjct: 266 LAALEGTLLFDEIMQAKQHLRQQYDELFPL------LCTNFPEIFRKDVCTWDNFLWACE 319
Query: 99 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKA 155
S +V L L+ LVP+ L S VD+A + + + RP A
Sbjct: 320 LWYSNSMMVVLSSGKLST--CLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDA 377
Query: 156 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
GE + G P S L+ YGF+ DNPYD
Sbjct: 378 GEQCFLSYGKHPGSHLVTFYGFLPRGDNPYD 408
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 38.1 bits (87), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 10/98 (10%)
Query: 1 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
M+E+ G+ S W Y L RG+ + P+ W+ +L +L G+ + E AE +
Sbjct: 59 MHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSAQLEHLRGTDLLRHVTEDAESM 113
Query: 61 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 98
+ ++NE + L +P P T E + +A
Sbjct: 114 RLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146
>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
distachyon]
Length = 532
Score = 38.1 bits (87), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 77/172 (44%), Gaps = 22/172 (12%)
Query: 41 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
LA L G+ E+++ + + ++Y+EL + L ++P + +T++ F A
Sbjct: 262 LAALEGTLLFDELMQARQHLHQQYDELFPM------LCTKFPEIFTQDIYTWDNFLWACE 315
Query: 101 AVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLVVDRP 152
S + + S L+P+ P +L Y +A +++ + RP
Sbjct: 316 LWYSNSMMVVLSSGKLTTCLIPVAGLLNHSVYPHILNYGRVDQAT-----KSLKFPLSRP 370
Query: 153 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDR--LVVEAALNTEDPQYQ 201
KAG+ + G S L+ YGF+ EDNPYD L ++ +++ ED Q
Sbjct: 371 CKAGQQCFLSYGKHSGSHLITFYGFLPREDNPYDVVPLDLDMSVDEEDGTAQ 422
>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
Length = 447
Score = 38.1 bits (87), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 72/355 (20%), Positives = 142/355 (40%), Gaps = 49/355 (13%)
Query: 2 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------- 53
+E QG S W Y + L +Q S + WS+ EL+ L G+ +I
Sbjct: 93 WEYNQGGSSKWDAYFKILPKQ-------FTSLMFWSKEELSLLKGTTVVDKIGLEDIENE 145
Query: 54 LERAEGI-KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 112
ER I K+ N + LF++ I + +FT E +K + +
Sbjct: 146 FERVRDIVKQNENVFGDIANYTLDLFKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDED 205
Query: 113 ------SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 166
+ A+VP+ L + + A ++ ++++ + KAG+ I
Sbjct: 206 EEIDLRTSVDDVAMVPMADILNSRTDSVNAHTEYEENCLRMISLQDIKAGDQIFNTYNDP 265
Query: 167 PNSKLLINYGFVD-----ED-----NPYD------RLVVEAALNTEDPQYQDKRM--VAQ 208
PN+ L+ YG VD +D N D +++E AL ++++R+ +
Sbjct: 266 PNADLIRRYGHVDYSPLSQDPDFMGNKNDVVELPADILLELALPDAKESHKERRVEFLLD 325
Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
G+ S ++ H + + ++L L + +E ++ S P + +
Sbjct: 326 ECGEDSFELTH------DDLVPELLKICVLLFTESEAEFKTREKSRK--LPKASGFTKGK 377
Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 322
+ L K R+ Y +TL +D + L + + P+ +V + E+++LN ++
Sbjct: 378 AEFLIKAIKQRMEQYGSTLEDDISKLDNKDSLPENNFKALVVTVGERRILNKAIE 432
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 84/205 (40%), Gaps = 32/205 (15%)
Query: 1 MYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 59
+YEK QG SFW P I L G + WSE EL L KAE +
Sbjct: 99 LYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDESLKAEAMIVVAS 151
Query: 60 IKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
+++ Y L + G +F + +T+E F+ A + V+S RF
Sbjct: 152 MQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRTFG--------RF 196
Query: 119 ----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGESIVVWCGPQPNSK 170
++VP L + + + + D ++ GE + GP+ N++
Sbjct: 197 LPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEEAFMSYGPRSNAE 256
Query: 171 LLINYGFVDEDNPYDRLVVEAALNT 195
LL++YGF + N Y+ + + +NT
Sbjct: 257 LLLHYGFALQSNRYEAVELNFRINT 281
>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 513
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 84/222 (37%), Gaps = 19/222 (8%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
A+VP+ L A A L + +V +P + GE I G PNS LL YG
Sbjct: 240 IAMVPMADMLNAQYGSENAKLFYEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGH 299
Query: 178 VD----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 227
VD E NP D + + A L R+ A+ + K + + G +
Sbjct: 300 VDLVPLPGVPWKEGNPADVVEIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVF 358
Query: 228 AIS------DMLPYLRLGYVSDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARL 280
+ D++ + E + S P P + + + + RL
Sbjct: 359 VLGTDLELPDVMISFLKLLLLSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRL 418
Query: 281 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACL 321
A YP TL DEA+L+ P +VR+ EK +L C+
Sbjct: 419 AKYPTTLEHDEALLSGQTSLPYNVRNAIIVRIGEKHILVGCM 460
>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
Length = 490
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 30/48 (62%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
++D +V++V ++ + SI + G PN L++YGFV NPYD++
Sbjct: 274 SLDMSVKVVAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYDQV 321
>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
gi|194701488|gb|ACF84828.1| unknown [Zea mays]
gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
Length = 495
Score = 37.7 bits (86), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 253 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 309
>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
Length = 256
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 89/220 (40%), Gaps = 23/220 (10%)
Query: 3 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
EK G +S W PY+ L + A P+ E ++ L P KA+ E+ ++
Sbjct: 20 EKHAGDQSLWKPYLDILPK-------AYTCPVC-LEPKVVNLFPEPLKAKAEEQRARVQG 71
Query: 63 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFA 119
++ + LF + +I F++ A+ V + V H Q+ +
Sbjct: 72 FFSSSRDFFSSLQPLFSEAVENI----FSYSALLWAWCTVNTRAVYMKHGQRKCFSPEPD 127
Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
L P LL +S + KA ++ + E + + GP N +LL+ Y
Sbjct: 128 TYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFICYGPHDNQRLLLEY 187
Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
GFV NP+ + V A L + DK+M N K+S+
Sbjct: 188 GFVSIQNPHACVYVSADLLVKYLPSTDKQM----NKKISI 223
>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 37.7 bits (86), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 42/191 (21%), Positives = 80/191 (41%), Gaps = 33/191 (17%)
Query: 4 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
+K G + W YIR + ++ P +S E L G+ + + + +++E
Sbjct: 120 QKIGVSNPWTEYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKE 172
Query: 64 YNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
+ L D W Q++ +D T FTF+ K +S VV L + A
Sbjct: 173 FEHLRQATEDIHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA--- 223
Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQ-PNSK 170
+ P + + C+ ++ A D +QL + + GE + + G + P S+
Sbjct: 224 ----IVPCVDMANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASE 279
Query: 171 LLINYGFVDED 181
++ +YGFV+ +
Sbjct: 280 MVFSYGFVENE 290
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 37.7 bits (86), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
D+ V +V+D+ KAG I GP N LL +GF E NP+D
Sbjct: 307 DECVDIVLDKDVKAGVEIFNSYGPLSNVFLLSRHGFAVEGNPHD 350
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 63/151 (41%), Gaps = 21/151 (13%)
Query: 44 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 103
L G+ EI++ E ++ Y+EL L + + P E +T+E + A
Sbjct: 258 LDGTLLLDEIMQAKELLRERYDELI-------PLLSNHRHVFPPEHYTWEHYLWACELYY 310
Query: 104 SCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 155
S + ++ + L+P+ P ++ Y C +++ V RP
Sbjct: 311 SNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVCVET-----SSLKFPVSRPCNK 365
Query: 156 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
GE + G +S LL YGF+ + DNPYD
Sbjct: 366 GEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 396
>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 445
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 13/111 (11%)
Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
+ ++ + Y+AG+ + + G PN++LL YGFV NP D + + P ++ KR
Sbjct: 195 LSILAGKGYEAGDQVFIHYGSVPNNRLLRLYGFVIPSNPNDSYDLVLETHPLAPFFEQKR 254
Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV 250
+ G S + ++D LP YLR+ SD S++ +V
Sbjct: 255 KLWALAGLDSTSTISL-------TLTDPLPNNVLRYLRI-QRSDESDLAAV 297
>gi|156538697|ref|XP_001607787.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Nasonia
vitripennis]
Length = 403
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 33/181 (18%), Positives = 70/181 (38%), Gaps = 25/181 (13%)
Query: 34 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 93
L S ++ L SP L++ I R+Y+ ++ + + + D+ FT+E
Sbjct: 108 LYMSMDDMMELKESPALETALKQCRNIARQYSYFKKLFHNSKNSVSKLLADV----FTYE 163
Query: 94 IFKQAFVAV-----------QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 142
++ A + QS +VH +L+P+ K +
Sbjct: 164 EYRWAVSTIMTRQNVIPSENQSAMVH----------SLIPMWDMCNHSEGKITTNFNEIS 213
Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 202
+ + + +K + I ++ G + N++ ++ GFV DN D + + + D +
Sbjct: 214 NCCECYAMKSFKTDDQIFIYYGSRTNAEFFVHSGFVYPDNANDSYELHLGIGSSDKLRSE 273
Query: 203 K 203
K
Sbjct: 274 K 274
>gi|388854849|emb|CCF51530.1| uncharacterized protein [Ustilago hordei]
Length = 673
Score = 37.7 bits (86), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 34/76 (44%)
Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
++ P+ L A A L +++ +P AGE I PNS LL YG
Sbjct: 362 ISMTPMADMLNARFESDNARLFYKSHVLEMRATKPISAGEQIFNTYADPPNSDLLRRYGH 421
Query: 178 VDEDNPYDRLVVEAAL 193
VDE N D + ++A L
Sbjct: 422 VDEPNGSDVVELDAKL 437
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.135 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,264,592,823
Number of Sequences: 23463169
Number of extensions: 207579814
Number of successful extensions: 533380
Number of sequences better than 100.0: 702
Number of HSP's better than 100.0 without gapping: 353
Number of HSP's successfully gapped in prelim test: 349
Number of HSP's that attempted gapping in prelim test: 532294
Number of HSP's gapped (non-prelim): 866
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)