BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009869
(523 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 819 bits (2116), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/510 (79%), Positives = 442/510 (86%), Gaps = 21/510 (4%)
Query: 21 HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
H PL + S IS+S R +F +S +RR N S SSS+TLVAGSR E V+K
Sbjct: 24 HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81
Query: 74 KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
KE+D GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82 KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141
Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTE
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
AF+FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELV 321
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 372
VDRPYKAGESI VW CGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTED
Sbjct: 322 VDRPYKAGESIAVW-----------CGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTED 370
Query: 373 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 432
PQYQDKRMVAQRNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGYV+ SEMQSVISS GP+
Sbjct: 371 PQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPV 430
Query: 433 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
CPVSPCMERA+L+Q+ADYFK RLAGYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L
Sbjct: 431 CPVSPCMERAMLEQVADYFKRRLAGYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLL 490
Query: 493 NACLQVTADMIMLLPDVTVSPCPAPYAPLL 522
++CL+VT D I LPD TVSPCPAPYAPLL
Sbjct: 491 HSCLEVTIDFINQLPDHTVSPCPAPYAPLL 520
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 819 bits (2116), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/473 (83%), Positives = 423/473 (89%), Gaps = 18/473 (3%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
LDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPP
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPP 287
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
LLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVW CGPQPNSKLL+NY
Sbjct: 288 LLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVW-----------CGPQPNSKLLLNY 336
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
GFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPY
Sbjct: 337 GFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPY 396
Query: 411 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 470
LRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L
Sbjct: 397 LRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLA 456
Query: 471 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 457 DSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 812 bits (2098), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/479 (82%), Positives = 423/479 (88%), Gaps = 24/479 (5%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
HY+AASEDLQ AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVW CGPQPNS
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVW-----------CGPQPNS 336
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
KLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+
Sbjct: 337 KLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAV 396
Query: 405 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SE
Sbjct: 397 SDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSE 456
Query: 465 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
DE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 457 DECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 800 bits (2065), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/529 (76%), Positives = 439/529 (82%), Gaps = 33/529 (6%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME +C +K ISP L++ S +SIS P S RR F+
Sbjct: 1 MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45
Query: 61 DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
TLV G R EVVSK+ E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46 -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
WFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPPLLAY
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAY 284
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
SS CKAML AVD AV+LVVDRPYKAGE IVVW CGPQPNSKLL+NYGFVD
Sbjct: 285 SSNCKAMLTAVDGAVELVVDRPYKAGEPIVVW-----------CGPQPNSKLLLNYGFVD 333
Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 414
EDNPYDR+ VEAALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLG
Sbjct: 334 EDNPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLG 393
Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 474
YVSD SEMQSVISS GP+CPVSPCME+AVLDQL YF+ RLAGY ++SEDE ML D NL
Sbjct: 394 YVSDPSEMQSVISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNL 453
Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
+PKKRVATQLVR+EKKML ACLQ T D+I LPD T+ PCPAPYAPLL
Sbjct: 454 NPKKRVATQLVRLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLLK 502
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 785 bits (2026), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/438 (85%), Positives = 401/438 (91%), Gaps = 11/438 (2%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1 MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61 IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
A+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIV
Sbjct: 181 AIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIV 240
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 384
VW CGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQR
Sbjct: 241 VW-----------CGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQR 289
Query: 385 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 444
NGKL+VQ FHV G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVL
Sbjct: 290 NGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVL 349
Query: 445 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIM 504
DQL DYF+ RLAGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I
Sbjct: 350 DQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLIN 409
Query: 505 LLPDVTVSPCPAPYAPLL 522
LPD TVSPCPAPY PLL
Sbjct: 410 QLPDHTVSPCPAPYTPLL 427
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/487 (77%), Positives = 418/487 (85%), Gaps = 13/487 (2%)
Query: 36 RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
R P+F S RR+ R S+SDTLVA + + K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27 RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LK+KPS ++ +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85 VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVW
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVW--------- 315
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
CGPQPN+KLL NYGFVDEDN DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V
Sbjct: 316 --CGPQPNTKLLTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYV 373
Query: 396 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
+ G+E+EA+SDM+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF RL
Sbjct: 374 YTGKEREAVSDMIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRL 433
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 515
A YP TL+EDE+MLTD +L+PK+RVATQLVR+EKKML+ACLQ D+I LPD +VSPCP
Sbjct: 434 AAYPTTLAEDESMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCP 493
Query: 516 APYAPLL 522
APYAP L
Sbjct: 494 APYAPSL 500
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 769 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/532 (73%), Positives = 433/532 (81%), Gaps = 27/532 (5%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVW CGPQPN+KLL+NYG
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW-----------CGPQPNAKLLLNYG 342
Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
FVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYL
Sbjct: 343 FVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYL 402
Query: 412 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
RLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D
Sbjct: 403 RLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLAD 462
Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
+L P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 463 PSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 766 bits (1979), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/532 (72%), Positives = 432/532 (81%), Gaps = 27/532 (5%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVW CGPQPN+KLL+NYG
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW-----------CGPQPNAKLLLNYG 342
Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
FVDEDNPYDR++VEAALNTE PQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYL
Sbjct: 343 FVDEDNPYDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYL 402
Query: 412 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
RLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D
Sbjct: 403 RLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLAD 462
Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
+L P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 463 PSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/532 (72%), Positives = 431/532 (81%), Gaps = 25/532 (4%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V ++ + SSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVAG +V SKKE +D DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56 DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVW CGPQPN+KLL+NYG
Sbjct: 296 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW-----------CGPQPNAKLLLNYG 344
Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
FVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYL
Sbjct: 345 FVDEDNPYDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYL 404
Query: 412 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
RLGY+SD SEMQSVISS GP+C +SPCMERAVLDQLA+YF RL+GYP T ED+A+L D
Sbjct: 405 RLGYMSDPSEMQSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLAD 464
Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
+L P+KRVAT+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 465 PSLSPRKRVATRLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/443 (81%), Positives = 397/443 (89%), Gaps = 11/443 (2%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
G+ IVVW CGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKR
Sbjct: 310 GDPIVVW-----------CGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKR 358
Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 439
MVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCM
Sbjct: 359 MVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCM 418
Query: 440 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 499
ERA LDQLADYFKARLAGYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T
Sbjct: 419 ERAALDQLADYFKARLAGYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQAT 478
Query: 500 ADMIMLLPDVTVSPCPAPYAPLL 522
D I LPD T+SPCPAPYAPLL
Sbjct: 479 TDFINQLPDHTISPCPAPYAPLL 501
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/444 (81%), Positives = 396/444 (89%), Gaps = 12/444 (2%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
G+ IVVW CGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKR
Sbjct: 310 GDPIVVW-----------CGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKR 358
Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 439
MVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCM
Sbjct: 359 MVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCM 418
Query: 440 ERAVLDQLADYFKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQV 498
ERA LDQLADYFKARLAGYP L+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ
Sbjct: 419 ERAALDQLADYFKARLAGYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQA 478
Query: 499 TADMIMLLPDVTVSPCPAPYAPLL 522
T D I LPD T+SPCPAPYAPLL
Sbjct: 479 TIDFINQLPDHTISPCPAPYAPLL 502
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/487 (71%), Positives = 403/487 (82%), Gaps = 18/487 (3%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91 MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL Y TGSP K
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+LER GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQN 270
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVW
Sbjct: 271 VNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVW--------- 321
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
CGPQPN++LL+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F++
Sbjct: 322 --CGPQPNARLLLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNI 379
Query: 396 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
+ GREKEA+ DMLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RL
Sbjct: 380 YIGREKEAVLDMLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERL 439
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 515
AGYP T SEDE +L D +PK++VATQLV +EKKMLN+CL ++I LPD+ V+PCP
Sbjct: 440 AGYPTTASEDEIVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCP 499
Query: 516 APYAPLL 522
+PY+P+L
Sbjct: 500 SPYSPIL 506
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 690 bits (1780), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/500 (71%), Positives = 392/500 (78%), Gaps = 45/500 (9%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL------------------ 273
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
Q V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVW
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW------- 346
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
CGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF
Sbjct: 347 ----CGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVF 402
Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
V G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF
Sbjct: 403 QVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMR 462
Query: 454 RLAGYPATLSEDEAMLTDYN 473
RL+GYP T ED+A+ N
Sbjct: 463 RLSGYPTTPKEDDALEASCN 482
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 669 bits (1727), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/445 (71%), Positives = 369/445 (82%), Gaps = 14/445 (3%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 251 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 310
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
KAGE I+VW CGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+
Sbjct: 311 KAGEPIIVW-----------CGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQE 359
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSP
Sbjct: 360 KRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSP 419
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
C ERAVLDQL Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ
Sbjct: 420 CTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQ 479
Query: 498 VTADMIMLLPDVTVSPCPAPYAPLL 522
+ I LPD TVSPCPAP+AP L
Sbjct: 480 AANEFINDLPDHTVSPCPAPFAPEL 504
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 669 bits (1727), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/443 (72%), Positives = 366/443 (82%), Gaps = 14/443 (3%)
Query: 84 WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ +GLPP KV + E+P S K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85 WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFK
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAG
Sbjct: 265 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAG 324
Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
E I+VW CGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRM
Sbjct: 325 EPIIVW-----------CGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRM 373
Query: 381 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 440
VAQRNGKL++Q F V G+EKE IS+MLPYLRLGY+SD EMQ ++SS G CPVSPC E
Sbjct: 374 VAQRNGKLAIQKFQVCVGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSE 433
Query: 441 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
RAVLDQL Y K+RLAGYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ
Sbjct: 434 RAVLDQLVVYLKSRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAH 493
Query: 501 DMIMLLPDVTVSPCPAPYAPLLN 523
+ I LPD TVSPCPA YAP L
Sbjct: 494 EFISALPDHTVSPCPALYAPNLK 516
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 669 bits (1726), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/445 (71%), Positives = 369/445 (82%), Gaps = 14/445 (3%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 255 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 314
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
KAGE I+VW CGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+
Sbjct: 315 KAGEPIIVW-----------CGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQE 363
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSP
Sbjct: 364 KRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSP 423
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
C ERAVLDQL Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ
Sbjct: 424 CTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQ 483
Query: 498 VTADMIMLLPDVTVSPCPAPYAPLL 522
+ I LPD TVSPCPAP+AP L
Sbjct: 484 AANEFINDLPDHTVSPCPAPFAPEL 508
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/530 (63%), Positives = 395/530 (74%), Gaps = 34/530 (6%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 14 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 65 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 304
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
Y S CKAML AVD +V+L+VDRPYKAGE I+VW CGPQPNS+LL+NYGFV
Sbjct: 305 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVW-----------CGPQPNSRLLLNYGFV 353
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
DEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRL
Sbjct: 354 DEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRL 413
Query: 414 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 473
GY+SD EMQ ++SS G CPVSPC ERAVLDQL Y K+RLAGYP L EDEAML D +
Sbjct: 414 GYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGS 473
Query: 474 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
L PKK VAT+LVR+EKKML+ CL+ + I LPD TVSPCPA YAP L
Sbjct: 474 LEPKKEVATRLVRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/442 (71%), Positives = 367/442 (83%), Gaps = 14/442 (3%)
Query: 84 WMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ GLPP KV ++E+P N K P+ YVAA DLQAGD AF VP SLVVTLERVL
Sbjct: 81 WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFK
Sbjct: 201 TESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFK 260
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAG
Sbjct: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAG 320
Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
E I++W CGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRM
Sbjct: 321 EPIIIW-----------CGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRM 369
Query: 381 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 440
VAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY+SD EMQS++SS G CP+SPC E
Sbjct: 370 VAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTE 429
Query: 441 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
RAVLDQL Y ++RLAGYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T
Sbjct: 430 RAVLDQLVGYLESRLAGYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATN 489
Query: 501 DMIMLLPDVTVSPCPAPYAPLL 522
+ I LPD TVSPCPAPYAP L
Sbjct: 490 EFINDLPDHTVSPCPAPYAPEL 511
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/468 (69%), Positives = 374/468 (79%), Gaps = 14/468 (2%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
SSS+ A V E W+ GLPP KV ++E+P + K +P+ YV+
Sbjct: 55 SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A DLQAGD AF V SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTY 294
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
S CKAML A D+V+LVVDRPYKAGE I++W CGPQ NS+L++NYGFVD
Sbjct: 295 RSNCKAMLTADGDSVRLVVDRPYKAGEPIIIW-----------CGPQTNSRLVLNYGFVD 343
Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 414
EDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLG
Sbjct: 344 EDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLG 403
Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 474
Y+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL EDEAML D NL
Sbjct: 404 YISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNL 463
Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 522
PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 464 EPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 639 bits (1649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/405 (75%), Positives = 345/405 (85%), Gaps = 11/405 (2%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
GSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S C
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNC 322
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
KAML AV D+V+LVVDRPYKAGE I+VW CGPQPNS+LL+NYGF+DEDNP
Sbjct: 323 KAMLTAVGDSVRLVVDRPYKAGEPIIVW-----------CGPQPNSRLLLNYGFIDEDNP 371
Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
YDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY+SD
Sbjct: 372 YDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISD 431
Query: 419 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 478
EMQS++SS G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML D NL PKK
Sbjct: 432 PDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKK 491
Query: 479 RVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 492 EVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPELK 536
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 598 bits (1542), Expect = e-168, Method: Compositional matrix adjust.
Identities = 305/492 (61%), Positives = 361/492 (73%), Gaps = 21/492 (4%)
Query: 38 PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
P FG+ V + R S ++ T V E SKK+E DLK WM + GLP
Sbjct: 45 PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104
Query: 93 CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
CKV L E +PS +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
S K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCV
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
VHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VW
Sbjct: 285 VHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVW---- 340
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
CGPQPNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++
Sbjct: 341 -------CGPQPNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTI 393
Query: 391 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 450
Q F ++ G+E EA+ DMLPY+RL +++D EM++V + GP+CPVS C ERAVL+QL Y
Sbjct: 394 QTFQIYKGKEMEAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQY 453
Query: 451 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 510
F+ RLAGY ++ + + D + KKRVA +L+ +EK +L L ++I LPD
Sbjct: 454 FEKRLAGYKSSHATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSA 510
Query: 511 VSPCPAPYAPLL 522
+SPC PY P L
Sbjct: 511 ISPCIGPYLPNL 522
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 293/443 (66%), Positives = 344/443 (77%), Gaps = 18/443 (4%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3 WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 60 TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 382
I+VW CGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++
Sbjct: 240 ILVW-----------CGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIV 288
Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
Q+N +L++Q F + GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERA
Sbjct: 289 QKNNRLTIQNFQIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERA 348
Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTA 500
VLDQLA YF+ R+A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V
Sbjct: 349 VLDQLAQYFQERIAKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVN 408
Query: 501 DMIMLLPDVTVS-PCPAPYAPLL 522
LPD +V+ PC + P L
Sbjct: 409 GFRNQLPDGSVAPPCCGDFVPKL 431
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 293/443 (66%), Positives = 344/443 (77%), Gaps = 18/443 (4%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3 WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 60 TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 382
I+VW CGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++
Sbjct: 240 ILVW-----------CGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIV 288
Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
Q+N +L++Q F + GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERA
Sbjct: 289 QKNNRLTIQNFQIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERA 348
Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTA 500
VLDQLA YF+ R+A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V
Sbjct: 349 VLDQLAQYFQERIAKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVN 408
Query: 501 DMIMLLPDVTVS-PCPAPYAPLL 522
LPD +V+ PC + P L
Sbjct: 409 GFRNQLPDGSVAPPCCGDFVPKL 431
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 281/440 (63%), Positives = 330/440 (75%), Gaps = 16/440 (3%)
Query: 85 MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
M + GLP C V L E + + +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1 MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS
Sbjct: 61 TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQA
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD YK G+
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDP 240
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 382
I VW CGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V
Sbjct: 241 IAVW-----------CGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVV 289
Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
Q+N +L++Q F ++ G+E EA+ DMLPY+RLG+++D E+++V + P+C VS C ERA
Sbjct: 290 QKNNRLTIQTFQIYKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERA 349
Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADM 502
VL+Q+ +F+ RLAGY S D D K+ VA +L+ +EK +L L ++
Sbjct: 350 VLNQIEHFFERRLAGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHEL 406
Query: 503 IMLLPDVTVSPCPAPYAPLL 522
I LPD +SPC PY P L
Sbjct: 407 IRELPDGAISPCIGPYLPNL 426
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 546 bits (1407), Expect = e-153, Method: Compositional matrix adjust.
Identities = 274/437 (62%), Positives = 326/437 (74%), Gaps = 34/437 (7%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 9 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 60 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 299
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
Y S CKAML AVD +V+L+VDRPYKAGE I+VW CGPQPNS+LL+NYGFV
Sbjct: 300 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVW-----------CGPQPNSRLLLNYGFV 348
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
DEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRL
Sbjct: 349 DEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRL 408
Query: 414 GYVSDTSEMQSVISSLG 430
GY+SD EMQ ++SS G
Sbjct: 409 GYISDPDEMQCILSSEG 425
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/383 (51%), Positives = 264/383 (68%), Gaps = 20/383 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ DLQAG+ A +P+ LV+TL+RV +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49 AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
W +I+ELDR +GRGQ+ +SPLLW E ++ YL GSP AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
VWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFALVPLGPPLL+
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFALVPLGPPLLS 228
Query: 294 YSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
YSS KAML +A VQL VDR Y GE I W CGPQPN +LL+NYG
Sbjct: 229 YSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAW-----------CGPQPNRRLLLNYG 277
Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
V ++NP+D++ + L DP +Q KR V Q+N + Q F + R+K +LPYL
Sbjct: 278 IVTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYL 335
Query: 412 RLGYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
RL + +D + ++ +++L C P+SP ER VL QLA + + RL Y T EDE ++
Sbjct: 336 RLAHCTDAASLK--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVII 393
Query: 470 TDYNLHPKKRVATQLVRMEKKML 492
P+++VA +L+R+EK +L
Sbjct: 394 RSTTAGPRQKVAARLLRIEKAIL 416
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/190 (91%), Positives = 182/190 (95%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1 MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61 VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQ 274
A+QSCVVHLQ
Sbjct: 181 AIQSCVVHLQ 190
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 130/159 (81%), Positives = 148/159 (93%)
Query: 364 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 423
++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248
Query: 424 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 483
SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308
Query: 484 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 522
LVR+EKK+LNACLQ TAD+I LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPIL 347
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 173/253 (68%), Positives = 206/253 (81%), Gaps = 11/253 (4%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
++ QKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++W
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIW--- 66
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 389
CGPQ NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL+
Sbjct: 67 --------CGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLA 118
Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
+Q F+V+ G+EK+ +++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 119 IQNFNVYVGKEKQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVG 178
Query: 450 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 509
Y ++RLAGYP TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD
Sbjct: 179 YLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDH 238
Query: 510 TVSPCPAPYAPLL 522
TVSPCPAPYAP L
Sbjct: 239 TVSPCPAPYAPEL 251
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 191/415 (46%), Positives = 262/415 (63%), Gaps = 20/415 (4%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ W+ ++G P KV L+ + + A+E LQ GD A +P L+VTL+RVL
Sbjct: 9 MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ T+AEL+TT KLSELACL LYL YEKK+GK+ W +I+ELDR +GRG +SPLLW
Sbjct: 66 EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
E + A L GSP EI R +GI++EY ELDTVW++AGSLF + P+ PTE F+F +F
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 317
+QAF AVQS VVHLQ V+L +RFALVP+GPPLL YSS KAML ++ V+L VDR Y
Sbjct: 186 RQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAY 245
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
+ GE+++ W CGPQPNS+LLINYG VDE NPYD+L + + ++DP Y+
Sbjct: 246 QPGEAVLAW-----------CGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRL 294
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
KR G + Q F + A A +LPYLRL + + ++++ V PV+P
Sbjct: 295 KRDRLAERGLSTQQTFQLQAAASLPA--QLLPYLRLVHSTREADVEGVKWE-EEAGPVAP 351
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
E VL+QL + + R + Y T+ EDEA++ D P+ VA +L+++EK +L
Sbjct: 352 ENELTVLNQLITHLRLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 166/251 (66%), Positives = 203/251 (80%), Gaps = 11/251 (4%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++W
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIW--- 66
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 389
CGPQ NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK +
Sbjct: 67 --------CGPQTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHA 118
Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
+Q F+V+ G+EK+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 119 IQNFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGG 178
Query: 450 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 509
Y ++RLAGYP TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I LPD
Sbjct: 179 YLESRLAGYPTTLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDH 238
Query: 510 TVSPCPAPYAP 520
TVSPCPA YAP
Sbjct: 239 TVSPCPAQYAP 249
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 338 bits (868), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 192/419 (45%), Positives = 255/419 (60%), Gaps = 25/419 (5%)
Query: 80 DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+L W+ +NG + +V + PS RP+ V A L AG+ A SVP L +TL+
Sbjct: 43 ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
R+ +E +AELLTT+KLSELACLALYLMYEKK KKSFW PYI+ELD+Q+ RG A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
LLW + EL + L GSP + +R GI++EY LDTVWFMAGSLF +YP+D+PTE F+F
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVD 314
E+F+QAF VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS K M+ +D +V+LVV
Sbjct: 220 ELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVS 279
Query: 315 RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTED 372
P +AG + W CGPQPNS+LL+NYG VDE NP+D+L L T D
Sbjct: 280 GPVEAGRPVAAW-----------CGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSD 328
Query: 373 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 432
P + KR V G + Q F V R +LPY+ L + ++ SV S
Sbjct: 329 PLFPAKRAVLSEAGLATQQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDT 384
Query: 433 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 491
+E A L L Y + R A Y L D ++ D + P+++VA +L ++EK +
Sbjct: 385 AGHDRELEAAALAALMAYVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 338 bits (868), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 166/280 (59%), Positives = 203/280 (72%), Gaps = 40/280 (14%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++W
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIW--- 66
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 389
CGPQ NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK +
Sbjct: 67 --------CGPQTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHA 118
Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
+Q F+V+ G+EK+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 119 IQNFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGG 178
Query: 450 YFKARLAGYPATLSEDEAM-----------------------------LTDYNLHPKKRV 480
Y ++RLAGYP TL+EDEAM L D +L PK+ V
Sbjct: 179 YLESRLAGYPTTLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEV 238
Query: 481 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 520
AT+LVR+EKKML+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 239 ATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 278
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 108/128 (84%), Positives = 117/128 (91%)
Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 1 ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61 YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120
Query: 267 QSCVVHLQ 274
QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 203/455 (44%), Gaps = 49/455 (10%)
Query: 45 RLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHN 104
R V + R +R + ++ S + ++ L L W+ K G P VIL
Sbjct: 54 RTVWNEGRRGLRGVARCSMSGNSMQSMA-----LHQLSEWLSKQGFPTQDVILT---GFG 105
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
E+ + AA D + G+ A +P + VT V+ + +A ++ L L+L
Sbjct: 106 EEGVGL---AAGRDFKEGEVALKIPENYTVTGVDVVNHPVVAAPAAGR--GDVIGLTLWL 160
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPTKAEILERAEG 223
MYE+ G+KS W PY++ SP+LW+ E + L GSP E+ +R+
Sbjct: 161 MYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPALEEVQQRSAA 213
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
++ EY +L S F + P P E F+ E FK AF + S V+L L FA
Sbjct: 214 LEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADL---FA 263
Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
LVP L + + +A L + D AV VDR YK GE + Y G + +
Sbjct: 264 LVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSY-----GRE-----R 312
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
N+ LLI YGFVDE+N D L +E L D K+ + Q+ S Q F ++ R
Sbjct: 313 SNADLLITYGFVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP 372
Query: 402 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 461
+ +L Y+RL + D + ++ + + E L L + +L Y
Sbjct: 373 ---TQLLTYMRLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGG 427
Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
+ ++ +L + + ++RVA QL EKK+L + +
Sbjct: 428 VDDEIRLLKNKKISQRERVAAQLRLCEKKILTSTM 462
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 123/444 (27%), Positives = 198/444 (44%), Gaps = 50/444 (11%)
Query: 81 LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
LK WM +NG+ P + + NE + A+ +++ GD F++P
Sbjct: 92 LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
L++T + E A+++T + LSE +AL ++EK +GK+SFW YI L
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
V LW+E +LA L GSP A ++ EY ++ L ++P +
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEIL 258
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------PPLLAY-SSKCKAML 302
P E T+E F+ AF + S + L +S ALVP P +Y ++ + +
Sbjct: 259 PREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLF 318
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
+ D V + DR YK E + + Y GP+ NS LL+ YGF + NPY+ +
Sbjct: 319 FSKTDEVVVYADRSYKKMEQVYISY-----------GPKGNSDLLLLYGFSLDRNPYNSV 367
Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 422
V +L+ D Y+ K+ G + F ++ R + ++L YLRL ++
Sbjct: 368 DVTVSLDENDELYERKKAFLSEAGLPPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLR 424
Query: 423 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YNLHPKK 478
+ L + E VLD L + KA +AGYP T +D ++ D L +
Sbjct: 425 GRTLEDLSFEKKQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQ 484
Query: 479 RVATQLVRMEKKMLNACL-QVTAD 501
R+A + R EK +L + VT D
Sbjct: 485 RMAVKHRRQEKVILRRTIAAVTKD 508
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 125/419 (29%), Positives = 187/419 (44%), Gaps = 44/419 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153
Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFE 257
W + E L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFS 204
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
FK AF V S V+L L FALVP G + SS+ + V+L VD+ Y
Sbjct: 205 AFKAAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRY 261
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
K G+ + Y + T DF LI YGF+DE + D + +E L + D
Sbjct: 262 KKGDQVFASYAQNLTSADF----------LIRYGFLDESDENDFIEIEVGLVSGDSLAPL 311
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
KR + Q G Q F V+ R + +L Y RL + D+ + I V
Sbjct: 312 KREILQEVGLTVPQKFPVYLNRFP---TQLLTYTRLARIQDSGLFAKITFEKDLI--VCQ 366
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
E L L + +L + T+ +D L NL K+RVA QL EK++L +
Sbjct: 367 TNEYETLMLLMADCRTKLLSFSDTMEDDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 182/415 (43%), Gaps = 42/415 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + L
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
E + L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSA 205
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
FK AF V S V+L L FALVP G + SS+ + V+L VD+ YK
Sbjct: 206 FKTAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYK 262
Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
G+ + Y + T DF LI YGF+DE + D + +E L + D K
Sbjct: 263 KGDQVFASYAQNLTSADF----------LIRYGFLDESDENDCIEIEVGLVSGDSLAPLK 312
Query: 379 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 438
R + Q G Q F ++ R + +L Y RL + D+ + I VS
Sbjct: 313 REILQEVGLTVPQKFPLYLNR---FPTQLLTYTRLARIQDSGLFAKITFEKDLI--VSQT 367
Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
E L L + +L T+ ++ L NL K+RVA QL EK++L
Sbjct: 368 NEYETLMLLMADCRTKLLSSSDTMEDEMQTLRRKNLSYKQRVAAQLRLKEKRILT 422
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 120/462 (25%), Positives = 209/462 (45%), Gaps = 63/462 (13%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
R S + + +G V ++ W+ +NG+ P ++++ E+ + +E
Sbjct: 58 RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116
Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
+ +S + G+ +P L+ T R L ET + + E + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
YEK +G SF+ PY+ L L +PL LWS +L L GSPT + + + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
REY ++ ++ Q P + ++ F+ F+ AF + S + ++R AL
Sbjct: 229 LREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIAL 278
Query: 285 VPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
VP LL +S C A + V +AV + VDR Y+ E + V Y
Sbjct: 279 VPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSY--------- 327
Query: 337 KCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 396
GP+ N +LL+ YGF E NP+D + + L+ DP Y +K + + GK +Q F ++
Sbjct: 328 --GPRSNQELLLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLY 385
Query: 397 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 456
R +M +LR + +++Q+ ++ VS E + LD+L +Y +L
Sbjct: 386 MDRYP---VEMAEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLR 437
Query: 457 GYPATLSEDEAMLTD----YNLHPKKRVATQLVRMEKKMLNA 494
YP +L +DE ++ D L +R+A + EK++L+A
Sbjct: 438 KYPTSLEDDEKIIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 153/326 (46%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P+ P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV + NP D + + +L
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K +++G + Q F V
Sbjct: 339 NDKCYKEKLDALKKHGLSTPQCFPVR 364
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 152/326 (46%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV + NP D + + +L
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K +++G + Q F V
Sbjct: 339 NDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 152/326 (46%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFT 289
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV + NP D + + +L
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K +++G + Q F V
Sbjct: 339 NDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 150/326 (46%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ V SLV+
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV + NP D + + +L
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K +++G + Q F V
Sbjct: 339 NDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 146/323 (45%), Gaps = 38/323 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + +K E+ + A ++++ G+ VP SLV+
Sbjct: 71 ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F +YP+ P E F
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVF 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ R ALVP +L +SS+ + L V
Sbjct: 232 NMETFKWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSSEVETFLDYDKSSKGVVFT 288
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV + NP D + + +L
Sbjct: 289 TDRPYQPGEQVFISY-----------GRKSNGELLLSYGFVPREGTNPSDSVELSLSLKK 337
Query: 371 EDPQYQDKRMVAQRNGKLSVQVF 393
D Y++K +++G Q F
Sbjct: 338 SDKCYKEKLEALKKHGLSVSQCF 360
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 104/417 (24%), Positives = 185/417 (44%), Gaps = 65/417 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A D+ GD +P +L +T + + + + + ++++E +A +L+YE+ +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W PY+ ++D V W + +LA+L GSP A ++REY+
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240
Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
L + G L +YP P EAF F+ ++ AF + S + L+ + ALVP
Sbjct: 241 L-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYA 295
Query: 289 PPLLAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
L+ +S +A + A D+ V L DR Y+ E I + Y
Sbjct: 296 -DLINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISY----------- 343
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQ 383
GP+ N++LL+ YGF E NP++ + V ++ DP ++K +
Sbjct: 344 GPKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLE 403
Query: 384 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
+ G+ + F +A R +ML YLRL ++ ++ +S E AV
Sbjct: 404 QVGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAV 460
Query: 444 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 496
L + +L+ YP + +D A++ D +L +R+A + R EK++L +
Sbjct: 461 LTSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 148/326 (45%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 67 ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W++ EL YL S + +ER + YN+L +F +YP P E F
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVF 227
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ ALVP +L +S + L +
Sbjct: 228 NIESFKWSFGILFSRLVRLP--SMGGNVALVPWA-DMLNHSCDVETFLDYDKTSKGIVFT 284
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV ++ NP D + + +L
Sbjct: 285 TDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGANPSDSVELSLSLKK 333
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K + + G + Q F +
Sbjct: 334 SDASYKEKLELLKNYGLSASQCFPIQ 359
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 155/338 (45%), Gaps = 42/338 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ K GL K++L S + A++ L+ G+ VP+ L++T +
Sbjct: 16 LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G ++ L E LA++L+ E + + S W PY L + S L W
Sbjct: 70 GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+E E+ +LT SP + + LE + Y +L ++F ++P P++ +T F
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAF 176
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRP 316
K AF + S +V L V + ALVP +L +S + + L + +V V DR
Sbjct: 177 KWAFGILFSRLVRLPSVG---KLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRA 232
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQY 375
Y++GE + + Y G + + +L + YGF+ E N +D + +E ++++DP +
Sbjct: 233 YQSGEQVFISY-----------GKRSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSF 281
Query: 376 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
+ K A G S Q F V R+ + +L Y RL
Sbjct: 282 EAKLRAANEQGLSSPQRFPV---RKDGFPAQLLAYARL 316
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 149/326 (45%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 65 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
E FK +F + S +V L S+ + ALVP ++ +S + + L + +V
Sbjct: 226 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 282
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV ++ NP D + + +L
Sbjct: 283 TDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKK 331
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K + ++ G Q F +
Sbjct: 332 SDESYKEKLELLKKYGLSGSQCFPIR 357
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 149/326 (45%), Gaps = 38/326 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 62 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 222
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
E FK +F + S +V L S+ + ALVP ++ +S + + L + +V
Sbjct: 223 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 279
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DRPY+ GE + + Y G + N +LL++YGFV ++ NP D + + +L
Sbjct: 280 TDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKK 328
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K + ++ G Q F +
Sbjct: 329 SDESYKEKLELLKKYGLSGSQCFPIR 354
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/327 (28%), Positives = 152/327 (46%), Gaps = 40/327 (12%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 65 ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P+ P E F
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L +S + + L D + Q VV
Sbjct: 226 NMETFKWSFGILFSRLVRLP--SMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVF 281
Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 369
DR Y+ E + + Y G + N +LL++YGFV + NP D++ + +L
Sbjct: 282 TTDRTYQPSEQVFISY-----------GKKSNGELLLSYGFVPREGTNPNDKVELLLSLK 330
Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K +++G + Q F +
Sbjct: 331 KSDKCYKEKSEAMKKHGLSTSQCFPIQ 357
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 148/322 (45%), Gaps = 40/322 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ NGLP K+ + +K E+ + A ++++ G+ VP SLV+T +
Sbjct: 81 LQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 134
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
E+L + + LA+YL+ E K S W YI L RQ S L W
Sbjct: 135 SCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYW 187
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ EL YL S + +ER + YN+L +F +YP P E F E F
Sbjct: 188 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETF 241
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRP 316
K +F + S +V L S+ + ALVP +L +S + + L D + Q VV DR
Sbjct: 242 KWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQ 297
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 374
Y+ GE + + Y G + N +LL++YGFV + NP D + + +L D
Sbjct: 298 YEPGEQVFISY-----------GKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKS 346
Query: 375 YQDKRMVAQRNGKLSVQVFHVH 396
Y++K +++G + Q F V
Sbjct: 347 YKEKLEALKKHGFSASQCFPVR 368
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 176/397 (44%), Gaps = 70/397 (17%)
Query: 9 SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
S+ + PP+R P H P S +S+ S R + R IR S++
Sbjct: 4 STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47
Query: 65 AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
RE + + E L+ W+ +GLP ++ + ++ E+ + A
Sbjct: 48 PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101
Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
++++ G+ VP SLV+T + G + E++ N + + +A YL+ E S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTV 234
YI L RQ S L W+ EL AYL SP + ++R + YN+L D +
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ LF P E + E F +F + S +V L S+ + ALVP +L +
Sbjct: 215 FSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWA-DMLNH 264
Query: 295 SSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
S + + L D + Q +V DR Y+ GE + + Y G + + +LL++YG
Sbjct: 265 SPEVETFL-DFDKSSQGIVFTTDRSYQPGEQVFISY-----------GKKSSGELLLSYG 312
Query: 352 FVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
FV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 313 FVPKEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNG 349
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 124/472 (26%), Positives = 195/472 (41%), Gaps = 81/472 (17%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+DL L+ W+ KNGL + E ++ + V A +D + G+ VP L+ T
Sbjct: 66 DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122
Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + N + +LL +K S CLAL+L+ EK SFW PYI+ L + G
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+ L ++ EL L GSPT ++ + +Y + LFQ +
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIH-------DLFQIRKDILHIN 226
Query: 253 AFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
AFT++ F A AV Q+ V +L+ +AL+P + D+
Sbjct: 227 AFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YALIPAWDMCNHDHGDLQTFWDVNSDST 285
Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
+ R YK GE + ++Y GP+PNS LL++ GFV E+N +D L + L
Sbjct: 286 ESHAMRAYKKGEQVYIFY-----------GPRPNSDLLLHAGFVYENNRFDALAIRVRLA 334
Query: 370 TEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 428
+ +DK R++ N K+ Q + G D++ +LR+ + + E+Q V+ +
Sbjct: 335 PDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----LAVDLMAFLRI-HAMNEQELQQVLGA 389
Query: 429 -------------------------LGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 463
P ++ E A L + L+ YP TL
Sbjct: 390 YDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNELAALQLAEAKCLSLLSLYPTTLQ 449
Query: 464 ---------EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 506
ED+A L +L P R T L EK++LN L D I LL
Sbjct: 450 VANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEILNRTL----DAIRLL 497
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 146/316 (46%), Gaps = 38/316 (12%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DR Y+ GE + + Y G + + +LL++YGFV ++ NP D + + +L+
Sbjct: 289 TDRSYQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDK 337
Query: 371 EDPQYQDKRMVAQRNG 386
D Y++K +RNG
Sbjct: 338 SDNCYKEKLQALKRNG 353
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/327 (28%), Positives = 151/327 (46%), Gaps = 40/327 (12%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ ++ N R + A ++++ G+ VP SLV++
Sbjct: 65 ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E E+L N + + +A YL+ E K S W YI L RQ
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P P E F
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L ++ + + L D A Q VV
Sbjct: 226 NIETFKWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVF 281
Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 369
DR Y+ GE + + Y G + N +LL++YGFV ++ NP D + + +L
Sbjct: 282 TTDRAYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGSNPSDSVELLLSLK 330
Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVH 396
D Y++K +++G + Q F +
Sbjct: 331 KSDKCYKEKLEALKKHGLRASQCFPIQ 357
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/425 (25%), Positives = 181/425 (42%), Gaps = 69/425 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
A + GD +P L +T + R LG + + E ++E +A L++EK +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264
Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
G+ SF+ Y+ E+D V W + +LA+L GSP A ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316
Query: 228 YNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
Y++L + G L ++P P E +TFE ++ AF + S + L+ + + R A+V
Sbjct: 317 YDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMV 371
Query: 286 PLGPPLLAYSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
P L+ +S+ +A + A ++ V L DR Y+ E + + Y
Sbjct: 372 PYA-DLINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISY-------- 422
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVA 382
G + N++LL+ YGF E NPY+ + V ++ EDP +K
Sbjct: 423 ---GQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFL 479
Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
G+ F +A R +ML YLRL ++ +S +S E +
Sbjct: 480 LSVGRDQTVDFPCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEAS 536
Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQV 498
VL + K +L +P T +D A++ D + +R+A + R EK++L L
Sbjct: 537 VLRSVVAAVKYQLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAA 596
Query: 499 TADMI 503
I
Sbjct: 597 LEKQI 601
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 113/434 (26%), Positives = 190/434 (43%), Gaps = 51/434 (11%)
Query: 49 RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
R R R+++ L+ AGS E+ + E + +LK+W+ + GLPP K+ P
Sbjct: 26 RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
+ A++ + G++ S+P LV+T L + LL L + LAL+
Sbjct: 84 GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L ++ G W PY+R L + G L WSE E+ +L GS ++ LE
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
+ + E+ V A + + + AF + AF + S +V L L + A
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEA 244
Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
L+P LL + + L +A + AV L +R Y+AGE +++ Y G +
Sbjct: 245 LLPWA-DLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISY-----------GQK 292
Query: 342 PNSKLLINYGFVDE--DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
+ +LL++YGF + NP+D RL++E L D K +++G + Q+F +
Sbjct: 293 TSGELLLSYGFCPDLGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL-- 348
Query: 398 GREKEAISDMLPY--LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
R A +++ Y V E + + L + P ++ A L+ + KA L
Sbjct: 349 -RMAAAPFELVHYTAFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAAL 407
Query: 456 AGYPATLSEDEAML 469
A YP + D A L
Sbjct: 408 AAYPRSFDGDRAEL 421
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 145/316 (45%), Gaps = 38/316 (12%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R LVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
DR Y+ GE + + Y G + + +LL++YGFV ++ NP D + + +L+
Sbjct: 289 TDRSYQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDK 337
Query: 371 EDPQYQDKRMVAQRNG 386
D Y++K +RNG
Sbjct: 338 SDNCYKEKLQALKRNG 353
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 99/394 (25%), Positives = 174/394 (44%), Gaps = 46/394 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ Q +P+D+ + F F F +C+ + L+PL
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCLE-------GQSLVLIPLADLAN 245
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P + A Y+ + + A L + + L P KAG+ +++ Y+++ + N
Sbjct: 246 HSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLN----------KSN 295
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
++L ++YG + + + + + D Y DK +A+ NG F + E+
Sbjct: 296 AELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPL 353
Query: 404 ISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGY 458
++MLPYLRL + D ++S+ S G + P+SP E + + D + L+GY
Sbjct: 354 PANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGY 413
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
T++EDE +L + ++ P+ +A + EKK+L
Sbjct: 414 STTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 447
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/391 (24%), Positives = 176/391 (45%), Gaps = 36/391 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSLARRFALVP 286
+ Q +P+D+ + F F + + +AF ++ ++ L + + +
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPD 253
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
+ P A+ + A L + + L P KAG+ +++ Y+++ + N++L
Sbjct: 254 ITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLN----------KSNAEL 302
Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
++YG + + + + + D Y DK +A+ NG F + E+ ++
Sbjct: 303 ALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPAN 360
Query: 407 MLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPAT 461
MLPYLRL + D ++S+ S G + P+SP E + + D + L+GY T
Sbjct: 361 MLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTT 420
Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
++EDE +L + ++ P+ +A + EKK+L
Sbjct: 421 IAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 451
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 148/317 (46%), Gaps = 40/317 (12%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ L EK E+ + A ++++ G+ VP +LV+T
Sbjct: 74 ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N + +++ + + LA YL+ E S W YI L RQ
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TE+ AYL SP + + R + YN+L +F ++P P + +
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRD------RIFSKHPDLFPEKVY 234
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E F+ +F + S +V L+ S+ + ALVP +L +S + A L D + Q +V
Sbjct: 235 NMENFRWSFGILFSRLVRLE--SMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVF 290
Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 369
DR Y+ GE + + Y G + + +LL++YGFV ++ NP D + +L
Sbjct: 291 TTDRSYQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLK 339
Query: 370 TEDPQYQDKRMVAQRNG 386
D Y++K +++G
Sbjct: 340 KSDECYKEKLQALKKHG 356
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 102/412 (24%), Positives = 172/412 (41%), Gaps = 57/412 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A + GD +P L +T R + + + + ++E +A L++EK G +
Sbjct: 50 ARRSINDGDELLKIPMDLCIT--RKSARKALGKDALQDGINEYLAIACQLIHEKYVLGDE 107
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W Y+ L V W + +LA+L GSP A ++REY+ L
Sbjct: 108 SEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-- 160
Query: 234 VWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ G L ++P P E FT+E + AF + S + L+ + + R A+VP L
Sbjct: 161 ---LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYA-DL 216
Query: 292 LAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
+ +S+ A + A ++ V L DR Y+ E + + Y G +
Sbjct: 217 INHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISY-----------GQK 265
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTE-------------DPQYQDKRMVAQRNGKL 388
N++LL+ YGF E NPY+ + V ++ DP Q+K G+
Sbjct: 266 SNAELLLLYGFALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVEFLASVGRD 325
Query: 389 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 448
F +A R +ML +LRL ++ ++ +SP E AVL +
Sbjct: 326 QTVDFPCYADRYP---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTISPANEAAVLSSVV 382
Query: 449 DYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 496
+ K +L YP + +D ++ D L +R+A + R EK++L L
Sbjct: 383 EAVKYQLNLYPQSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLLKRTL 434
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 137/310 (44%), Gaps = 37/310 (11%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ A+L+ + K L+ ++ +N+L+ FM + P E F E F
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESF 201
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
K AF + S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQ 376
++ + + + Y G + N +LL+ YGFV N D + + L+ D Y+
Sbjct: 259 ESNKEVFISY-----------GKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYE 307
Query: 377 DKRMVAQRNG 386
K + +G
Sbjct: 308 AKLGALKEHG 317
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 170/397 (42%), Gaps = 47/397 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ L +AL L+ E +G
Sbjct: 84 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--NE 230
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +E
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGVK-EYVQSE 191
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPL 287
D+V S + D+ + TF+ F AF ++S V + K++L LV
Sbjct: 192 FDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALIPFADLVNH 247
Query: 288 GPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P + + S K K + + L K+G+ I + Y++ + N
Sbjct: 248 SPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLD----------KSN 296
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
++L ++YGFV+ + D V ++ DP Y DK +A+ NG F V +
Sbjct: 297 AELALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPL 354
Query: 404 ISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLA 456
MLPYLRL + T SV L P+SP E ++ + D K+ LA
Sbjct: 355 PPQMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALA 412
Query: 457 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
Y T+ EDE + NL P+ +A + EKK+L
Sbjct: 413 DYHTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVLQ 449
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 111/444 (25%), Positives = 181/444 (40%), Gaps = 56/444 (12%)
Query: 68 REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
R E+ L L WM + G L P I+++ V A D+ G+
Sbjct: 2 RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54
Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
+P + + TLER +G ++L N + LA +L+ EK +G SFW P++
Sbjct: 55 QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL +SE E A + GS LER ++R+ E + +
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEVQRQSFEQE---------Y 151
Query: 243 QQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
Q +P E F FE + A +++ S + L+ +LVPL + + +
Sbjct: 152 AQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGP--SLVPLSD-MFNHRQPPDVL 208
Query: 302 LAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNP 358
+ +D +++ R AG I Y G + + L++ GFV D +
Sbjct: 209 WSTSEDGQTFRMIAQRAVPAGTEIHTHY-----------GAKSSDVFLLHSGFVPDGNEE 257
Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
D + + L DP K+ + + F V + A + +LR+ + S
Sbjct: 258 NDEVYLSVGLPPGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWSVFSFLRMAHASP 317
Query: 419 TSEM---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ ++S I PVS E VL LA + RL +P TL EDE +L + L
Sbjct: 318 DEFLALSNRLLSGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLEEDERLLREGPLS 377
Query: 476 PKKRVATQLVRMEKKMLNACLQVT 499
P +R L R EK++L L++T
Sbjct: 378 PNERSCVLLRRQEKRLLGDYLELT 401
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 109/421 (25%), Positives = 174/421 (41%), Gaps = 78/421 (18%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DL W+ NGL K+ L+ + + A+E+++ G+A V S ++T+ER
Sbjct: 66 DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER E + E+ + +P DI A
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALR 219
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ AF + S ++ L ++ ALVP +L + C A + AV L DR
Sbjct: 220 W-----AFDILFSRLIRLD--AMGGELALVPWAD-MLNHKPGCAAFIDLNGSAVNLTTDR 271
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDP 373
Y AGE + Y G +P+S+LLI+YGF E +NP D + ++ DP
Sbjct: 272 AYAAGEQVWASY-----------GQRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDP 320
Query: 374 QYQDKRMVAQRNGKLSVQVFHVH-AGREKEAI-------------SDMLPYLRLGYVSDT 419
Q K V +R G V+ F + G ++ + S++ R +
Sbjct: 321 YAQAKADVLRRMGLSPVETFPLRLNGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSA 380
Query: 420 SEMQSVISSL------------GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 467
+ QS+ S+ G I P E AV + LAD L+ YP +L +D+
Sbjct: 381 NFGQSIFDSVRGLAQGQARGKQGVILGGVPG-EIAVREMLADMCAEALSAYPNSLEKDKG 439
Query: 468 M 468
+
Sbjct: 440 I 440
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 171/406 (42%), Gaps = 66/406 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 348
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
S F+ +I +E TF F AF ++S V + + AL+
Sbjct: 349 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 398
Query: 286 PLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
P + ++ K K + D L K+GE I + Y++
Sbjct: 399 PFADLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLD----- 452
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F +
Sbjct: 453 -----KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDI 507
Query: 396 HAGREKEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQL 447
G E++ MLPYLRL + T +V L PVS E A+ +
Sbjct: 508 VLG---ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAICQVI 562
Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
+ K+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 563 RNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 608
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 106/433 (24%), Positives = 181/433 (41%), Gaps = 37/433 (8%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ + KE + V D +AG A VP S V
Sbjct: 48 EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++EL + EL LAL+L E+ +G S W PY++ L +
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152
Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W++ + A L GSP A+ +ER++ + EY + V + P P EA
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEV-------IKSDPSSYPPEA 205
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML-AAVDDAVQLV 312
+ F + A+ + + A+ +ALVPL + + +L + D V
Sbjct: 206 YEFLTEARFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRC 265
Query: 313 VDRPYKAGESIVVWYNISFTGHD-----FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 367
Y + VV + + + N +L +N G+VD+ +P D + ++
Sbjct: 266 GPADYDVDTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTD 325
Query: 368 LNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 426
+ T D + K+ V + G + Q F V+ R + + YLR V D EM +V
Sbjct: 326 IQTSDRLFTAKKQVLEGMGFTAADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVS 382
Query: 427 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 486
I VS E +L L + +A Y ++ +L + P + + +R
Sbjct: 383 FEEDKI--VSVMNEYEILQILMGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLR 440
Query: 487 M-EKKMLNACLQV 498
M EKK++ A +
Sbjct: 441 MSEKKLIGATMNA 453
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/405 (25%), Positives = 169/405 (41%), Gaps = 64/405 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 90 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 194
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
S F+ +I +E TF F AF ++S V + + AL+
Sbjct: 195 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 244
Query: 286 PLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
P + ++ K K + D L K+GE I + Y++
Sbjct: 245 PFADLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLD----- 298
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F +
Sbjct: 299 -----KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDI 353
Query: 396 HAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLA 448
G + MLPYLRL + T +V L PVS E A+ +
Sbjct: 354 VLG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIR 409
Query: 449 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
+ K+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 410 NACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 454
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 146/319 (45%), Gaps = 40/319 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
F+ +F + S +V L S+ R ALVP +L +S + + L + DR
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 374
Y+ GE + + Y G + + +LL++YGFV ++ NP D + + +LN D
Sbjct: 292 YQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKC 340
Query: 375 YQDKRMVAQRNGKLSVQVF 393
Y++K +RNG + F
Sbjct: 341 YKEKLQALKRNGLSEFESF 359
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 146/319 (45%), Gaps = 40/319 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
F+ +F + S +V L S+ R ALVP +L +S + + L + DR
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 374
Y+ GE + + Y G + + +LL++YGFV ++ NP D + + +LN D
Sbjct: 292 YQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKC 340
Query: 375 YQDKRMVAQRNGKLSVQVF 393
Y++K +RNG + F
Sbjct: 341 YKEKLQALKRNGLSEFESF 359
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 152/354 (42%), Gaps = 45/354 (12%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ +E +AL L+ E+ +G +SFW YI L G + W ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
SP + ++ E+ A L D E FTFE ++ AF + S
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265
Query: 271 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 320
+ L+ A+VP P + + + A +D V L DR YK
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325
Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
E + + Y GP+ N+ LL+ YGF + NP++ + + + +D Y K
Sbjct: 326 EQVFISY-----------GPKSNADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKER 374
Query: 381 VAQRNGK-LSVQVFHVHAGREKEAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPC 438
A+ G+ +S F ++A R + +++ +LR+ + D + + + +S
Sbjct: 375 FARGAGRDVSSAAFPLYADRFPD---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLD 431
Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
E AVLD + D A +A YPA +D + D L +R+A +LV EK++L
Sbjct: 432 NELAVLDTIRDACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 132/280 (47%), Gaps = 34/280 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +++ G+ VP SLV++ + N + +++ + + + LA YL+ E
Sbjct: 13 LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 73 SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+F ++P P E + E F+ +F + S +V L S+ + ALVP +
Sbjct: 126 ------RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWA-DM 176
Query: 292 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
L ++ + A L D + Q +V DR Y+ GE + + Y G + + +LL+
Sbjct: 177 LNHNPEVDAFL-DFDKSSQGIVFTTDRSYQPGEQVFISY-----------GKKSSGELLL 224
Query: 349 NYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
+YGFV ++ NP D + +LN D Y++K +R+G
Sbjct: 225 SYGFVPKEGTNPNDSVEFSVSLNKSDDCYREKLQALKRHG 264
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 135/310 (43%), Gaps = 37/310 (11%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ +L+ + K L+ ++ +N+L+ FM + P E F + F
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSF 201
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
K AF + S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQ 376
++ + + Y G + N +LL+ YGFV N D + + L+ D Y+
Sbjct: 259 ESNREVFISY-----------GKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYE 307
Query: 377 DKRMVAQRNG 386
K + +G
Sbjct: 308 AKLGTLKEHG 317
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/434 (22%), Positives = 185/434 (42%), Gaps = 46/434 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
S +E++ W+ N + V ++ H + + + A+ D + G+ ++P
Sbjct: 71 SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+++T + N + L+ N++ + LAL+++ E SFWLPY++ L
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ SPL ++ +L L SPT +E++ + I R+Y + F L + P
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFNL-FQGHELASKLP 232
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAV 305
I + ++ ++ A +V + + + R AL+PL + + +
Sbjct: 233 --IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCNHTNGQITTDFSLK 290
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
+D + AG + ++Y G + N++LLI+ GFV N DRL +
Sbjct: 291 NDRSECFSLEGTVAGAQVFIFY-----------GSRSNAELLIHNGFVYPQNHSDRLTIR 339
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ DP + K V R + ++F +H G SD L +LR+ +++ ++++
Sbjct: 340 LGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLRTA 397
Query: 426 ISSLGPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 478
++ I VS ER LA L YP + ED +L +L
Sbjct: 398 LACRQQISKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLSTHA 457
Query: 479 RVATQLVRMEKKML 492
R+A QL EK +L
Sbjct: 458 RLAVQLRHCEKNIL 471
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/466 (22%), Positives = 203/466 (43%), Gaps = 58/466 (12%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 73 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FTF+ ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + +K+GE I ++Y G + N++ +I+ GF E+N +DR+ +
Sbjct: 291 EDDRCECVALQDFKSGEQIYIFY-----------GTRSNAEFVIHNGFFFENNLHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 415
+ ++ D Y K V R G + VF +H E + +L +LR+ G+
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGH 398
Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTD 471
+ + + + PVS E + +L + +AR L Y T+ +D +L
Sbjct: 399 LIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQ 454
Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
++ +A +L R+EK++L L+ +D L + P P
Sbjct: 455 PDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 500
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/466 (22%), Positives = 203/466 (43%), Gaps = 58/466 (12%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 72 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FTF+ ++ A +V + + +R AL+PL +
Sbjct: 230 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 289
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + +K+GE I ++Y G + N++ +I+ GF E+N +DR+ +
Sbjct: 290 EDDRCECVALQDFKSGEQIYIFY-----------GTRSNAEFVIHNGFFFENNLHDRVKI 338
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 415
+ ++ D Y K V R G + VF +H E + +L +LR+ G+
Sbjct: 339 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGH 397
Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTD 471
+ + + + PVS E + +L + +AR L Y T+ +D +L
Sbjct: 398 LIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQ 453
Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
++ +A +L R+EK++L L+ +D L + P P
Sbjct: 454 PDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 499
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 196/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 72 GKREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L+ T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 126 LLXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 182 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 230 NKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLITTGYNL 289
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 290 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 338
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 415
+ ++ D Y K V R G + VF +H E + +L +LR+ +
Sbjct: 339 KLGVSKSDRLYAXKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCXTEEELKEH 397
Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ S + + + PVS E + L D L Y T+ ED+++L +++L
Sbjct: 398 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLS 457
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ + A +L EK++L ++ A
Sbjct: 458 VRAKXAIKLRLGEKEILEKAVKSAA 482
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/445 (23%), Positives = 195/445 (43%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-- 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPQA 230
Query: 247 YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+ +P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 HKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEGELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+ L ++ L
Sbjct: 399 LLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLKNHGLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAVKLRLGEKEILEKAVESAA 483
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/423 (23%), Positives = 192/423 (45%), Gaps = 49/423 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ DD + V + ++AGE I ++Y G + N++ +
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFV 240
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E + +
Sbjct: 241 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQL 299
Query: 408 LPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
L +LR+ +++ S + + + PVS E + L D L Y
Sbjct: 300 LAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTY 359
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPY 518
T+ ED+++L +++L + ++A +L EK++L ++ A + P + +SP
Sbjct: 360 KTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGK 414
Query: 519 APL 521
APL
Sbjct: 415 APL 417
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 106/445 (23%), Positives = 199/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H+ E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED++ L D++L
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKDHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRAAMAIKLRLGEKEILEKAVKSAA 483
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/442 (23%), Positives = 196/442 (44%), Gaps = 50/442 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 346
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 347 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 405
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED++ L +++L
Sbjct: 406 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 465
Query: 476 PKKRVATQLVRMEKKMLNACLQ 497
+ +A +L EK++L ++
Sbjct: 466 ARATMAIKLRLGEKEILERAVK 487
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 199/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L +++L
Sbjct: 399 LLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 231 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED++ L +++L
Sbjct: 399 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 ARATMAIKLRLGEKEILERAVKSAA 483
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 106/463 (22%), Positives = 202/463 (43%), Gaps = 58/463 (12%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
E+ +L W +NG L E P + A+ +++A + VP L++
Sbjct: 1 EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54
Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 55 TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DI 249
++PL ++E E+ YL + ++ + + R+Y +F + Q +P +
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNANKL 158
Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDD 307
P ++FTF+ ++ A +V + + +R AL+PL +S DD
Sbjct: 159 PLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTGYNLEDD 218
Query: 308 AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 367
+ V + +K+GE I ++Y G + N++ +I+ GF E+N +DR+ ++
Sbjct: 219 RCECVALQDFKSGEQIYIFY-----------GTRSNAEFVIHNGFFFENNLHDRVKIKLG 267
Query: 368 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSD 418
++ D Y K V R G + VF +H E + +L +LR+ G++
Sbjct: 268 VSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIG 326
Query: 419 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNL 474
+ + + PVS E + +L + +AR L Y T+ +D +L ++
Sbjct: 327 DHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDM 382
Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
+A +L R+EK++L L+ +D L + P P
Sbjct: 383 TFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 425
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 199/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L +++L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 105/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E E+ + A+ D++A + VP
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 409
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 410 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 469
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 470 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 518
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 519 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 577
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED + L +++L
Sbjct: 578 LLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLRNHDLS 637
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 638 VRAAMAVKLRLGEKEILERAVKSAA 662
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 103/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 346
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 347 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 405
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED++ L +++L
Sbjct: 406 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 465
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 466 ARATMAIKLRLGEKEILERAVKSAA 490
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 138/338 (40%), Gaps = 68/338 (20%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + ELA + S E + R I+ E+NE+ + FQ+Y + + ++ F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASF 195
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD------------ 307
K A+V C ++ + A+VP + + AML D
Sbjct: 196 KHAYVTATVC-SRAWRIDGLEKLAMVPFA-DFMNHDWSSNAMLTYDTDNGSTEVEEVKVY 253
Query: 308 ---------AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
QL D+ Y AGE + ISF GP N+ L +++GF NP
Sbjct: 254 SDCLDIALFCAQLFADKNYAAGEQV----TISF-------GPLCNASLALDFGFTVPYNP 302
Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 396
+D++ + ++ D ++K +Q H H
Sbjct: 303 WDKVQLWLGISRRDSLRKEK-----------LQYLHAH 329
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 103/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 346
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 347 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 405
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED++ L +++L
Sbjct: 406 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 465
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 466 ARATMAVKLRLGEKEILERAVKSAA 490
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 111/447 (24%), Positives = 191/447 (42%), Gaps = 53/447 (11%)
Query: 70 VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
S + L +L W+ + G P +++ +E V A + AG+
Sbjct: 10 AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61
Query: 128 VPNSLVVTLERVLGNET-----IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
VP + ++TLE L E+ IAE L + +E LA +L+ EK + + SFW PYI
Sbjct: 62 VPRTHMLTLE--LARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDS 116
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL + E A L G + +A+ ++ +Y SL
Sbjct: 117 LPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYL----------SLC 160
Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML 302
Q P E FT F A ++V S + L+K + LVP+ +L + +
Sbjct: 161 QNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TLVPMAD-MLNHRRPPDVLW 215
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
+D V+ KA ++ + HD G + N +L+++GFV +DN +D
Sbjct: 216 ETTEDGESFVM----KANNAVAAGDEV----HD-SYGAKSNDLMLLHFGFVTDDNEHDEA 266
Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTS 420
+ + DP K+M+ + + F + +LR+ +D
Sbjct: 267 FLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRIAAAVPNDIE 326
Query: 421 EMQS-VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 479
++ S V+S + P+S E VL+ LA +ARL+ +P +L++DE +L +L P R
Sbjct: 327 DISSRVMSGERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLRGESLSPNAR 386
Query: 480 VATQLVRMEKKMLNACLQVTADMIMLL 506
+ R EK+++ L++T + LL
Sbjct: 387 NCVLVRRAEKQLIEDYLEMTRVCLKLL 413
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/416 (24%), Positives = 189/416 (45%), Gaps = 71/416 (17%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
+++ +R +H A + + +P S ++TLE + ET +A+ + KL+ L+
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227
Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L+ +L+ E+K K+S W PY+ L + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
E+ IKR+Y+++ +V F +Y TFE F A + S V LQ ++
Sbjct: 281 REKKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-IN 326
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWYNISFT 332
+ A VPL L + K DD +Q + D P GE + Y
Sbjct: 327 EQKTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSY----- 377
Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQ 391
G + NS+ +NYGF++ DN + + + + EDP + K+ + G + +
Sbjct: 378 ------GRKCNSRFFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFK 429
Query: 392 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL----------------GPICPV 435
V+ + +++ +S+ + YLR + D S++ ++SSL P+
Sbjct: 430 VYRILENYQEQNVSEFMSYLRFILIRDNSKL-LMLSSLHEQQTENSENLSGYKPQKTPPI 488
Query: 436 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 491
S E + ++++ + ++ Y TL ED+ +L NL +R L EK++
Sbjct: 489 SIQNETDMWVRISNMCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 170/390 (43%), Gaps = 37/390 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L S + WSE EL + G+ L E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ S +P + + F I + +AF ++ Q + L L+ P +
Sbjct: 190 EEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPSI 244
Query: 292 LA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
Y+ + K A L + D L KAGE +++ Y++ + N++L +
Sbjct: 245 TTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLD----------KSNAELAL 294
Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
+YGF++ + + ++ DP + DK +A+ NG + F + G+ A ML
Sbjct: 295 DYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--ML 352
Query: 409 PYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 463
PYLRL + D ++S+ + G + PVS E + + D K+ L+GY T+
Sbjct: 353 PYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIE 412
Query: 464 EDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
EDE + + NLHP+ +A + EKK+L
Sbjct: 413 EDEKLKEEGNLHPRLEIAVGVRTGEKKVLQ 442
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 97/395 (24%), Positives = 172/395 (43%), Gaps = 47/395 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L S + WSE EL + G+ L E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ S Q +P+ + T + F AF ++S + + L+PL
Sbjct: 190 EEVILPHS--QLFPFPV-----TLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLIN 239
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P + Y+ + K A L + D L KAGE +++ Y++ + N
Sbjct: 240 HSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLD----------KSN 289
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
++L ++YGF++ + + ++ DP + DK +A+ NG + F + G+ A
Sbjct: 290 AELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPA 349
Query: 404 ISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGY 458
MLPYLRL + D ++S+ + G + PVS E + + D K+ L+GY
Sbjct: 350 A--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGY 407
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 408 LTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVLQ 442
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 169/396 (42%), Gaps = 41/396 (10%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 83 DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + PLLW+ EL L GSP + + + R E EY +
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASI----- 189
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPLGPPLLAYS 295
+ P D P +A+ F + + AFV A+ + + ++ A +A+VPL LL
Sbjct: 190 --ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAMVPL-VDLLPLV 245
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS------FTGHDFKCG---------P 340
++ A V +P A + VV Y+ + + +D +
Sbjct: 246 GSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLA 305
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGR 399
+ L + G VDE + D L A+ D Y+ KR + + G Q F V A R
Sbjct: 306 RNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADR 365
Query: 400 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 459
+L Y+R V D E+ SV I VSP E VL L + LA Y
Sbjct: 366 MP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQDAREMLAEYE 420
Query: 460 ATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 494
++ E E + L + L ++RVA +L EK+++NA
Sbjct: 421 SSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 88 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 141
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 142 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 197
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 198 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 245
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 246 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 305
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 306 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 354
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 355 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 413
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+++L + +L
Sbjct: 414 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 473
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 474 VRAKMAIKLRLGEKEILEKAVKSAA 498
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKTVKSAA 483
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 341
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 342 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 400
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+++L + +L
Sbjct: 401 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 460
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 461 VRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 341
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 342 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 400
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+++L + +L
Sbjct: 401 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 460
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 461 VRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L ++ A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 199/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQSLPGE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED++ L +++L
Sbjct: 399 LLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 VIS--------SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
++ +LG PVS E + L D L Y T+ ED+A+L L
Sbjct: 399 LLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLKGPELP 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ ++A +L EK++L +Q A
Sbjct: 459 TRMKMAVKLRLGEKEILERTVQSAA 483
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E S E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y + ED++ L +++L
Sbjct: 399 LLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+A L ++L
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKGHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 IRATMAVKLRLGEKEILERAVKSAA 483
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 106/450 (23%), Positives = 190/450 (42%), Gaps = 76/450 (16%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W +NG V +K++P + + + A E ++ +P LV+T ++
Sbjct: 50 WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
++ EL L+GS E L+ I R+Y + F L + P+ + FT++++
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLY 210
Query: 260 KQAFVAV---QSCVVHLQKVSL----------ARRFALVPL--------GPPLLAYSS-- 296
+ A AV Q+ V + L A ALVPL G L Y S
Sbjct: 211 RWAVSAVMTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSA 270
Query: 297 ---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
+C AM R + GE + ++Y G + N++ I+ GFV
Sbjct: 271 SMVRCYAM-------------RDFDKGEEVTIFY-----------GKRTNAEFFIHNGFV 306
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
EDN YD + ++ ++ +DP + K + + + LS+ R++ D+ +LR+
Sbjct: 307 FEDNRYDAVDIKLGVSKKDPLFAVKSKLCE-DHDLSLSGTFALVARDRPVSEDLSTFLRI 365
Query: 414 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 473
+ D S+ ++ S I S R L L + L +P + E E ++ D
Sbjct: 366 LVLKDASQPEAF--SAEHILTSSDSNARDALTFLVVRIELLLKAFPKSDEEYEDIIKDGA 423
Query: 474 LHPKKRVATQLVRMEKKMLNACLQVTADMI 503
+ + ++A +L +E K+L + L+ + +
Sbjct: 424 SNARVKMAARLRLLESKVLASVLETLGNHV 453
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 171/397 (43%), Gaps = 47/397 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ +L G+ VP L + + V ++ + L ++L ++ E +G
Sbjct: 86 LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ G +P + TF+ F AF ++S V + + AL+P L+
Sbjct: 199 AK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLALIPFAD-LI 247
Query: 293 AYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
+S+ + K L D L K+GE + V Y++ +
Sbjct: 248 NHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLD----------K 296
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F V G +
Sbjct: 297 SNAELALDYGFTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--E 354
Query: 402 EAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLA 456
M+ YLRL + T ++++ + G + PVS E ++ + K+ L
Sbjct: 355 SLPPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALT 414
Query: 457 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 415 AYHTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVLQ 451
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+ +L +++L
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 104/444 (23%), Positives = 195/444 (43%), Gaps = 58/444 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 305
+P +AFT+E ++ V S + + L + G P + +
Sbjct: 231 NKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKPQITTGYNLE------ 281
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 282 DDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 330
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D Y K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 331 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEEELKEHL 389
Query: 426 --------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
I +LG PVS E + L D L Y T+ ED+A L ++NL
Sbjct: 390 LGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKNHNLSV 449
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 450 RATMAIKLRLGEKEILEKAVKSAA 473
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+ +L +++L
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H + + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED++ L +++L
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H + + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED++ L +++L
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 98/429 (22%), Positives = 190/429 (44%), Gaps = 50/429 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD------ 418
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 419 ---TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+++L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458
Query: 476 PKKRVATQL 484
+ ++A +L
Sbjct: 459 VRAKMAIKL 467
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 102/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 87 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 244
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 245 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLITTGYNL 304
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 305 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 353
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 354 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKDH 412
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ + + + PVS E + L D L Y T+ ED A L + +L
Sbjct: 413 LLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLKNSDLS 472
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 473 VRATMAIKLRLGEKEILEKAVKSAA 497
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 102/447 (22%), Positives = 188/447 (42%), Gaps = 56/447 (12%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T+ GSRE + DL SW +NG + + R + D++A
Sbjct: 69 TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
+ VP +++T+E N + + + +++ + LAL+L+ E+ SFW P
Sbjct: 117 EELFLWVPRKMLMTVESA-QNSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L ++ ++PL + + ++ L G+ ++L + + R+Y +F
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY-----AYFY- 221
Query: 239 GSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAY 294
L Q +P +P + F+F+ ++ A +V + + V +R AL+PL
Sbjct: 222 -KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHT 280
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
+ DD + V + YK E I ++Y G + N++ +I+ GF
Sbjct: 281 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFY-----------GTRSNAEFVIHNGFFF 329
Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL- 413
+DN +DR+ ++ ++ + Y K V R G + VF +H + + +L +LR+
Sbjct: 330 QDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVF 388
Query: 414 --------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
Y+ + + + PVS E + L L Y T ED
Sbjct: 389 CMTEEELKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEED 448
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
++L +L R+A QL EK++L
Sbjct: 449 RSILEKPDLSLHTRLAVQLRLAEKQIL 475
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H + + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED++ L +++L
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 103/448 (22%), Positives = 194/448 (43%), Gaps = 56/448 (12%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
DD + V R + AGE I ++Y G + N++ +I+ GF ++N +DR+
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRV 337
Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 422
++ ++ D Y K V R G + VF +H E + +L +LR+ + E+
Sbjct: 338 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTEEEL 395
Query: 423 QSVISSLGPI----------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 472
+ + G I PVS E + L D L Y T+ ED+A+L
Sbjct: 396 REHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVLRSP 455
Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
L + +A +L EK++L ++ A
Sbjct: 456 ALSARAAMAVKLRLGEKEILEKAVRSAA 483
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H + + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEH 398
Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED++ L +++L
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/402 (23%), Positives = 183/402 (45%), Gaps = 44/402 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ DD + V + ++AGE I ++Y G + N++ +
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFY-----------GTRSNAEFV 240
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E + +
Sbjct: 241 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQL 299
Query: 408 LPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGY 458
L +LR+ +++ + + I +LG PVS E + L D L Y
Sbjct: 300 LAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTY 359
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
T+ ED+ +L + +L + ++A +L EK++L ++ A
Sbjct: 360 KTTIEEDKFVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 51/320 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ LP K+ L+ + + A+E+++ G+A VP + ++T+ER
Sbjct: 83 ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER + + +E+ + +P T
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------IT 231
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ AF + S ++ L ++ ALVP +L + C A + DAV L DR
Sbjct: 232 VGALRWAFDILFSRLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDR 288
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDP 373
Y GE + Y G +P+S+LLI+YGF E +NP D + ++ DP
Sbjct: 289 SYVKGEQVWASY-----------GQRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDP 337
Query: 374 QYQDKRMVAQRNGKLSVQVF 393
K V + G V+ F
Sbjct: 338 LADAKAQVLRDMGLSPVETF 357
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 101/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG V E + E+ + A+ D++A + VP
Sbjct: 62 GKREDYFPNLMKWASENG---ASVEGFEMFNFKEEGFGLR---ATRDIKAEELFLWVPRK 115
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 116 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 171
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 172 -----YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 219
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 220 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 279
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V R ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 280 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 328
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 329 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEELKDH 387
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ + + + PVS E + L D L Y T ED++ L +++L
Sbjct: 388 LLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKNHDLS 447
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ R+A +L EK++L + A
Sbjct: 448 VRARMAIKLRLGEKEILEKAVTSAA 472
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/404 (22%), Positives = 185/404 (45%), Gaps = 44/404 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
PL + DD + V + +KAGE I ++Y G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320
Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISA 379
Query: 406 DMLPYLRLGYVSDTSEMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LA 456
+L +LR+ +S+ + +I ++G I + D +L + +AR L
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLK 439
Query: 457 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
Y T+ D++ L ++L P +A +L EK++L ++ A
Sbjct: 440 TYKTTVEVDKSFLETHDLTPHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 105/449 (23%), Positives = 190/449 (42%), Gaps = 48/449 (10%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
VAG S +E W++ NG+ V + K + + A++D++
Sbjct: 66 VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
++P +++T + + + TI +L+ ++L + LA++++ EK + SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L + PL ++ E+ GS E L++ + I R+Y L F +
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL----FKLLN 226
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLLAYSSKCK 299
L + I E FT++ ++ A V + + K +L+PL + + K
Sbjct: 227 LPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCNHANGEMK 285
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
D+ + R + GE I + Y G + ++ LL+ GFV N Y
Sbjct: 286 TDFIEERDSCVNMALRDFSVGEQIFICY-----------GRRSSADLLLYSGFVYPGNVY 334
Query: 360 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVS 417
D + ++ L++ D Y K + KL V Q +H+ AG+E + ++L +LR+ +
Sbjct: 335 DGMAIQLGLSSSDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQ 392
Query: 418 D---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 468
D + Q++ S + +S E LA Y ++ EDE
Sbjct: 393 DLELRDRLLGDNRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEK 452
Query: 469 LTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
L D NL ++R A QL+ +EK L L+
Sbjct: 453 LKDGNLSAQERSALQLLLIEKCTLENVLE 481
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 126/262 (48%), Gaps = 34/262 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ +
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ P+ + E FK A+ V S ++ + + ALVP +
Sbjct: 117 LLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLALVPF-VDFFNH 164
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
+ C+AML+ +D ++V DR Y G+ +V+ Y G N+ L +++GF
Sbjct: 165 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISY-----------GQLSNATLALDFGF 213
Query: 353 VDEDNPYDRLV-VEAALNTEDP 373
NP+D++ + +L+ +DP
Sbjct: 214 ALPFNPHDQVAGIWLSLSEKDP 235
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 171/392 (43%), Gaps = 58/392 (14%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + ++ V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDVDTVRRSE-IGECCAG--LRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + I E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
+T E F AF ++S R F PPL+ + + V+
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRS-----------RTF------PPLIGDNLALVPLADFVN 182
Query: 307 DAVQLVVDRPYKAGESIVVWY---NISFTGHDFKCGPQP-----------NSKLLINYGF 352
L + PY +S V+ ++ C + N++L +YGF
Sbjct: 183 HGFGLTNEDPYWHVKSAGVFARQETLTLQAAA-NCAEKQEVLMQYGKKKGNAQLATDYGF 241
Query: 353 VDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
VD D N D + ++ + DK +AQ G S F+++ R + DM+ Y
Sbjct: 242 VDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAY 299
Query: 411 LRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
LRL + SD+ ++++ + P+S E A+ + + + +A L Y +T+ ED
Sbjct: 300 LRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDED 359
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
+L L +K++A + EK++L LQ
Sbjct: 360 TMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 101/445 (22%), Positives = 196/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 399 LLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 103/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 76 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 129
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 130 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 185
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 186 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 233
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 234 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 293
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 294 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 342
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H+ E + +L +LR+ +++ +
Sbjct: 343 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEH 401
Query: 425 V--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ I +LG PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 402 LLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 461
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 462 VRATMAIKLRLGEKEILEKAVKSAA 486
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 111/438 (25%), Positives = 172/438 (39%), Gaps = 55/438 (12%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ ++ KE R VA D+ AG +VP V
Sbjct: 47 EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++ L EL LAL+L E+ +G S W PY++ L
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTLAAN-------P 151
Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W+E E A L GSP + +ER+ + EY + V + P P EA
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEA 204
Query: 254 FTF---EIFKQAFVAVQSCVVHLQKVSLARRFALVPL-------GPPLLAYSSKCKAMLA 303
+ F E F A V + L S +ALVPL G P+ S +
Sbjct: 205 YEFFTEERFVDALATVCAKATWLPTASC---YALVPLLDVITIAGSPVPGVSPP-----S 256
Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDFKCGP--QPNSKLLINYGFVDEDNP 358
A D + D Y + VV + + P + N +L +N G VD+ +P
Sbjct: 257 AKDGIARCAAD--YDVDSACVVLSAVVKAPANSRVVQLDPLQRNNGELFLNTGRVDQKHP 314
Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVS 417
D L + + D + K+ V + G Q F V+ E + + YLR V
Sbjct: 315 GDYLYMRTEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY---EDRMPTQLYSYLRFARVQ 371
Query: 418 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
D EM +V I VS E +L L + ++ Y ++ +L +
Sbjct: 372 DPGEMMAVSFEEDKI--VSVMNEYEILQLLMGDCRELMSEYDTNEEDELNLLKLSDTMRV 429
Query: 478 KRVATQLVRMEKKMLNAC 495
+ + +RM +K L C
Sbjct: 430 REIEAAKLRMSEKKLIGC 447
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 101/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 168/387 (43%), Gaps = 48/387 (12%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + + V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + + E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------GPPLLAYSSKCKA 300
+T E F AF ++S + ALVPL G L K
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTF---PPLIGDNLALVPLADFVNHGFGLTNEDPGWKV 196
Query: 301 MLAAV---DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED- 356
A V + + L + +++ Y + N++L +YGFVD D
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLIQYG----------KKKGNAQLATDYGFVDSDE 246
Query: 357 -NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG- 414
N D + ++ + DK +AQ G S F+++ R + DM+ YLRL
Sbjct: 247 KNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIA 304
Query: 415 -YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 470
+ SD+ ++++ + P+S E A+ + + + +A L Y +T+ ED +L
Sbjct: 305 LFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLN 364
Query: 471 DYNLHPKKRVATQLVRMEKKMLNACLQ 497
L +K++A + EK++L LQ
Sbjct: 365 SSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 101/445 (22%), Positives = 196/445 (44%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAESVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 399 LLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 169/394 (42%), Gaps = 40/394 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ DD + V + YK GE I ++Y G + N++ +
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFY-----------GTRSNAEFV 322
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
I+ GF EDN +DR+ ++ ++ + Y K V R G + +F +H E + +
Sbjct: 323 IHNGFFFEDNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQL 381
Query: 408 LPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
L +LR+ Y+ + + + PVS E + L L Y
Sbjct: 382 LAFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTY 441
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
ED +ML +L R+A +L EK++L
Sbjct: 442 KTASEEDRSMLEKPDLSLHSRIAIKLRLAEKEIL 475
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
+ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
S + + + PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLS 458
Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 176/403 (43%), Gaps = 54/403 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L +++ + LAL+L+ E+
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERA-N 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + E+ L G+ ++L + R+Y
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY--- 217
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
+F L Q +P +P ++FTF+ ++ A +V + + ++V+L A
Sbjct: 218 --AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTL----A 269
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
L+PL + DD + V + YK E I ++Y G + N
Sbjct: 270 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFY-----------GTRSN 318
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
++ +I+ GF ++N +D++ ++ ++ + Y K V R G +F ++ E+
Sbjct: 319 AEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPI 377
Query: 404 ISDMLPYLRLGYVSDTSEMQSV---------ISSLGPI-CPVSPCMERAVLDQLADYFKA 453
+ +L +LR+ + E++ I +LG + PVS E + L
Sbjct: 378 SAQLLAFLRV-FCMKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVAL 436
Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
L Y T ED + L L P R+A QL EK +L L
Sbjct: 437 LLKAYKTTSEEDSSTLEKSELSPHSRMAIQLRLAEKWILEKAL 479
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 178/403 (44%), Gaps = 47/403 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ D+ + T + F AF ++S + + L+PL
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P + Y+ + K A L + D L KAGE + + Y+++ + N
Sbjct: 238 HNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLN----------KSN 287
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
++L ++YGFV+ + + + + DP + DK +A+ N F V G+ A
Sbjct: 288 AELALDYGFVESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQTLPA 347
Query: 404 ISDMLPYLRLGYV--SDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGY 458
ML YLRL + SD ++S+ ++ G + PVS E + + D K+ L+G+
Sbjct: 348 --GMLQYLRLVALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSALSGF 405
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
T+ EDE +L + L P+ +A ++ EK++L Q+ D
Sbjct: 406 STTIEEDEKLLEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 129/285 (45%), Gaps = 39/285 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++ G+ V L++T ++ + ELL T ++E A LAL+++ E+ G+ S
Sbjct: 5 AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L A+ S + W + EL + + E ++R I E+ + V
Sbjct: 62 QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
Q+ P+ I E FKQA+ +S + S R VP +
Sbjct: 117 -------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILTVPF-VDFFNH 162
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
S C+A+L+ ++ +++ D+ Y GE +V+ Y G PN+ L +++GF
Sbjct: 163 DSNCRALLSYDEERACAEVIADKNYARGEQVVISY-----------GRLPNTTLALDFGF 211
Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
NPYD++ V AL+ DP + K + +G +V VHA
Sbjct: 212 TISCNPYDQVEVWMALSHRDPLRKMKLALLHAHGMPTV----VHA 252
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 90/391 (23%), Positives = 171/391 (43%), Gaps = 40/391 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ G+ VP + + V +E I + + L +AL+L+ EK +
Sbjct: 87 LVAKRDIAKGETVLQVPKRFWINPDAVAESE-IGNVCS--GLKPWISVALFLLREKWR-D 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L + + +S + WSE EL+ + G+ + + + ++ E+ +++
Sbjct: 143 DSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
+ Q +P+ I + F F I + +AF +++ ++ + L A V
Sbjct: 196 EEVILRNK--QLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTE 253
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
A+ + A L + D L KAG+ + + Y+++ + N+ + +
Sbjct: 254 DH--AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLN----------KSNADMAL 301
Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
+YGF++ + D + ++ D Y DK +A+ NG F + G+ M+
Sbjct: 302 DYGFIEPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT--MI 359
Query: 409 PYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 461
PYLRL + T SV LG PVS E + + D K+ L+GY T
Sbjct: 360 PYLRLVALGGTDAFLLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSGYHTT 417
Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
+ EDE ++ + NL + ++A + EK++L
Sbjct: 418 IEEDEKLMEEGNLSTRLQIAVGIRLGEKRVL 448
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/394 (22%), Positives = 168/394 (42%), Gaps = 40/394 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ DD + V + YK GE I ++Y G + N++ +
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFY-----------GTRSNAEFV 322
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
I+ GF EDN +DR+ ++ ++ + Y K V R G + +F +H E + +
Sbjct: 323 IHNGFFFEDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQL 381
Query: 408 LPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
L +LR+ Y+ + + + PVS E + L L Y
Sbjct: 382 LAFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTY 441
Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
ED +ML +L R+ +L EK++L
Sbjct: 442 KTASEEDRSMLEKPDLSLHSRITIKLRLAEKEIL 475
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 95/408 (23%), Positives = 185/408 (45%), Gaps = 52/408 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + +PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
PL + DD + V + +KAGE I ++Y G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320
Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379
Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
+L +LR+ +S+ + +I +LG PVS E +L + +AR
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435
Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L Y T+ +D++ L ++L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 126/262 (48%), Gaps = 36/262 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ + V
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L QQ + + T ++ V S ++ + + ALVP +
Sbjct: 117 CM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLALVPF-VDFFNH 162
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
+ C+AML+ +D ++V DR Y G+ +V+ Y G N+ L +++GF
Sbjct: 163 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISY-----------GQLSNATLALDFGF 211
Query: 353 VDEDNPYDRLV-VEAALNTEDP 373
NP+D++ + +L+ +DP
Sbjct: 212 ALPFNPHDQVAGIWLSLSEKDP 233
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 93/402 (23%), Positives = 174/402 (43%), Gaps = 52/402 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L T +++ + LAL+L+ E+
Sbjct: 28 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + ++ L G+ ++L + R+Y
Sbjct: 86 PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
+F L Q +P +P ++FTF+ ++ A +V + + ++V+L A
Sbjct: 136 --AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTL----A 187
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
L+PL + DD + V + YK E I ++Y G + N
Sbjct: 188 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFY-----------GTRSN 236
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
++ +I+ GF ++N +D++ ++ ++ + Y K V R G VF ++ E
Sbjct: 237 AEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPI 295
Query: 404 ISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 454
+ +L +LR+ Y+ + +++ PVS E + L
Sbjct: 296 SAQLLAFLRVFCMMEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALL 355
Query: 455 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
L Y T ED + L +L P R+A QL EK +L L
Sbjct: 356 LKAYKTTAEEDSSTLDKTDLSPHSRMAVQLRLAEKAILEKAL 397
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 93/408 (22%), Positives = 186/408 (45%), Gaps = 52/408 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
PL + DD + V + +KAGE I ++Y G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320
Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379
Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
+L +LR+ +++ + +I +LG P+S E +L + +AR
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435
Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L Y T+ +D++ L ++L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/428 (22%), Positives = 176/428 (41%), Gaps = 46/428 (10%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W NG V +K+ P + A E ++ + VP L++T
Sbjct: 80 WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+S EL L GS E L+ I R+Y+ + F L + PY + FT++++
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLY 240
Query: 260 KQAFVAVQS--CVVHLQKVSLARR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ A AV + V L + A+VPL K + ++
Sbjct: 241 RWAVSAVMTRQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLR 300
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
R ++ G+ + ++Y G + N++ I+ GFV +N +D + ++ ++
Sbjct: 301 CYAMRDFEKGQEVTIFY-----------GRRTNAEFFIHNGFVFPENRHDSVDIKLGISK 349
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 430
+DP Y K + + +F + RE+ D+ +LR+ + D S+ S
Sbjct: 350 QDPLYAVKAKLCDDHELTPSGIFAL-VPRERPVCEDLSTFLRILVLKDASQAASFTDE-- 406
Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 490
I + R L+ L + L +P + E E ++ D + + ++A QL +E+K
Sbjct: 407 HIMVATDDNAREALNFLIVRIQLLLRAFPKSDQEYENIIADEGSNGRLKMAAQLRLLERK 466
Query: 491 MLNACLQV 498
+L A L+
Sbjct: 467 ILTAVLET 474
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/442 (23%), Positives = 182/442 (41%), Gaps = 70/442 (15%)
Query: 93 CKVILKEKPSHNEKHRPIHYVAASEDL--------QAGDAAFSVPNSLVVTLERVLGNET 144
+ I SH + + SE L AGD +VP S+ + L V N +
Sbjct: 43 VQTIWSWAQSHGIQGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ-NSS 101
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ + L +AL+L++E S W PY+ L + +++SPL WS+ E
Sbjct: 102 LGK--ACEGLKPWVAVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEE 151
Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
LA L G+ + E ++ EYN L + V +F Y TF+ FK AF
Sbjct: 152 LAELVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAF 204
Query: 264 VAVQSCVVHLQKVSLARRFALVPL------------GPP--LLAYSSKCKAMLAAVDDAV 309
++S ALVP+ G P + +S+ + D +
Sbjct: 205 GILRSRTFSPLT---GEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLL 261
Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDR 361
+ + AGE +++ Y + + N+ L ++YGFV+ D D
Sbjct: 262 TVRASANFSAGEQVLMQYGAT----------KSNADLALDYGFVERDRGSQFSPGIERDS 311
Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DT 419
L + ++ +D DK + + NG F + G+ +M+ +LRL +S D+
Sbjct: 312 LALSLEISPDDRFVDDKADILEINGFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDS 369
Query: 420 SEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
++++ + G + PVS E A+ + + KA L GY T+ +D +L +L
Sbjct: 370 FLLEALFRNEAWGHVSLPVSRDNEEALCTSMLEGLKAALDGYSTTVEQDMELLARGDLST 429
Query: 477 KKRVATQLVRMEKKMLNACLQV 498
+ +A +VR+ +K + LQ
Sbjct: 430 RMEIAV-VVRLGEKRVMQELQT 450
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/404 (24%), Positives = 174/404 (43%), Gaps = 49/404 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ + T + F AF ++S + + L+PL +
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
AY K A L + D L KAGE + + Y+++ +
Sbjct: 238 HNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLN----------KS 286
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N++L ++YGFV+ + + + + DP + DK +A+ N F + G+
Sbjct: 287 NAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLP 346
Query: 403 AISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAG 457
A ML YLRL + D ++S+ ++ G + PVS E + + D K+ L+G
Sbjct: 347 A--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSG 404
Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
+ T+ EDE +L L P+ +A ++ EK++L Q+ D
Sbjct: 405 FDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 116/259 (44%), Gaps = 26/259 (10%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWYNISF 331
DR Y+ G I +W I +
Sbjct: 289 TDRSYQPGIYI-LWEEIQW 306
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/408 (22%), Positives = 185/408 (45%), Gaps = 52/408 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E + + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
PL + DD + V + +KAGE I ++Y G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320
Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379
Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
+L +LR+ +++ + +I +LG P+S E +L + +AR
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435
Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L Y T+ +D+ L ++L +A +L EKK+L ++ A
Sbjct: 436 LLLKTYKTTVEDDKLFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 174/398 (43%), Gaps = 45/398 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ + T + F AF +++ + ++ F + P +
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIK 235
Query: 292 ---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
AY K A L + D L KAGE + + Y+++ + N++L +
Sbjct: 236 TEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLN----------KSNAELAL 284
Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
+YGFV+ + + + + DP + DK +A+ N F + G+ A ML
Sbjct: 285 DYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GML 342
Query: 409 PYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 463
YLRL + D ++S+ ++ G + PVS E + + D K+ L+G+ T+
Sbjct: 343 QYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIE 402
Query: 464 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
EDE +L L P+ +A ++ EK++L Q+ D
Sbjct: 403 EDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 440
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/408 (23%), Positives = 183/408 (44%), Gaps = 52/408 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATRDIKAEELFLWVPRKLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER- 165
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + + L + E E+ YL + +I + + R+Y
Sbjct: 166 YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
PL + DD + V + +KAGE I ++Y G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320
Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379
Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
+L +LR+ +++ + +I +LG PVS E +L + +AR
Sbjct: 380 QLLAFLRVFCMTEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435
Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L Y T+ +D+ +L +L +A +L EK++L ++ A
Sbjct: 436 LLLKTYKTTIHDDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 96/401 (23%), Positives = 166/401 (41%), Gaps = 49/401 (12%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+ + A +L G+ VP L + + V + + L ++L ++ E
Sbjct: 39 PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+
Sbjct: 99 ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ ++ AG + D+ TF+ F AF ++S V + + AL+P
Sbjct: 152 DNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFA 201
Query: 289 PPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
+ + K K L D L K+GE I V Y++
Sbjct: 202 DLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLD-------- 252
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
+ N++L ++YGF + ++ D + ++ DP Y+DK +A+ NG F V G
Sbjct: 253 --KSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG 310
Query: 399 REKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYF 451
+ M+ YLRL + T V L PVS E ++ + +
Sbjct: 311 --ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNAC 366
Query: 452 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
K+ LA Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 367 KSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 103/448 (22%), Positives = 196/448 (43%), Gaps = 58/448 (12%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG L N K + A+ +++A + VP L
Sbjct: 81 KREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 135 LMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 190 ----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNAN 238
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 239 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 298
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + + GE I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 299 DDRCECVALQDFNVGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 347
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D Y K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 348 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 406
Query: 426 IS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDY 472
I +LG PVS E +L + +AR L Y T+ ED++ L +
Sbjct: 407 IGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATH 462
Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
+L +A +L EK++L ++ A
Sbjct: 463 DLTFHATMAIKLRLGEKEILEKAVKSAA 490
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 103/449 (22%), Positives = 195/449 (43%), Gaps = 58/449 (12%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG L N K + A+ +++A + VP
Sbjct: 73 GKREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + + GE I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFNVGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
+ ++ D Y K V R G + VF +H E + +L +LR+ +++ +
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398
Query: 425 VIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTD 471
+I +LG PVS E +L + +AR L Y T+ ED++ L
Sbjct: 399 LIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLAT 454
Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTA 500
++L +A +L EK++L ++ A
Sbjct: 455 HDLTFHATMAIKLRLGEKEILEKAVKSAA 483
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 111/434 (25%), Positives = 178/434 (41%), Gaps = 59/434 (13%)
Query: 83 SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+W K G K IL +KP + AS D+Q G++ VP++ V++
Sbjct: 46 AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V T+ +L ++ L LAL L+ E+ KS Y L G +P
Sbjct: 96 NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLWSE E L G+ + + + +L LF P P FT
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLP 201
Query: 258 IFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
F A AV+S + K++LA LV L A ++K + + Q+ V
Sbjct: 202 NFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVV 258
Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
R + GE++ G D+ G + + +L++YG +D +P + L+
Sbjct: 259 EATRAIRKGEAL---------GMDYAPG-KLDGPVLLDYGVMDTASPKPGYSLTLTLDES 308
Query: 372 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL 429
D DK + + G + + +++ +M+ +LRL + D ++S+ +
Sbjct: 309 DKFVDDKADIVEGAGLRPSMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN- 365
Query: 430 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM- 487
VS E AV LA+ +A LAGYP TL +D A L + R A LVR+
Sbjct: 366 ----EVSEGNEEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLG 421
Query: 488 EKKMLNACLQVTAD 501
EK+ L+A + D
Sbjct: 422 EKESLDAVARFFED 435
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 104/466 (22%), Positives = 185/466 (39%), Gaps = 63/466 (13%)
Query: 67 SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
+++ K+E D + + W+ +G K +K + E + ++ D++ G+
Sbjct: 56 GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111
Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
+P L ++T + G LL N + + LALYL+ E S P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
Y++ L + + W + L GSP + G R+Y +F
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY- 217
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSS 296
LF P + T FT+E F A VQS V + AL+P ++
Sbjct: 218 -QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGG 273
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
K + V + + YK GE + ++Y GP+PNS+ + GF +
Sbjct: 274 KITTFIDPVKHVLTCSAAKSYKKGEQVYMYY-----------GPRPNSQFYLFQGFSLKT 322
Query: 357 NPYDRLVVEAALNTEDPQ--YQDK-RMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 412
N D + L+ ED + DK ++ +R G Q + E + ++++P+ R
Sbjct: 323 NLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLRVGQTVSLSQNPSSEKLPAEIIPFYR 382
Query: 413 LGYVS--DTSEM-----------QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAG 457
+ +S +T ++ + P +S E+ L D KARL+G
Sbjct: 383 IAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAFNIISEENEKKAFKLLLDSLKARLSG 442
Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
YP TL++DE + + N ++R ++ EKK+L ++ +I
Sbjct: 443 YPTTLAQDEQEMKN-NPTTQRRYVLYILINEKKILERNIKYVEQLI 487
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 173/400 (43%), Gaps = 48/400 (12%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S+ + G+ F+VP + +T + ++ + L L +AL+L++E+ G S
Sbjct: 4 SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L G SP+ W E +LA L GS + ++ +++L
Sbjct: 61 WAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEV 113
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSLARRFALVPLGP 289
F S +D P F F+ F A V++ ++ V LA P P
Sbjct: 114 FGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWP 167
Query: 290 PLLA-YSSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWYNISFTGHDFKCGPQ-PNSK 345
P A + K L LV++ AG++I + DF GPQ + +
Sbjct: 168 PDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAM---------DF--GPQKSDGQ 216
Query: 346 LLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKE 402
LL+++G +D + P L +E L+ ED Y DK + + N +L+ H+ A R +
Sbjct: 217 LLVDHGVIDPLVNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPD 273
Query: 403 AISDMLPYLRLGYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYP 459
A + L +D ++S+ + PVS ER QL D A LA YP
Sbjct: 274 AGLLPVLRLLNLSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYP 333
Query: 460 ATLSEDEAMLTDYNLHPKKRVATQL-VRM-EKKMLNACLQ 497
++ ED A++ +L P R + + VR+ EK+ L+A L+
Sbjct: 334 TSIDEDLALMASGSLQPGSRRQSAVRVRLGEKEALDATLR 373
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 97/400 (24%), Positives = 176/400 (44%), Gaps = 50/400 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A +D++A + ++P L++T E ++ L+ +++ ++ LAL+++ EK
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTE-TARESSLGPLIKKDRILQVMANVSLALHVLCEK- 178
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFW PYI +PL + E E+ +L GS +++L + + I R+Y
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQY- 230
Query: 230 ELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV--SLARRF-- 282
+F LFQ P +P E FTF+ ++ A V + + +V S R
Sbjct: 231 ----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWA---VSTVMTRQNQVPTSDGRHLIT 281
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
AL+P+ + + D+ + + R + + ++Y G +
Sbjct: 282 ALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFY-----------GMRS 330
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N++ LI+ GFV +N +DR+ V+ ++ D + K V R G + F VH G++
Sbjct: 331 NAEFLIHNGFVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP- 389
Query: 403 AISDMLPYLRL-----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKA 453
++L +LR+ G + D TSE QS +S LG C V+ E L
Sbjct: 390 IPPELLVFLRVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSL 449
Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
+ Y ++ + E L ++ R A QL E ++L+
Sbjct: 450 LIRSYRTSIKDVETELQAPDMTYHSRAALQLKLAEMQILS 489
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 120/266 (45%), Gaps = 30/266 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +T + +L L ++ +A LA ++ EKK G+ S
Sbjct: 65 ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L +Q G+L + + W+E+EL + S E +++ I++++ + +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ + F Y A T +F V + V + AL+P L +
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLALIPFA-DFLNH 232
Query: 295 SSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+A++ + DD ++Q++ DR Y GE +++ Y G N+ L+
Sbjct: 233 DGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRY-----------GKFSNATLM 281
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDP 373
+++GF N YD++ ++ + DP
Sbjct: 282 LDFGFTIPYNIYDQVQIQFDIPKHDP 307
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 114/262 (43%), Gaps = 50/262 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W + ELA+L SPT E +E + IK E++ L+
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
F + D+ E + F A+ Q +P
Sbjct: 120 NVF-------PWCRDVLGEV-SLTDFMHAYSTDQ-----------------IPFA-DFFN 153
Query: 294 YSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
+ C+ L+ D V D+ YKAG+ I + Y G PNS L ++YG
Sbjct: 154 HDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSY-----------GSTPNSILAVDYG 202
Query: 352 FVDEDNPYDRLVVEAALNTEDP 373
F NP++++ V ++ DP
Sbjct: 203 FAVASNPHEQVEVPMGVSLTDP 224
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/405 (25%), Positives = 170/405 (41%), Gaps = 45/405 (11%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S++++ GD S+P VT + +A L+ + EL LAL+L EK + K S
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTV 234
W PY++ L+ S L ++E E L G+ E +R + K EY L
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL------ 287
+ P P + + F + + AF+ A+ Q ++ A +A+VPL
Sbjct: 233 -------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPI 284
Query: 288 --GPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWYNISFTGH----D 335
PP ++ A + D AV+ S+V+ N +
Sbjct: 285 CGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQ 344
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 394
+ NS+L +++G VD+ +P D L+ DP Y K+ V + G Q F
Sbjct: 345 IDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQGFADKGQTFP 404
Query: 395 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 454
V+ R + L YLR V+++ E+ +V + + VSP E L L + R
Sbjct: 405 VYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQLLMADCRDR 459
Query: 455 LAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 498
++ Y T EDE +L ++ K R A++L R EK+++ +
Sbjct: 460 MSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 124/296 (41%), Gaps = 48/296 (16%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
+E + K E +L SW+ N + LK +HN + + +QA +
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
S+P V+TL+ V + ++LT K ++L A AL+L+ E+K+ S + YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKILT-EKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P L+SE EL YL G+ + E+ E IK +Y + V
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV------ 297
Query: 241 LFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
IP F+FE F+ AF+ S V + KV + +VPL L S
Sbjct: 298 --------IPEFKSEFSFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQ 348
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
+ DDA KA + I I F+ G + NSKL +NYGFVD
Sbjct: 349 EDSEWVFDDATNCFT---VKALKKIQRNQQIHFS-----YGSKCNSKLFLNYGFVD 396
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 139/309 (44%), Gaps = 47/309 (15%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+ D +L WM +NG K L + + E R + A++ Q GD S+P L++
Sbjct: 28 DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81
Query: 135 TLERVLGNETIAELL---TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T + VL + + + + T KLS + + YL+ E+ + K SFW YI+ L +
Sbjct: 82 TTQNVLSS-YLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------ 134
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ +P+ ++ E+ +L K ++ + E I Y EL ++ + S F +
Sbjct: 135 -SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFK----- 187
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLA 303
F + F+ A+ V + V++ + +S+ R +AL P LL +++ + KA
Sbjct: 188 GIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYN 246
Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
V ++ K + + ++Y GP N KL I YGFV N ++ V
Sbjct: 247 PVSKCYEIFTCTACKKYDQMFIYY-----------GPHDNVKLFIEYGFVLPQNQHN--V 293
Query: 364 VEAALNTED 372
VE L+ ED
Sbjct: 294 VE--LDFED 300
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/433 (23%), Positives = 180/433 (41%), Gaps = 83/433 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + A + +P S ++TLE + ET +A+ + +L L+ L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
K + SFW PY+ L Q P+ ++ +L +L GSP +I ++ +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
YN++ V F QY +F F A + S + + + + A VPL
Sbjct: 276 YNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAFVPL 321
Query: 288 GPPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
L YS + + + D+ + DR G+ I F + KC
Sbjct: 322 ADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMI-------FDSYGRKC 366
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQR 384
NS+ L+NYGFV +DN + + V A DP Q K R++
Sbjct: 367 ----NSRFLLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDT 422
Query: 385 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSP 437
+G + F + + + + Y+R + D +++Q +++ + P I P+
Sbjct: 423 DGINEITHFL------EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGI 476
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
E + D + L+ YP TL +D+ +L +L +R L EK++L Q
Sbjct: 477 HNELDMWDLIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQ 536
Query: 498 VTADMIMLLPDVT 510
+ M LL +
Sbjct: 537 FSEKMKQLLSNFN 549
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 66/270 (24%), Positives = 119/270 (44%), Gaps = 33/270 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ G+ V L++T R+ + E + ++E + LAL+ + K GK S W P
Sbjct: 70 IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L RG +++ + W + EL L S + R I +++ + V
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFDLVQAV---- 177
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
+YP ++ E T E FK A+ S ++ + +VP + + S
Sbjct: 178 ---VNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITMVPF-VDMFNHDSSA 229
Query: 299 KAMLAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
+A+LA ++ ++V D+ Y G +V+ Y G PNS L +++GF D
Sbjct: 230 RALLAYYEEEGYAEVVADKDYNQGSQVVITY-----------GTLPNSSLALDFGFTLPD 278
Query: 357 NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
NP+D + + + DP +K + + +G
Sbjct: 279 NPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 129/284 (45%), Gaps = 36/284 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F+A + Q P I TE E F A+ V S + ++R +L+P +
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIPFADFMNHD 233
Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
+L D+ + ++ DR Y G+ + + Y G N+ L++++GF
Sbjct: 234 GLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKY-----------GEFSNATLMLDFGFT 282
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 394
N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 283 FPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/351 (22%), Positives = 154/351 (43%), Gaps = 45/351 (12%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y V Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV-------IQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
DD + V R + AGE I ++Y G + N++ +I+ GF ++N +DR+
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRV 337
Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
++ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 338 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 162/392 (41%), Gaps = 49/392 (12%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
+L G+ VP L + + V + + L ++L ++ E +G S W
Sbjct: 52 NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE----- 159
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL------ 291
AG + D+ TF+ F AF ++S V + + AL+P +
Sbjct: 160 AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFADLINHDGDI 214
Query: 292 ----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ K K L D L K+GE I V Y++ + N++L
Sbjct: 215 TSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLD----------KSNAELA 263
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
++YGF + ++ D + ++ DP Y+DK +A+ NG F V G + M
Sbjct: 264 LDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQM 321
Query: 408 LPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 460
+ YLRL + T V L PVS E ++ + + K+ LA Y
Sbjct: 322 ITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHT 379
Query: 461 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 380 TIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 104/428 (24%), Positives = 182/428 (42%), Gaps = 80/428 (18%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
+R +H A + ++ G+ +P ++TLE L E I +L+ + + L+ L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEIL 218
+Y++ EKK KSFW P++ L VE P+L+++ EL +L GSP ++
Sbjct: 428 IYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVK 477
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKV 276
ER E I ++Y Q IP A T + F A + S + L +
Sbjct: 478 ERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FI 522
Query: 277 SLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ R A VPL AY++ K ML A +D + G+ I
Sbjct: 523 NKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIY-- 571
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 385
+ CG + NS+ L+NYGFV ++N + + + + +D K +M+ +R
Sbjct: 572 ---------YSCGRKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR- 621
Query: 386 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ-----SVISSLGPIC--PVSPC 438
K +F +H E++++ + +LR + D ++ S P+ P S
Sbjct: 622 -KPESLIFRIHINYEEKSVLEFFGFLRFVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIE 680
Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 498
E+ + ++ + YP TL ED+ +L L ++ L EK++L + +
Sbjct: 681 NEKQMWTEIHKICAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEILMYYITM 740
Query: 499 TADMIMLL 506
M LL
Sbjct: 741 ADRMKKLL 748
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 159/355 (44%), Gaps = 40/355 (11%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
+R AL+PL + DD + V + ++AG+ I ++Y
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFY------- 185
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF
Sbjct: 186 ----GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 241
Query: 395 VHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLD 445
+H+ E + +L +LR+ +++ + + I +LG PVS E +
Sbjct: 242 LHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWT 300
Query: 446 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 301 FLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 120/260 (46%), Gaps = 33/260 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP ++ +T + + + ++ T+++ + LA L+ EKK+G+KS
Sbjct: 75 ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E E + + S E +++ I++E++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F+A + Q YP I E E F A+ V S + ++ +L+P +
Sbjct: 182 -FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIPFADFMNHD 233
Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
+L+ D+ + ++ DR Y G+ + + Y G N+ L++++GF
Sbjct: 234 GLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKY-----------GEFSNATLMLDFGFT 282
Query: 354 DEDNPYDRLVVEAALNTEDP 373
N +D + ++ + +DP
Sbjct: 283 VPYNIHDEVQIQMDVPNDDP 302
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 116/267 (43%), Gaps = 57/267 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ + L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTS--NPKTRLSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W ELA+L SPT E +E + IK E++ L
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLT- 118
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL-----ARRFALVPLG 288
+A Q +P+ C L +VSL A +P
Sbjct: 119 ---LANK--QVFPW---------------------CRDALGEVSLTDFMHAYSTDQIPFA 152
Query: 289 PPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
+ C+ L+ D V D+ YKAG+ I + Y G PNS L
Sbjct: 153 -DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSY-----------GSTPNSIL 200
Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDP 373
++YGF NP++++ V ++ DP
Sbjct: 201 AVDYGFAVASNPHEQVEVPMGVSLTDP 227
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 90/317 (28%), Positives = 134/317 (42%), Gaps = 53/317 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA--ELLTTNKLSELACLALYLMYEKKQ 170
AAS DL AG A ++P+S ++T L + T T L E + L+L+YEK
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254
Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
G +S W P + L + G L + +P W +E A L G+P + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
E R++ L F A L + +P PTE +T F+ A A + + +Q ++
Sbjct: 314 ARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVG 367
Query: 280 RRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAGESIVVW 326
PP A+L AV DDA+ L + R +AGE I V
Sbjct: 368 GASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVS 427
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR-- 384
Y G + N++LL+ YGF DNPYD + L+ E PQ + + + A R
Sbjct: 428 Y-----------GAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRER 472
Query: 385 ---NGKLSVQVFHVHAG 398
KLS+ V G
Sbjct: 473 VLHRAKLSLSPHSVRCG 489
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 101/454 (22%), Positives = 186/454 (40%), Gaps = 73/454 (16%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW +NG C+ + + + + A+ D++A + +P ++
Sbjct: 75 REDHFPELMSWAKENG-ASCECF-----TVANFGKEGYGLRATRDIKAEELFLWIPRKML 128
Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+T+E N + L + +++ + LAL+L+ E+ SFWLPYIR L ++
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----------NELDTVWF---- 236
+ PL + + ++ L G+ ++L + + R+Y L +V
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFA 239
Query: 237 -----MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
M G LF Q+ A + + +Q + + +V+LA L+PL
Sbjct: 240 SLTPVMGGKLFDQW-------AVSSVMTRQNQIPTEDG----SRVTLA----LIPLWDMC 284
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
+ DD + V + YK E I ++Y G + N++ +I+ G
Sbjct: 285 NHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFY-----------GTRSNAEFVIHNG 333
Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
F +D+ +DR+ ++ ++ + Y K V R G + VF +H E + +L +L
Sbjct: 334 FFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPPISAQLLAFL 392
Query: 412 RLG---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
R+ Y+ + + + PVS E + L L Y T
Sbjct: 393 RVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLKTYKTTS 452
Query: 463 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
ED +ML +L R+A QL EK++L L
Sbjct: 453 EEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 165/398 (41%), Gaps = 49/398 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELACLALYLMYEKKQG 171
+ A+ DL G+ VP L + + V ++ A L +AL L+ E +G
Sbjct: 83 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGGLRPWVAVALLLLSEVARG 142
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--N 229
S W PY+ L RQ S A L S + ++L G+K EY +
Sbjct: 143 ADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGVK-EYVQS 189
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVP 286
E D+V S + D+ + TF+ F AF ++S V + K++L LV
Sbjct: 190 EFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRGDKLALVPFADLVN 245
Query: 287 LGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
P + + S K K + + L K+G+ I + Y++ +
Sbjct: 246 HSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDLD----------KS 294
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N++L ++YGFV+ + D V ++ DP Y DK +A+ N F + ++
Sbjct: 295 NAELALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL--DEP 352
Query: 403 AISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
MLPYLRL + T SV L P+SP E ++ + D K+ L
Sbjct: 353 LPPQMLPYLRLLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACKSAL 410
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
A Y T+ EDE + NL P+ +A + EKK+L
Sbjct: 411 AAYHTTIEEDEELSERENLQPRLTIAIGVRAGEKKVLQ 448
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 91/404 (22%), Positives = 169/404 (41%), Gaps = 49/404 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ T + F AF ++S + + L+P +
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+ K A L + D L KAG+ + + Y++ +
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYDLK----------KS 308
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N+ L ++YGF+++ + + + + D + DK +A+ NG F + E+
Sbjct: 309 NADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERP 366
Query: 403 AISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAG 457
MLP+LRL + T ++S+ S G + PVS E + + + +A L+G
Sbjct: 367 FPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSG 426
Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
Y T+ EDE L + NL + R+A + EK++L +Q+ D
Sbjct: 427 YHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 91/404 (22%), Positives = 169/404 (41%), Gaps = 49/404 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ T + F AF ++S + + L+P +
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+ K A L + D L KAG+ + + Y++ +
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYDLK----------KS 308
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N+ L ++YGF+++ + + + + D + DK +A+ NG F + E+
Sbjct: 309 NADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERP 366
Query: 403 AISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAG 457
MLP+LRL + T ++S+ S G + PVS E + + + +A L+G
Sbjct: 367 FPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSG 426
Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
Y T+ EDE L + NL + R+A + EK++L +Q+ D
Sbjct: 427 YHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 83/313 (26%), Positives = 135/313 (43%), Gaps = 44/313 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEK----HRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+ SW+ + G + + S + + HRP + AG+ +LV+T
Sbjct: 41 EFMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRP---------MCAGECMIKFSQNLVLT 91
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E+ L E IA L N+ + ++ L +M EK++G+ S W PYI L G+ +
Sbjct: 92 PEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYIECLP---SFGE--IH 142
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + W ELA L SP ER ++ EY E+ V S Y D+ +
Sbjct: 143 STIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----S 194
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-D 314
E FK + V S S ++PL + + + + + DD +VV
Sbjct: 195 LEQFKHEYATVSSRAWGQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVAS 250
Query: 315 RPYKAG-ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
R Y+ G E+ + +I + G N+ L ++YGFV DNP+D + + +EDP
Sbjct: 251 RDYQTGDENFHLKVHICY-------GDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDP 303
Query: 374 QYQDK-RMVAQRN 385
+ K + +AQ N
Sbjct: 304 LREIKLQYMAQNN 316
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 123/282 (43%), Gaps = 40/282 (14%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
HRP + AG+ LV+T E+ L E IA L N+ + ++ L +M
Sbjct: 72 HRP---------MCAGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
EK++G+ S W PYI L G+ + S + W ELA L SP ER ++
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
EY E+ V S Y D+ + E FK + V S S ++P
Sbjct: 174 EYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTMIP 222
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAG-ESIVVWYNISFTGHDFKCGPQPNS 344
L + + + + + DD +VV R Y+ G E+ + +I + G N+
Sbjct: 223 L-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICY-------GDHSNA 274
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 385
L ++YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 275 VLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 156/355 (43%), Gaps = 40/355 (11%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
+R AL+PL + DD + V + ++AG+ I ++Y
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFY------- 185
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF
Sbjct: 186 ----GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 241
Query: 395 VHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLD 445
+H E + +L +LR+ +++ S + + + PVS E +
Sbjct: 242 LHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWT 300
Query: 446 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 301 FLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 156/355 (43%), Gaps = 40/355 (11%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 10 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 62 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
+R AL+PL + DD + V + ++AG+ I ++Y
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFY------- 167
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF
Sbjct: 168 ----GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 223
Query: 395 VHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLD 445
+H E + +L +LR+ +++ S + + + PVS E +
Sbjct: 224 LHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWT 282
Query: 446 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
L D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 283 FLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337
>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
Length = 106
Score = 71.2 bits (173), Expect = 1e-09, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPI 110
+LKE+PS + KH+PI
Sbjct: 91 MLKERPSPDGKHKPI 105
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 154/349 (44%), Gaps = 41/349 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 12 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 66 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y V Q +P+
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 169
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 170 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 229
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ +
Sbjct: 230 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 278
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
+ ++ D Y K V R G + VF +H E + +L +LR+
Sbjct: 279 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 110/451 (24%), Positives = 181/451 (40%), Gaps = 68/451 (15%)
Query: 84 WMHKNGLPPC-KVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL-G 141
W+ K G+ K+ LK+ S V A+ D + + F +P + V+ + V G
Sbjct: 16 WLSKIGVRINPKMTLKDLKSEGRGR----GVVAAADFEEDEVVFCIPRTAVLNVNNVFAG 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
++ A ++ L +M E +Q S W PY+ L ++ ++S + WS
Sbjct: 72 QDSGASKEALLQMPNWLALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWS 123
Query: 202 ETELAYLTGSPTKAEILERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEA 253
E ELA L S +I + E + + L F S+ Y +DIP E
Sbjct: 124 EEELAELQASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE- 182
Query: 254 FTFEIFKQAFVAVQSCVVHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
E KQ + L +K L+ ++PL L A + + A + +
Sbjct: 183 ---EPAKQENGGAEGETDDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYEN 235
Query: 307 DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED-NPYDRLVVE 365
+ +++ +P AGE I Y G P S LL YG+V E+ YD + +
Sbjct: 236 EDLEMRTIKPIMAGEEIFNDY-----------GQLPRSDLLRRYGYVTENYAQYDVVEIS 284
Query: 366 AA----LNTEDPQ---------------YQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
+A L TE PQ +++ +A R G L A E+ AI D
Sbjct: 285 SASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVALADREGILEDSYDVNIANAEERAIPD 344
Query: 407 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
L L ++ D ++++++S + S V L + R A Y TL EDE
Sbjct: 345 ELLALLYLFLLDNENLEAIVTSQSALPSRSKLATELVGKVLVKVLRHREAEYATTLEEDE 404
Query: 467 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
+L NL + +A Q+ EK++L ++
Sbjct: 405 KLLQAANLPRRTAMAIQVRHGEKRVLRLAVE 435
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 120/299 (40%), Gaps = 57/299 (19%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDK-------RGLFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + EL + S E + R I+ E+NE+ +V F +D + A
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDT 198
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
V+ V+ + +A A QL D+ Y A
Sbjct: 199 DNGSTEVEEVKVYSDCLYIALFCA--------------------------QLFADKNYAA 232
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
GE + ISF GP N+ L +++GF NP+D++ + ++ D ++K
Sbjct: 233 GEQV----TISF-------GPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 98/441 (22%), Positives = 190/441 (43%), Gaps = 56/441 (12%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW+ +NG V +K NE + + A++DL+ + ++P +++
Sbjct: 81 REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134
Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+TL+ V + L N + +A LAL+L+ E + + SFW YI L
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD- 248
+ L + + + SPT + + I R+Y +F +LFQ +
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQY-----AYF--NNLFQNSNDEA 238
Query: 249 --IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAM 301
I FT+++++ A V + + S + L+PL + +++ +
Sbjct: 239 SLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVENGINGLIPLWD-MCNHTNGYLST 297
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR 361
VD + L +P+K GE ++++Y G + NS L++ GFV ++NP+D
Sbjct: 298 QYKVDRSECLAC-KPFKKGEQVLIFY-----------GERSNSDFLVHNGFVYDENPHDS 345
Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY--VSDT 419
+ ++ D + + + + G F++++G E ++L +LR+ V +
Sbjct: 346 FRLRLGISKSDKLHGLRCELLKDLGIPDSGDFYLYSGSEP-VRENLLAFLRIFNMDVENL 404
Query: 420 SEMQSVISSLGPI----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+ +S S L + C + +E V + L D L Y E E + D N
Sbjct: 405 NHWKSHSSRLSDLMWKDCALDTKIESKVWNFLYDRINLLLKTYKG--DEVEVRVEDSNST 462
Query: 476 PKKRVATQLVRMEKKMLNACL 496
+R+ ++ EKK L++ L
Sbjct: 463 ECRRLVRAQLKCEKKFLSSIL 483
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 102/449 (22%), Positives = 183/449 (40%), Gaps = 61/449 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
+ + FT+E + A V + ++ + SL AL+P+ SK
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHEDSKITTD 288
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR 361
A + + R +K E I + Y GP+ NS ++ GFV DN D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISY-----------GPRTNSDFFVHSGFVYMDNEQDG 337
Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTS 420
+ ++ DP ++++ + + +V F + G E ISD +L +LR+ +
Sbjct: 338 FKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKE 394
Query: 421 EMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNL 474
E+ I S + + + +E V + + + RL A YP TL ED +L + L
Sbjct: 395 ELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTL 453
Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMI 503
K++A QL EK++L L+ I
Sbjct: 454 PRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 126/320 (39%), Gaps = 33/320 (10%)
Query: 52 RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
RF R S+ VS + L++W+ K C + L+ P
Sbjct: 49 RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
A + G VP ++T E + + L+ + L+ + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNE 230
+ SFW PYI L ++ +L P+LWS+ +L GSP ++ ER I RE E
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223
Query: 231 LDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------AR 280
+ AG L + P I A + + +S ++LQ L
Sbjct: 224 A-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLRPGSFAED 279
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
ALVP L SS + D L R Y GE + F +
Sbjct: 280 TIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQV-------FDSYGPS 332
Query: 338 CGPQPNSKLLINYGFVDEDN 357
C P S+LL++YGFVDE+N
Sbjct: 333 CSP---SRLLLDYGFVDEEN 349
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 91/418 (21%), Positives = 173/418 (41%), Gaps = 62/418 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A + + A + +P S ++TLE + T+A+ + +L L+ L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ SFW PYI L P+ ++ ++L +L GSP +I ++ ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
N++ V F QY F F A + S + + ++ + A VPL
Sbjct: 277 NDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLA 322
Query: 289 -------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCG 339
P L + YS + + + D+ ++ G+ I F + KC
Sbjct: 323 DMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMI-------FDSYGRKC- 366
Query: 340 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 399
NS+ +NYGFV E N + + + + DP Q K + + + + F +
Sbjct: 367 ---NSRFFLNYGFVVEGNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDT 422
Query: 400 EKEAISDMLPYLRLGYVSDTSEMQSVISSLG-------PICPVSPCMERAVLDQLADYFK 452
++ A+ D + ++R + D ++++ +++ P+ E + + K
Sbjct: 423 DETAVIDFMSHIRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICK 482
Query: 453 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 510
L YP T +D+ +L L +R L EK++L Q + M LL +
Sbjct: 483 KTLKQYPTTFEQDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 106/265 (40%), Gaps = 48/265 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ G+ SVP SLVV + R+L E N L
Sbjct: 59 VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV K L
Sbjct: 163 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGL 216
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
+ + P + + K + AVD + P K GE I V Y +
Sbjct: 217 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYG--------QS 264
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLV 363
N++LL++YGFV E+NP+D +V
Sbjct: 265 RQMSNAQLLMDYGFVLENNPHDAVV 289
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 117/455 (25%), Positives = 174/455 (38%), Gaps = 98/455 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
A D+QAG+ F VP L T + R + EL + LA L LY
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179
Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKA 215
+ +G SFW PY+ L +G ES PL LW+E E+ +L GSPT
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQ 274
++REY E L++++P+ E AF E F AF + S V L
Sbjct: 240 SARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLP 293
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VD 306
+ ALVP L +S+ C + + A D
Sbjct: 294 AENGM--LALVPYA-DLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSD 350
Query: 307 DA------------------VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
DA V DR Y E + V Y G + N++LL+
Sbjct: 351 DAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSY-----------GQKSNAELLL 399
Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQ----YQDKR--MVAQRNGKLSVQVFHVHAGREKE 402
YGFV + NPY+ + V +L+ + KR ++A + F ++A R
Sbjct: 400 LYGFVSDRNPYNSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPL 459
Query: 403 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
+ +L + L + S + + PV+ E A L K L YP +
Sbjct: 460 ELMQLLRFASL--TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSA 517
Query: 463 SEDEAMLTDYN----LHPKKRVATQLVRMEKKMLN 493
ED+A L D + L K+R++ +L R EK++L
Sbjct: 518 DEDDAALKDKSMAQLLSRKQRLSVRLRRSEKRILE 552
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 110/451 (24%), Positives = 188/451 (41%), Gaps = 46/451 (10%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
VA V S+ + D L +W+ G+ KV ++ P R VAA ED+
Sbjct: 44 VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
GDA ++P SL +T++ L + A + +AL L++E+ G+KS W Y+
Sbjct: 99 GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L Q +++PL WS E+A LTG+ ++L+ A G + T + S
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY--DSYARGTWARLKES 202
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKCK 299
F P P++AF F AF ++S C V ALVP +A S
Sbjct: 203 AFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVDQGADIALVP--GLDMANHSGLS 257
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP--------QPNSKLLINYG 351
+ +++ V K+G S+++ G K + +++L ++YG
Sbjct: 258 SQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYG 317
Query: 352 FVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
F D V+ A+ DP DK V + G F + A + E ++ +
Sbjct: 318 FTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFVLRAFEDPE--PELRVF 375
Query: 411 LRLGYV--SDTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSED 465
+RL + D ++++ + G I PVS E+ + + + L GY + +D
Sbjct: 376 MRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMINGCEEALRGYATRVEDD 435
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
+ D + + R+A ++ EK+ L L
Sbjct: 436 RRVAEDPGVGHRLRLAARVRMGEKQALADAL 466
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 118/288 (40%), Gaps = 46/288 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
+ A+E+++ G++ +P S ++T+ER + + L E + LA +L +
Sbjct: 24 LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
S + Y+R L R+ G L W E ++ L GSP++ +ER +
Sbjct: 82 AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+E+ +P P + AF + S ++ L A ALVP
Sbjct: 135 IDEIRA----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPW 176
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+L + C A + AV L DR YK GE + Y GP+P+S+LL
Sbjct: 177 A-DMLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASY-----------GPRPSSELL 224
Query: 348 INYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
I+YGF +NP D V ++ D K +R G V+ F
Sbjct: 225 ISYGFAPAVGENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 105/262 (40%), Gaps = 48/262 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ AG+ SVP SLVV + R+L E N L
Sbjct: 45 VVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENAL------ 98
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 99 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHYFARQR 148
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV K L
Sbjct: 149 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKSGL 202
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
+ + P + + K + AVD + P K GE I V Y +
Sbjct: 203 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYG--------QS 250
Query: 339 GPQPNSKLLINYGFVDEDNPYD 360
N++LL++YGFV E+NP+D
Sbjct: 251 RQMSNAQLLMDYGFVLENNPHD 272
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 98/460 (21%), Positives = 176/460 (38%), Gaps = 95/460 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A +D+ A F+VP S +V +E L +T E+ + + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RA 221
L+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I + A
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNA 643
Query: 222 EGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
E + + EL + GS Y +D F+
Sbjct: 644 EEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQN 693
Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
+ + A+ +VP+ +L ++ A + DDA+ + R KAG
Sbjct: 694 EDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAG 752
Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAAL 368
E I+ +Y GP PNS+LL YG+V + P+ D L L
Sbjct: 753 EEILNYY-----------GPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGL 801
Query: 369 NTED----------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
++E ++++ ++ + + + + + + E D+ L+
Sbjct: 802 SSEQLDSAREHLDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK------ 855
Query: 419 TSEMQSVISSLGPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
M I + P C V V L A + YP T+ EDE +L+ NL +
Sbjct: 856 --SMLKAIRKVDPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSER 913
Query: 478 KRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
++ A + EK++L + ++ + D + PAP
Sbjct: 914 RKAAVTVRLGEKRLLQEARVLLSE---IASDAILDDAPAP 950
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/419 (22%), Positives = 174/419 (41%), Gaps = 62/419 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + + + VP S ++TLE + +T +A+ + +L L+ L+ +L+ E
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE--MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQE 247
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
KK + SFW PY+ L + P+ +++++L +L GSP ++ ++ +K++
Sbjct: 248 KK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKD 300
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
Y ++ V A Q +F+ F A + S + + + + A VPL
Sbjct: 301 YCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAFVPL 346
Query: 288 GPPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
L YS + + + D+ ++ G+ I F + KC
Sbjct: 347 ADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMI-------FDSYGSKC 391
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
NS+ L+NYGFV +DN + + V + Q K +++ + + F +
Sbjct: 392 ----NSRFLLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVID 446
Query: 399 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKA 453
+ SD + ++R + + E +++ I P +S E A + + +
Sbjct: 447 PNDVSFSDFMSFIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIR 506
Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 512
L YP TL +D +L L +R L EKK+LN Q + M L + S
Sbjct: 507 ALNQYPTTLEQDLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 109/455 (23%), Positives = 176/455 (38%), Gaps = 62/455 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
++ + D L +W+ + G V ++ P + A+ D+ GDAA V
Sbjct: 73 ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P +L +T E ++ I L LAL L++EK G S W YIR L R
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----IKREYNELDTVWFMAG-SLFQ 243
A+++PL WS ELA L G+ ++L A G ++ + L F +LF
Sbjct: 186 ----ALDAPLFWSSEELAELAGT----QLLANAAGYDSYVRGTHAALKETTFKEHPALFG 237
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
D AF+ F AF ++S L V AL+P G + + C
Sbjct: 238 DAGDDDGGGAFSEREFLWAFGVLRSRA--LPPVDQGESIALIP-GIDMANHDGLCSQTWQ 294
Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFT-------GHDFKCGPQP---NSKLLINYGFV 353
+ + V A V + T G + +C P +S+ ++YGFV
Sbjct: 295 LNNGGIAAVFGGRGGADGGGSVLLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFV 354
Query: 354 DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 412
D V+ ++ +D DK V G F + A ++ +M+ ++R
Sbjct: 355 DAFCSRPGYVLGPLSIPEDDVNAFDKMDVLSVAGLKESPAFTIRA--FEDPPPEMVVFMR 412
Query: 413 LGYVSDTS----------EMQSVISSLGPICPVSPCMERAVLDQLADY-FKARLAGYPAT 461
L + + E ++IS PVSP D AD + L Y
Sbjct: 413 LLNLKNDDAFLLEAIFRQECWALISD-----PVSP-------DNEADAGCEEALGAYATK 460
Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
+ +D + D + P+ R+A ++ EK+ L L
Sbjct: 461 IEDDRGVADDADASPRLRLAARVRMGEKQALEEVL 495
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 100/449 (22%), Positives = 198/449 (44%), Gaps = 67/449 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
KSW+ ++G+ + ++E S E V A +D++ ++P ++T E
Sbjct: 83 FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136
Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +A L+ N++ + CLALYL E+ S + PYI + ++ +
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---F 254
L + E+ YL G+ + + + + I R++ L V+ GS ++ +P +A F
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAF 246
Query: 255 TFEIFKQAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKC 298
TF+ ++ AV + H+ V AL+P+ GP AY+
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--- 303
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
A+ ++ + + +K GE + + Y G + NS LLI+ GFV +++P
Sbjct: 304 -----ALTRGIECLAMQDFKTGEQVKICY-----------GARTNSDLLIHNGFVMKESP 347
Query: 359 YDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 417
+D++ + ++ +DP Y K +++ + N ++S Q +L +LR+ +++
Sbjct: 348 FDKVRIHLGVSQKDPLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMN 407
Query: 418 DT------SEMQSVISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 468
+ + ++ +SSL I V + V + L + K L G+ E M
Sbjct: 408 EEELRSWLEKQKNELSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEM 467
Query: 469 LTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
+ D +L + ++A Q E ++L+AC+
Sbjct: 468 MEDKSLTHRSKLALQFRIEEHRILSACVN 496
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 127/300 (42%), Gaps = 35/300 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+D L W+ +NG ++L + P V ++ +L+ GD S+P +L++T
Sbjct: 4 DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57
Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
V+ N + + + T K L+ L+LYL+ EK +GK SFW PYI+ L +
Sbjct: 58 TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTS 109
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+P +S E+ L +A + R + ++ Y L T LF +
Sbjct: 110 YTTPSYFSTAEVDALPALVREATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTV 163
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM----LAAVDDAV 309
FT + ++ A+ V + V+ K F L P P + A + + L D
Sbjct: 164 FTLKSYRWAWATVYTRSVY--KRGPGWEF-LDPSDPDVYALAPFLDMLNHSPLVQTDTDF 220
Query: 310 QLVVDRPYKA-GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
V + Y+ E Y F +D P N +LL+ YGFV NP+ + AA+
Sbjct: 221 N-VSSKCYEVKTEGACRKYRQVFINYD----PYDNGRLLMEYGFVMPRNPHSVVTFTAAV 275
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/293 (23%), Positives = 123/293 (41%), Gaps = 39/293 (13%)
Query: 81 LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L +WM NG K+ L++ P V A E L G+ +P SL+++
Sbjct: 31 LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84
Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L +++ T KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 85 L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ T L + E+ R I+R + +L + + G + ++ + FT+
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTW 192
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F A+ AV + + Q + + + AL P L + K A V + ++
Sbjct: 193 NNFVWAWTAVNTRCIFAQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEI 251
Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
+ + + A E + + Y GP N +L ++YGFV DNP D +VV
Sbjct: 252 LSHKSHDANEQVFISY-----------GPHSNRRLFLDYGFVLPDNPNDVVVV 293
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 99/429 (23%), Positives = 163/429 (37%), Gaps = 65/429 (15%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A L +G VP L + E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +SF+ P+I + + P+ W+E EL L G+ + ++R++
Sbjct: 95 GGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHLERDFE- 146
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
++ Q D P +IF + V +R F + G
Sbjct: 147 ---------NIHQAVAADFP------DIFASLPTLTIDDYMWAMSVIWSRAFGVSKGGKY 191
Query: 291 LLAYSSKCKAM---------------LAAVDDAVQLV---VDRPYKAGESIVVWYNISFT 332
L C AM + ++ Q++ V AG ++ + Y
Sbjct: 192 LHVL---CPAMDMFNHDVTVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISY----- 243
Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 392
G N+KLL +YGFV +N + + DP ++ K+ V N Q
Sbjct: 244 ------GQYSNAKLLYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQT 297
Query: 393 FHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 451
+ H + + +L LR+ +++ Q + I V E AV + L
Sbjct: 298 YDFHGTLLSNDVDERLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTC 355
Query: 452 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVT 510
+ +L+ Y TL EDEA+L + K R+A + VRME K QVT +I L
Sbjct: 356 RRKLSNYATTLEEDEAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWK 409
Query: 511 VSPCPAPYA 519
S P A
Sbjct: 410 QSLASKPDA 418
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 98/434 (22%), Positives = 177/434 (40%), Gaps = 38/434 (8%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
D D+ W NG KV+L++ + + PI + A ED++AG+ S+P +L+
Sbjct: 25 DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
RV + + ++ + + + LYL+ E+ S W P+++ L +
Sbjct: 78 TRVSDHSPVVHMIENTTIGRITAICLYLISERADSS-SHWKPWLQSLPPRFFHA------ 130
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIP 250
L +SE ++ + S K + + +++EY + F P ++
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVT 189
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDD 307
E FT+E F+ A+ V + + + R VPL GP ++ + + D
Sbjct: 190 REDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDA 249
Query: 308 AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 367
V + A + IS G N +LL N GF+ ++N + ++++
Sbjct: 250 QEHRCV---FSALHKVAKNSPISI-----GVGMSSNMELLANRGFMMQNNGNNFVLMKFQ 301
Query: 368 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 427
L+ + R + LS + +V R E +L LR+ +S E S
Sbjct: 302 LDRNSDMHASARESMMKQLNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGK 358
Query: 428 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 487
+L PV+ E L + LA YP T+ EDE +LT R A L+R
Sbjct: 359 ALA--TPVTLENEWRAYRLLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRR 415
Query: 488 EKKMLNACLQVTAD 501
E+K++ ++ A+
Sbjct: 416 EEKLIYESIKTWAN 429
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 96/438 (21%), Positives = 173/438 (39%), Gaps = 79/438 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ + E + EL + L + L L +M
Sbjct: 694 IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 754 YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 806
Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
R +L + GS Y +D+ E + +V
Sbjct: 807 FRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGWV 866
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + V++A +L ++ A + D+ + + R KAGE I+
Sbjct: 867 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 916
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQ 376
+Y GP PNS+LL YG++ E + P+D V+ +L +E Q
Sbjct: 917 NYY-----------GPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQ 963
Query: 377 DKRMVAQRNGKL------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL 429
D ++ + K+ + V +G + P + G D E +++ I L
Sbjct: 964 D--IMTETMDKMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLL 1021
Query: 430 ----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 485
G + + + + + + + Y T++EDE +L +L ++R+A ++
Sbjct: 1022 QKVDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVR 1081
Query: 486 RMEKKMLNACLQVTADMI 503
EKK+L ++M+
Sbjct: 1082 LGEKKLLQEAFDHFSEMV 1099
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 101/449 (22%), Positives = 182/449 (40%), Gaps = 61/449 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
+ + FT+E + A V + ++ + SL AL+P+ +SK
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHENSKITTD 288
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR 361
A + + R +K E I + Y G + NS ++ GFV DN D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISY-----------GARTNSDFFVHSGFVYMDNEQDG 337
Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTS 420
+ ++ DP +++ + + +V F + G E ISD +L +LR+ +
Sbjct: 338 FKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKE 394
Query: 421 EMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNL 474
E+ I S + + + +E V + + + RL A YP TL ED +L + L
Sbjct: 395 ELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTL 453
Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMI 503
K++A QL EK++L L+ I
Sbjct: 454 PRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 79/393 (20%), Positives = 171/393 (43%), Gaps = 51/393 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
+ A + +Q G+ +P +L+++++R + NE + L+E L ++L+ + G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KS W Y L R+ + W+ ++ +L GS T L E IK ++ L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
+ F L +YP I F ++ A + S + LQ + ++ +LVP
Sbjct: 212 EKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADFM 262
Query: 290 ---PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
P K + + + + + D+ Y + I Y G + N +L
Sbjct: 263 NHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTY-----------GQKTNLEL 311
Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEA 403
L+ YGF+ E NP+D + + +L+ +D ++ K+ ++ +++ +F+ +E
Sbjct: 312 LLLYGFILERNPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE--- 368
Query: 404 ISDMLPYLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
+ +LR +S+ E+ S +S + +E+ + + + L Y +
Sbjct: 369 ---LYEFLRFC-ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKV 424
Query: 463 SEDE---AMLTDYNLHPKKRVATQLVRMEKKML 492
SE++ ++ +++ + +++A + + EKK++
Sbjct: 425 SEEKILNSLNSNFLISKNQKMALKQSKCEKKII 457
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 82/389 (21%), Positives = 161/389 (41%), Gaps = 47/389 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +Q G+ +P +L+ L++ L E +E L+ L+E LA+ + E+ G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W Y L +++ + W +++ +L GS E IK ++ ++
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP----- 289
F L P + F + ++ A + S + LQ + ++ ALVP
Sbjct: 210 IFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVPYADFINHN 260
Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
P K + + ++ + + D+ Y + I Y G + N +LL+
Sbjct: 261 PFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTY-----------GQKTNLELLVL 309
Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISD 406
YGF+ E NP+D + + AL+T+D Y K ++ +++ VF+ +E
Sbjct: 310 YGFIIERNPFDSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------ 363
Query: 407 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
+ ++RL S+L + +E+ + + K L Y T++E++
Sbjct: 364 LYEFMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEK 423
Query: 467 AMLTDYN---LHPKKRVATQLVRMEKKML 492
+ N L ++ + + + EKK+L
Sbjct: 424 ILNNLSNIIVLTKNQKTSIKQRKCEKKIL 452
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 158/392 (40%), Gaps = 34/392 (8%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 98 DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L V+ PL W E+E+ + L GSPT + + RA + EY +
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREEYASIRAAIA 210
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--------- 287
+ Y+ TE F A V + V L ++ +ALVPL
Sbjct: 211 ADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANV---YALVPLVDLLPVVGA 263
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP---QPNS 344
PP + ++ A DA VVD VV N + C +
Sbjct: 264 PPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPVVCADALGRNAG 323
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEA 403
L ++ G V+ + D L + D Y K+ + + G Q F V A R
Sbjct: 324 DLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQAFPVFADRMP-- 381
Query: 404 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 463
+ Y+R V + SE+ +V I VSP E +L L + LA Y +
Sbjct: 382 -LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAREMLAEYENSSE 438
Query: 464 EDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 494
E E + L + N+ ++ A +L EK+++NA
Sbjct: 439 EFELLQLKETNISERQMTAAKLRLGEKRLINA 470
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 97/444 (21%), Positives = 173/444 (38%), Gaps = 93/444 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ F++P ++ +E GN+ E + L L L
Sbjct: 44 IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++YE QG S W Y L + ++ + W +L YL GS ++I +
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR- 281
L + AG F Q P+E+ ++ + + + L+ +
Sbjct: 157 DEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQE 215
Query: 282 -------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYN 328
+VP+ +L ++ A + +D + + RP KAGE I+ +Y
Sbjct: 216 DEEWVEDREGKTMLGMVPMA-DILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYY- 273
Query: 329 ISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK-- 378
GP PNS+LL YG+V + P+D +V++ L TE + D+
Sbjct: 274 ----------GPHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVW 320
Query: 379 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--------- 429
+ VA+ ++ V E S+ G++ +++Q V + L
Sbjct: 321 KQVAEHVDPEDLEDVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLK 373
Query: 430 ------GPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
G + P V C+ + L +L LA YP T EDEAML N+
Sbjct: 374 AIKKVRGDLIPDKRKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTS 426
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
++++A ++ EK+++ LQV
Sbjct: 427 RQKLAVEVRLGEKRLIKEALQVAG 450
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 103/420 (24%), Positives = 170/420 (40%), Gaps = 68/420 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE----LACLALYLMYEK 168
V A+ D+ + FS+P + V L + +A L + +L E L LM E
Sbjct: 42 VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
Q + S W PY+ L + ++S + WS++ELA L S +I ++
Sbjct: 96 -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147
Query: 222 ----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQ 274
+G+K E+ S+ Y +DIP + + A V +
Sbjct: 148 YIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDE 204
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
K L+ ++PL L A + + A L ++ +++ +P GE I Y
Sbjct: 205 KTILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDY------- 253
Query: 335 DFKCGPQPNSKLLINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQD 377
G P S LL YG+V D + YD L+V N T+D +
Sbjct: 254 ----GQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKT 308
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
+ +A+R G VH+ ++ +I D L + D S ++++++S I S
Sbjct: 309 RLELAEREGVYEDSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSK 368
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
LA +AR Y TL EDE +L + +L + +A Q+ EKK+L A +Q
Sbjct: 369 LTTELAGQVLATLLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 108/412 (26%), Positives = 178/412 (43%), Gaps = 47/412 (11%)
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
+ RP+ + AS D Q GD FSVP+S ++ E V + +L L +AL L
Sbjct: 65 QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ ++ KS Y + +++PLLWSE EL L G+ ++L+ G
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARR 281
+ T + LF P P FT F A AV+S + K++LA
Sbjct: 171 LTFFRS--TFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALA-- 226
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWYNISFTGHDFKC 338
L L A +SK A + Q++V R + GE + + Y
Sbjct: 227 -PLTELVSHRRAANSKLSVRSAGLFGRGQVLVLEATRAIRKGEPLSMDY----------- 274
Query: 339 GP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
GP + + +L++YG +D +P + + D DK + + N V+++
Sbjct: 275 GPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPDSDRFIDDKLDILESNDLPQSVVYNLTP 334
Query: 398 GREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPIC-PVSPCMERAVLDQLADYFK 452
+++ +ML +LRL + SD ++S+ + G + PVS E AV + L++ +
Sbjct: 335 --DEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDVWGFMQEPVSEGNEEAVCNTLSEGAR 392
Query: 453 ARLAGYPATLSEDEAMLTDYNLHPK--KRVATQLVRM-EKKMLNACLQVTAD 501
A L GY T+ +D A L K +R A L+R+ EK+ L+A + D
Sbjct: 393 AALGGYGTTIDQDLAELRAQGSRAKGSRREAALLIRLGEKEALDAVARFFED 444
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 118/291 (40%), Gaps = 43/291 (14%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
EE+ L W +NG+ V K +P+ + R + A E + + + SVP L+
Sbjct: 47 EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99
Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T LE +GN + K S L L+LMYEK K SFW PYIR L
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
+P ++ EL +L + + E+ IK+ Y + + Q + +
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF 208
Query: 250 PTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AML 302
FE FK A+ V + V H + A+ AL PL LL + K +
Sbjct: 209 -WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRF 266
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
+ ++ V Y+ G + + Y GP N++L + YGFV
Sbjct: 267 NSSSKNYEINVITEYQKGTQVFINY-----------GPHDNTRLFLEYGFV 306
>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
Length = 206
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 400
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G
Sbjct: 10 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67
Query: 401 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 454
E++ MLPYLRL + T ++++ + G + PVS E A+ + + K+
Sbjct: 68 -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126
Query: 455 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 118/272 (43%), Gaps = 36/272 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
FQ P+ FT++ F A V S + + +L+P L +
Sbjct: 178 -------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSLIPFAD-FLNH 222
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
+A++ + DD ++ DR Y GE +++ Y G N+ L++++GF
Sbjct: 223 DGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRY-----------GKFSNATLMLDFGF 271
Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 384
N YD++ ++ + DP K + Q+
Sbjct: 272 TIPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 127/300 (42%), Gaps = 62/300 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++D++ GD FS+P S++++ + ++EL ++LS + L L +MYE ++
Sbjct: 41 VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW PY L R+ +P+ W++ +L L G+ ++I + E + +NEL+
Sbjct: 97 DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR--------- 281
+ ++YP + T E+F + + S LQK
Sbjct: 149 PI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEE 201
Query: 282 ---------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
++VP+ L + A L D++Q+ + K GE I
Sbjct: 202 EEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIKEGEQIYNT 261
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 380
Y G N+ LL YGFVDE N +D L+VE +D +++++
Sbjct: 262 Y-----------GDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 93/415 (22%), Positives = 168/415 (40%), Gaps = 74/415 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A D+ + +P L + + V +GN + L +AL+L+ EK +
Sbjct: 84 AERDIARNEVVLEIPKKLWINPDAVAASDIGN-------VCSGLKPWISVALFLIREKLK 136
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------SETELAYLTGSPT 213
+ S W PY+ L S + W SE ELA L G+
Sbjct: 137 KEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRSEEELAELQGTQL 189
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
L E ++RE+ +++ + + +P I T + F AF ++S
Sbjct: 190 LRTTLGVKEYMQREFAKVEEEILLPHK--ELFPSPI-----TLDDFLWAFGILRSRAFSR 242
Query: 274 QKVSLARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
+ + L+PL + AY K + + + L K+GE +
Sbjct: 243 LR---GQNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPISVKSGEQV 298
Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
++ Y+++ + N++L ++YGF+++ + + ++ DP + DK +A+
Sbjct: 299 LIQYDLN----------KSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAE 348
Query: 384 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPC 438
NG F + G MLPYLRL + T ++S+ + G + P+S
Sbjct: 349 TNGSGETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRA 406
Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
E + + D K+ L+GY T+ EDE L +L+P+ +A + EKK+L
Sbjct: 407 NEELICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQ 460
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 96/446 (21%), Positives = 182/446 (40%), Gaps = 53/446 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
SK+ + +G +W+ +NG + E P ++ + A + + +P
Sbjct: 75 SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128
Query: 132 LVVTLERVLGNETIAELLTTNK------LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
L+ ++ + EL+T + ++A LA+ L+ E+ + + S W PY+ L
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
+ L + ++ L GSPT L++ I R+Y+ + V+ +
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAA 304
D+ FT+E + A V + + +R AL+P+ + + A
Sbjct: 235 LRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNA 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
+ + R +K GE I + Y GP+ NS ++ GFV +N D +
Sbjct: 291 TSNYCECYALRDFKKGEQIFISY-----------GPRTNSDFFVHSGFVYMENKQDGFKL 339
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQ 423
++ D +++ + + +V F + G E ISD +L +LR+ + +E+
Sbjct: 340 RLGISKADSLQKERIELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FSMRKAELA 396
Query: 424 SVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPK 477
I S + + + +E V + + + RL A YP TL ED +L + L
Sbjct: 397 HWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPQI 455
Query: 478 KRVATQLVRMEKKMLNACLQVTADMI 503
K++ QL EK++L L+ I
Sbjct: 456 KKLTIQLRVTEKRILQGALEYVEQWI 481
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 112/273 (41%), Gaps = 49/273 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
+ A+E ++ G+ +P ++T+E L + E +L E + LA +L
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
E K + Y++ L R G S L W E+++ L GSP+ LER +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
E+ + Q+ +DI +F S ++ L+ SL AL
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLAL 272
Query: 285 VPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
VP +L + C+A L V L DR Y+ GE + Y G +P
Sbjct: 273 VPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASY-----------GQRP 320
Query: 343 NSKLLINYGFVDE--DNPYDRLVVEAALNTEDP 373
+S+LLI+YGF DNP D + ++ EDP
Sbjct: 321 SSELLISYGFAPAVGDNPDDEYALNLQIDEEDP 353
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 107/244 (43%), Gaps = 38/244 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA L+ EKK G+KS W+PYI L + + S + W E EL+ + S E ++
Sbjct: 2 LAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVK 56
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ I+++++ F+A + Q P I TE E F A+ V S + +
Sbjct: 57 QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWYNISFTG 333
+R +L+P + +L D+ +Q+ DR Y G+ + + Y
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKY------ 157
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ-- 391
G N+ L++++GF N +D + ++ + +DP K + Q + +V+
Sbjct: 158 -----GEFSNATLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDI 212
Query: 392 -VFH 394
+FH
Sbjct: 213 NIFH 216
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 107/458 (23%), Positives = 179/458 (39%), Gaps = 104/458 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
+ A +D+ A F++P ++ +E + + ++ + +L + L L
Sbjct: 610 IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------A 215
LMYE QG+KS W PY L + ++P+ WSE+EL L S + A
Sbjct: 670 VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADA 722
Query: 216 EILERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
E + R G R ++L + GS Y +D+ + E
Sbjct: 723 ESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDG 782
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+V + + +VP+ +L ++ A + ++++ + RP KAGE
Sbjct: 783 WVEDREGKSMM---------GMVPMA-DILNADAEFNAHVNHEEESLTVTSLRPIKAGEE 832
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMV 381
I +Y GP PNS+LL YG+V E + YD VVE + + V
Sbjct: 833 IFNYY-----------GPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------V 872
Query: 382 AQRNGKLSVQVFHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-------- 431
+ N +S QV H E+E D R +T E+ S + GP
Sbjct: 873 MRLNFGISGQVLEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPE 927
Query: 432 ---------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDY 472
+ V A+ D+ LA +A + YP + SED +L
Sbjct: 928 DLQEQLKTFLKGVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQ 987
Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA--DMIMLLPD 508
+L + R+A ++ EKK+L + T+ D+ M + D
Sbjct: 988 DLSQRTRMAIEVRLGEKKLLQEAIASTSSVDVEMTVDD 1025
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 133/303 (43%), Gaps = 46/303 (15%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 93 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 200
Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--V 309
F + +C +V + + +L+P + + ++L +D
Sbjct: 201 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 248
Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
+++ DR Y GE +++ Y G PN+ LL+++GF N YD++ ++ +
Sbjct: 249 EVIADRNYAPGEQVLIRY-----------GKFPNATLLLDFGFTLPYNIYDQVQIQVNIP 297
Query: 370 TED 372
D
Sbjct: 298 HHD 300
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 135/309 (43%), Gaps = 51/309 (16%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 94 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 201
Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---- 307
F + +C +V + + +L+P + + ++L +D
Sbjct: 202 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 249
Query: 308 ----AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
++++ DR Y GE +++ Y G PN+ LL+++GF N YD++
Sbjct: 250 ESSSTLEVIADRNYAPGEQVLIRY-----------GKFPNATLLLDFGFTLPYNIYDQVQ 298
Query: 364 VEAALNTED 372
++ + D
Sbjct: 299 IQVNIPHHD 307
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 107/471 (22%), Positives = 187/471 (39%), Gaps = 61/471 (12%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V+ R +SK++ +L W+ NG K+ L+E + + R +H + + L
Sbjct: 18 TPVSPPRNGMSKEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVP 71
Query: 122 GDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFW 176
G+ +P ++T+E +G +T I L + +A L ++L+ + + + SF+
Sbjct: 72 GERILVIPKKCLITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFF 129
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y L P+ WSE EL++L GS +I ER I+++Y+ + V
Sbjct: 130 RNYYSTLPSTLS------NMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-- 181
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
D F+ + F A + V S L + + ALVP L Y
Sbjct: 182 -----------DPSFARFSLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRP 229
Query: 297 K-CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE 355
+ DA + G + Y G + N + L+NYGF E
Sbjct: 230 RETSWTFDQSIDAFTITSLGTIGTGAQVYDSY-----------GKKCNHRFLLNYGFAVE 278
Query: 356 DNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
DN + ++++ L+ D Q + DKR +G ++ + +
Sbjct: 279 DNTEEDGRNPNEVLIDFQLSPADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGF 337
Query: 409 PYLRLGYVSDTSEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSED 465
+ RL + T E S + P P+S E L+ L + +L+ Y T+ ED
Sbjct: 338 SFARL--IVATEEEFSTMKMKSPAHSSPPISFDNEIRALEYLRNLMTHQLSLYDTTIEED 395
Query: 466 EAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
+L Y L + A +R EK++ Q AD ++ L + ++ C
Sbjct: 396 NELLASKQYPLFSNRIQALFFIRGEKQVCRY-FQELADKVIPLFSLPLAEC 445
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 97/438 (22%), Positives = 173/438 (39%), Gaps = 79/438 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ E + + +L + L + L L +M
Sbjct: 43 ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 155
Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
R +L + GS Y +D+ E + +V
Sbjct: 156 FRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGWV 215
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + V++A +L ++ A + D+ + + R KAGE I+
Sbjct: 216 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 265
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQ 376
+Y GP PNS+LL YG++ E + P+D V+ +L +E Q
Sbjct: 266 NYY-----------GPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQ 312
Query: 377 D------KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL 429
D +RM ++ + V +G + P + G D E +++ I L
Sbjct: 313 DIMAETMERM--DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLL 370
Query: 430 ----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 485
G + + + + + + + Y T++EDE +L +L ++R+A Q+
Sbjct: 371 QKLDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVR 430
Query: 486 RMEKKMLNACLQVTADMI 503
EKK+L ++M+
Sbjct: 431 LGEKKLLQEACDHFSEMV 448
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 102/447 (22%), Positives = 176/447 (39%), Gaps = 72/447 (16%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + ++ LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + K +S L L +L+ EK G++S W PY+ L + P
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCP 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + Y + SLF + I F +
Sbjct: 140 VC-LEQDVVQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYS 194
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAV 309
+ A+ + + +++ K S F+L P L P LL +S + KA
Sbjct: 195 ALEWAWCTINTRTIYM-KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNY 253
Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
++ + K E + + Y GP N +LL+ YGFV DNP+ + V +
Sbjct: 254 EIQTNSQCKKYEEVFICY-----------GPHDNQRLLLEYGFVAVDNPHSSVYVSSDTL 302
Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 429
+ DK Q+N KLS+ H D+L L G+ + + + + L
Sbjct: 303 LKYFPSLDK----QKNAKLSILKEH-----------DLLENLTFGWDGPSWRLLTALKVL 347
Query: 430 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM-----LTDYNLHPKKRVATQL 484
C RA+ L D AR +E +A+ + + + + V Q+
Sbjct: 348 SLGGDEFTCWRRAL---LGDVISAR--------NEQQALNITTKICHFLIEETQHVLLQI 396
Query: 485 VRMEKKMLNACLQVTADMIMLLPDVTV 511
+M++ N Q+T + L D+ +
Sbjct: 397 SQMKRDKENLKEQLTLVEALRLEDLKI 423
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 106/448 (23%), Positives = 176/448 (39%), Gaps = 70/448 (15%)
Query: 83 SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
+W+ NG K+ L + N + A EDL + FSVP S ++T E
Sbjct: 16 NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E + + + LS + +AL + G +S W PY L ++ ++ + WS
Sbjct: 72 GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120
Query: 202 ETELAYLTGSPTKAEI----------------LERAEGIKREYN-ELDTVWFMAGSLFQQ 244
+ EL +L GS +I +ER K N EL + GS
Sbjct: 121 DRELRHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMA 180
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
Y +D+ T + + + + L K +VPL L A + + A L
Sbjct: 181 YGFDLETSSDQAKNDGEEWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFY 233
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN------- 357
DD V + +P KAGE + YN DF G P + LL YG++ DN
Sbjct: 234 EDDKVVMKTIKPVKAGEEL---YN------DF--GSLPRADLLRRYGYL-TDNYAQYDVV 281
Query: 358 --PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 415
P D + A L T+D ++ A+ G L A E+ + L L
Sbjct: 282 EIPADLIKERAGLRTQD--VDERWQYAEEQGVLDDGYDVSRASSEEGQFPEELCVLLNLL 339
Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
+E + V + P + +L + Y R A YP + + M +D +L+
Sbjct: 340 ALPRAEFEKVKNKDKIPKPDLTTNAKKLLRTILVY---RYAAYPGNVDQ---MHSDVSLN 393
Query: 476 PKKR-VATQLVRMEKKMLNACLQVTADM 502
++R +A +++ EK++L + +++
Sbjct: 394 DRRRKMAIVVIQGEKQVLQEAVDAISEI 421
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 102/220 (46%), Gaps = 26/220 (11%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 273
ER + +++Y+ +LD + F +QY +++ A T I +AF A V S V
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
+ AL+PL L + K A D+ + L+V + AG+ I Y
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNY------ 268
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
GP+ N +LLINYGF NP D +V + + P
Sbjct: 269 -----GPRNNEQLLINYGFCIAGNPTDYRIVLLGVKPDSP 303
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 71/315 (22%), Positives = 123/315 (39%), Gaps = 61/315 (19%)
Query: 81 LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L W H +G P ++ EK + + +P +A+E L + A + P S+ ++
Sbjct: 12 LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67
Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
L + I LT N L YL+ + +GK S W PYI
Sbjct: 68 NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
L + A+ P W+E ++ L G+ I E + +K EY + + GS
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185
Query: 241 --LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--- 295
+ Q Y+ FT F+ + + +S ++++ L+P G + +S
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LLPEGAKIDDFSILQ 237
Query: 296 -----------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
++ L + A +L+ Y+ G+ + Y G + NS
Sbjct: 238 PLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNY-----------GKKTNS 286
Query: 345 KLLINYGFVDEDNPY 359
+LL+ YGFV E+N Y
Sbjct: 287 ELLLGYGFVTENNDY 301
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 96/220 (43%), Gaps = 41/220 (18%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
R E I E+ ++ + LF + + E FK + V S + +++
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214
Query: 280 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
LV + P+L + +S K + + + DR Y + I W N
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQI--WINY----- 265
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 374
G N++L ++YGF +NPYD E L T+ P+
Sbjct: 266 ----GDLSNAELALDYGFTVPENPYD----ETDLLTQFPE 297
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 143/363 (39%), Gaps = 69/363 (19%)
Query: 160 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L +YL++++K G SF+ PY L P+ WS EL L GS ++I
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+R + I+ +Y + V G+L T + FK A + V S LQ +
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216
Query: 279 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
R ALVP L Y + K V + + +AG + Y
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSY---------- 266
Query: 338 CGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
G + N + L+NYGF EDN + + +E ++ D +QDK R
Sbjct: 267 -GQKCNHRFLLNYGFAVEDNRELDGFCPNEVPLELYVDPADILFQDKLEFWTRG------ 319
Query: 392 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 451
E IS G V+ Q+V S+G P S E +
Sbjct: 320 --------ETNQIS--------GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK-- 360
Query: 452 KARLAGYPATLSEDEAMLTDYNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIML 505
+ RLA YP T+S+D A L D +P+ +R A VR EK++L+ + DM+
Sbjct: 361 RVRLASYPTTISQDMADLQDEASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTF 420
Query: 506 LPD 508
+ D
Sbjct: 421 IED 423
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 60/271 (22%), Positives = 103/271 (38%), Gaps = 50/271 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 63 ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI+ L + + + +LW EL + S E +E E K+E+ L
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEFLALKPA 174
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
LF + A + V + QK
Sbjct: 175 LDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK------------------- 215
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
D +++ DR Y GE +++ Y G N+ L +N+GF
Sbjct: 216 ------------DVCEIIADRNYAVGEQVMIRY-----------GKYSNATLALNFGFTL 252
Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 385
N YD+ ++ + +DP Y+ K + Q++
Sbjct: 253 ARNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 133/328 (40%), Gaps = 74/328 (22%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
G L W ++G P ++ L ++ + P +A+E LQ G AA S P + +
Sbjct: 10 FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65
Query: 135 TLERVLGNETIAELLTTNKLSELACL------------------------ALYLMYEKKQ 170
+ L + + +++ ++ A L +L+ E +
Sbjct: 66 SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
GK SFW PYI L Q+AV + P W + ++AYL G+ I E E +KRE+
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182
Query: 230 ELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLARRFAL 284
+ A L ++ + D+P A+T ++K AF S L + R AL
Sbjct: 183 Q-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRRLSAL 233
Query: 285 VPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+P G PLL ++ +V D +L+ PY+ G + YN
Sbjct: 234 LPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQV---YN-- 288
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNP 358
G + NS+LL+ YGF+ + P
Sbjct: 289 ------NYGLKTNSELLLAYGFILPETP 310
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 102/452 (22%), Positives = 177/452 (39%), Gaps = 59/452 (13%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ NG K+ L+E + + R +H + + L G+ +P ++T+E
Sbjct: 44 ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95
Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+G +T I L + +A L ++L+ + ++ + SF+ Y L
Sbjct: 96 MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
P+ WS+ EL +L GS +I ER I+++Y+ + V D F
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARF 196
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVV 313
+ + F A + V S L + + ALVP L Y + DA +
Sbjct: 197 SLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITS 255
Query: 314 DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY------DRLVVEAA 367
G + Y G + N + L+NYGF EDN + ++++
Sbjct: 256 LGTIGTGAQVYDSY-----------GKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQ 304
Query: 368 LNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS-- 424
L+ D Q + DKR +G ++ + + + RL V+ E S
Sbjct: 305 LSQADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL-IVATEDEFSSMK 362
Query: 425 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVAT 482
+ S P+S E L L D +L+ Y T+ ED +L Y L + A
Sbjct: 363 MKSPAHSSPPISFDNEIRALQYLRDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQAL 422
Query: 483 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
+R EK++ Q AD ++ L + ++ C
Sbjct: 423 FFIRGEKQVCRY-FQELADKVIQLFSLPLAEC 453
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 92/207 (44%), Gaps = 34/207 (16%)
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
+ R + AGE +++ Y GP+ N LL YGFV++DNP D + ++
Sbjct: 40 VTTQRGWTAGEQVLISY-----------GPRSNDHLLRRYGFVEQDNPNDVYRITGLIDK 88
Query: 371 -EDPQYQDK-RMVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGY 415
D +D R++ + GKL V V GR EKE ++P RL
Sbjct: 89 LSDVLGKDSVRVLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAV 147
Query: 416 VSDTS--EMQSVISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 472
V D E ++ SL +SP E A D L G+ TL+EDEA L+
Sbjct: 148 VKDDQLPEGKAAGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSL 207
Query: 473 N--LHPKKRVATQLVRMEKK-MLNACL 496
L +KRVA RMEKK +L+A +
Sbjct: 208 GNSLGAQKRVAFSF-RMEKKRVLDAAI 233
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 123/293 (41%), Gaps = 35/293 (11%)
Query: 79 GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
G L +WM NG K+ L++ P V A E L G+ +P +L+++
Sbjct: 30 GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83
Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L + + ++ + KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 84 TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 136
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ L + E+ I+R + +L + + G + ++ + FT+
Sbjct: 137 PVFLRRKVFESLP-KDLREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTW 193
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F A+ AV + + Q + + + AL P L + K A V + ++
Sbjct: 194 NNFVWAWTAVNTRCIFAQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEI 252
Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
+ + + A E + + Y GP N +L ++YGFV DNP D +VV
Sbjct: 253 LSHKSHDANEQVFISY-----------GPHSNRRLFLDYGFVLPDNPNDVVVV 294
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 90/436 (20%), Positives = 162/436 (37%), Gaps = 76/436 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ A F++P ++ +E GN+ E + L L L
Sbjct: 42 IIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 101
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
++YE QG S W PY L + + + W ++L L GS ++I E A
Sbjct: 102 VMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEA 154
Query: 222 EGIKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ + R EL + GS+ Y +D+ E +
Sbjct: 155 DEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEE 214
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
++ + L V +A +L ++ A + DD + + RP A
Sbjct: 215 DDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVNHGDDELTVTALRPIPA 264
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALN 369
GE I+ +Y GP PNS+LL YG+V + P+D + V L
Sbjct: 265 GEEILNYY-----------GPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLK 313
Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISS 428
D ++ + V V +G + E + +R ++++V+ +
Sbjct: 314 IGDDVWKQVQEYVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKA 373
Query: 429 L----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 484
+ G + P + + + L+ YP + EDEA+L +L ++R+A +
Sbjct: 374 IKKINGDLIPDKRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHV 433
Query: 485 VRMEKKMLNACLQVTA 500
EKK+L L+
Sbjct: 434 RLGEKKLLKEALEFAG 449
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 143/353 (40%), Gaps = 71/353 (20%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+H++G V + E + + A++D G +VP ++++ E+
Sbjct: 88 SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140
Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
++E + + L + LAL+L+ EK SFW PYI L + + L
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT---- 251
++ ELA L SP L+ I R+Y N++ T+ D+P
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNL 239
Query: 252 -EAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
+ FTF+ ++ A V Q+ +V +L A +PL C +
Sbjct: 240 QDIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITT 290
Query: 308 AVQLVVDR-------PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
L ++R Y+ E I ++Y G +PNS L ++ GFV DN YD
Sbjct: 291 DFNLELNRGECYALQDYRRDEQIFIFY-----------GARPNSDLFLHNGFVYPDNDYD 339
Query: 361 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
L + ++ D K + + G V F ++ G ++ ++L ++R+
Sbjct: 340 SLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGASPISV-ELLAFIRI 391
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 59/411 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + + V +E + +AL+L+ EK + +
Sbjct: 80 LVAQRDISRNEVVLEIPKKLWINPDVVAASEIGN---VCGGVKPWVSVALFLIREKLK-E 135
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L + S + WSE ELA L G+ + L ++RE+ +++
Sbjct: 136 DSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYLRREFLKVE 188
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ Q +P + T + F AF ++S + + L+PL
Sbjct: 189 EEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR---GQNLVLIPLADLCN 238
Query: 289 -------------PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
P + Y K + + D L KAGE +++ YN++
Sbjct: 239 FLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSR-DLIFSLRSPISLKAGEQVLIQYNLNL 297
Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
+ N++L ++YGF++ + + + ++ DP + DK +A+ NG +
Sbjct: 298 S----------NAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIA 347
Query: 392 VFH-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQL 447
F V + L + LG +D+ ++S+ + G + PVS E + +
Sbjct: 348 DFDIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVV 406
Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 498
D K+ L+GY T+ EDE L L+P+ +A + EKK+L ++
Sbjct: 407 RDACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 62/259 (23%), Positives = 115/259 (44%), Gaps = 22/259 (8%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E+FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 12 ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
V + ++AGE I ++Y G + N++ +++ GF ++N +DR+ ++ ++
Sbjct: 72 CVALQDFRAGEQIYIFY-----------GTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSK 120
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--- 427
D Y K V R G + VF +H E + +L +LR+ +++ + ++
Sbjct: 121 SDRLYAMKAEVLARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENA 179
Query: 428 -----SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 481
+LG PVS E + L D L Y T+ ED+A L L P+ +A
Sbjct: 180 IDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMA 239
Query: 482 TQLVRMEKKMLNACLQVTA 500
+L EK++L ++ A
Sbjct: 240 VKLRLGEKEILEKAVRSAA 258
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 102/237 (43%), Gaps = 37/237 (15%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWYNISFTGH 334
+AL+PL C + A V+DR +KAGE + ++Y
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFY------- 496
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
G + N+ L ++ GFV E+N YD + ++ DP Q + + GKLS+
Sbjct: 497 ----GSRSNADLFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTC 549
Query: 395 VHAGREKEAISD--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQ 446
+ R+ + D +L +LR+ + + ++ I+S LG I C + +E
Sbjct: 550 DFSIRKGASPIDGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRF 608
Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
L K L+ Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 609 LHARLKLLLSTYKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 665
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 101/230 (43%), Gaps = 23/230 (10%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
+AL+PL + V D + + + +KAGE + ++Y G +
Sbjct: 14 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFY-----------GSR 62
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
N+ L ++ GFV E+N YD + ++ DP Q + + GKLS+ + R+
Sbjct: 63 SNADLFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKG 119
Query: 402 EAISD--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKA 453
+ D +L +LR+ + + ++ I+S LG I C + +E L K
Sbjct: 120 ASPIDGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKL 178
Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
L+ Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 179 LLSTYKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/259 (22%), Positives = 117/259 (45%), Gaps = 22/259 (8%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 23 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 82
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ ++ ++
Sbjct: 83 CVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSK 131
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSE 421
D Y K V R G + VF +H E + +L +LR+ +++ S
Sbjct: 132 SDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSA 190
Query: 422 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 481
+ + + PVS E + L D L Y T+ ED+++L +++L + ++A
Sbjct: 191 IDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMA 250
Query: 482 TQLVRMEKKMLNACLQVTA 500
+L EK++L ++ A
Sbjct: 251 IKLRLGEKEILEKAVKSAA 269
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/421 (21%), Positives = 171/421 (40%), Gaps = 64/421 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
+ A +D+ A F+VP S +++ E + ET E+ + + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYE +G +S W PYI L + E+P+ WS+ EL L S T++++ +A
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKV-GKASAE 648
Query: 225 KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 271
+ +++ V LF Q Y D + +++F+ +
Sbjct: 649 EMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEE 708
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
+++ +VP+ +L ++ A + DDA+ + R KAGE I +Y
Sbjct: 709 WVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIFNYY---- 763
Query: 332 TGHDFKCGPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMV 381
GP PNS+LL YG+V + + YD + V A+L Q DK
Sbjct: 764 -------GPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARE 815
Query: 382 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCM 439
+ +L E + R + + +++S++ ++ P S
Sbjct: 816 CLDSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVD 875
Query: 440 ERA-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
+R +L + D +R YP ++SEDE +L ++ ++R A + EK+++
Sbjct: 876 KRKRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLI 932
Query: 493 N 493
Sbjct: 933 Q 933
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 96/454 (21%), Positives = 180/454 (39%), Gaps = 90/454 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A +D+ A F++P ++ E + I ++ +K L + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE QG S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690
Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R EL + GS Y +D+ + E ++
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 370
+Y GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 800 NYY-----------GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 846
Query: 371 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 420
E+ +++D ++ + G+++ + + D+ L+ ++
Sbjct: 847 ILEQIRGEFEEEEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIK 905
Query: 421 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
++QS P + +AVL + + AR YP ++SED+ +L NL + R+
Sbjct: 906 KLQS---DTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARM 959
Query: 481 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
AT + EKK+L + ++ + + D P
Sbjct: 960 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 993
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 118/262 (45%), Gaps = 36/262 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V D+ + S P++L + + + A+ +T ++ A L ++L E +GK
Sbjct: 43 VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW PY+ L R+ ++PL +S+ +L +L G +A +E + I RE E
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA----LVPLG 288
A S+ Q+ YD TE +T+E++ A S + + R +P+
Sbjct: 151 ----AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPIL 204
Query: 289 PPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
PL+ + A + D +++++ AG + YN GP+ N +
Sbjct: 205 FPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEV---YN--------NYGPKANEE 253
Query: 346 LLINYGFVDEDNPYDRLVVEAA 367
LL+ YGF NP+D +++++
Sbjct: 254 LLMGYGFTLLQNPFDSFLLKSS 275
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 122/292 (41%), Gaps = 48/292 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + I L L + LMYE+ +GK
Sbjct: 34 VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L RQ P LWSE E+ L G+ + E +K ++ E
Sbjct: 90 GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 141
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
L ++ P + P + FTFE +++A +S V ++ + +VPL
Sbjct: 142 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 191
Query: 291 LLAYSSKCKA-----MLAA-----------VDDAVQLVVDRPYKAGESI---VVWYNISF 331
++ K A ML A +DD + R + +S+ V+ +++
Sbjct: 192 --LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAA 249
Query: 332 TGHDFKC-GPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 380
F G N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 250 GSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 111/260 (42%), Gaps = 50/260 (19%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 77 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
+P L + +AML DD ++V DR + GE +++ Y G
Sbjct: 228 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRY-----------GKYS 275
Query: 343 NSKLLINYGFVDEDNPYDRL 362
N+ L++++GF N +D++
Sbjct: 276 NATLMLDFGFALPYNIHDQV 295
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 78/161 (48%), Gaps = 18/161 (11%)
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
M +F +YP P E F E FK +F + S +V L S+ + ALVP ++ +S
Sbjct: 73 MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSC 129
Query: 297 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
+ + L + +V DRPY+ GE + + Y G + N +LL++YGFV
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVP 178
Query: 355 ED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
++ NP D + + +L D Y++K + ++ G Q F
Sbjct: 179 KEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCF 219
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 102/448 (22%), Positives = 181/448 (40%), Gaps = 70/448 (15%)
Query: 68 REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
RE V+ E D+ +L W+ G K++L++ + + R +H +L G+
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158
Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
+P + ++T+E +G +T I + + + + +A L LYL+ + ++ +F+ Y
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P+ WS+ EL++L GS +I ER I+++Y+ +
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA------ 264
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
D F+ E F A + V S L + + ALVP L Y + +
Sbjct: 265 -------DPSFSRFSLERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETS 316
Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
D + +G + Y G + N + L+NYGF EDN
Sbjct: 317 WTFDQKLDGFTITSLESICSGAQVYDSY-----------GKKCNHRFLLNYGFAVEDNTE 365
Query: 360 ------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
+ ++V+ L+ D Q ++ + L + R + SD P R
Sbjct: 366 EDGSNPNEIMVDFQLDPGDGQ-----LLYDKTAYLYESGIYTMNARLSCSHSD--PSTRE 418
Query: 414 GY-----VSDTSEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSED 465
G+ ++ T + S + P P+S E A L+ L +L YP +L E
Sbjct: 419 GFSFARLIAATEDEFSSMKMRSPAHASPPISFRNEIAALNLLKQLMDTQLDQYPTSLDEG 478
Query: 466 EAML--TDYNLHPKKRVATQLVRMEKKM 491
EA+L +Y L+ + A +R EK++
Sbjct: 479 EAILKSKEYPLYSNRIQALFFIRGEKQV 506
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 103/461 (22%), Positives = 183/461 (39%), Gaps = 86/461 (18%)
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LS 155
N ++ A ED+ A F++P ++ +E + I + +K L
Sbjct: 38 NAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLD 97
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
+ L L ++YE QG+ S W PY L + ++P+ WS+ EL L S +
Sbjct: 98 SWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRH 150
Query: 216 EILE-RAEGIKRE------------YN-------ELDTVWFMAGSLFQQYPYDIPTEAFT 255
+I + AE + ++ +N EL + GS Y +D+ +
Sbjct: 151 KIGKADAENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLENDE-- 208
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ V S+ +VP+ +L ++ A + ++++ + R
Sbjct: 209 -----EEEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLR 259
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAA 367
P KAGE I+ +Y GP PNS+LL YG+V E + P+D +VE+A
Sbjct: 260 PIKAGEEILNYY-----------GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESA 306
Query: 368 L--NTEDPQYQDKRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
L N P +++ Q G L V G + P D
Sbjct: 307 LTSNFGIP----GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPED 362
Query: 419 TSE-MQSVISSLGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYN 473
E +++ + + P + +R + Q LA +A +A YP ++SEDE +L +
Sbjct: 363 LQEQLKTFLKGIKKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLKQ-D 421
Query: 474 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
L+ + R+A + EKK+L + ++ + + D P
Sbjct: 422 LNQRTRMAIAVRLGEKKLLQEAITASSGDVEMTMDDESGPA 462
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 119/290 (41%), Gaps = 52/290 (17%)
Query: 78 LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+G+L SW+ + G P V+ L P+ + +++AGD +P++L++T
Sbjct: 21 IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74
Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ V EL ++ LS L ++++ E+ G+KS W PYI +
Sbjct: 75 DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI------------ 117
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P ++ + P + A+ I R E V+ F+ D+
Sbjct: 118 -PDVFDGLQCRKSVRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----N 166
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
FE F A+ AV + ++++ L P LL + K + V++ + +
Sbjct: 167 FETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNV 221
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
Y+AG + + Y G N L +NYGFV ++NP D + VE
Sbjct: 222 GYEAGSEVFIGY-----------GSHDNRTLFLNYGFVLDENPNDCITVE 260
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 99/224 (44%), Gaps = 34/224 (15%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
ER + + +Y D+ + G+ +P E +T+E++ A + S + +S
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211
Query: 279 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
A + AL+PL L + K A D + L+V + AG+ I Y
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNY-- 268
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
GP+ N +LLINYGF NP D +V + + P
Sbjct: 269 ---------GPRNNEQLLINYGFCIAGNPTDYRIVHLGVKPDSP 303
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 97/443 (21%), Positives = 177/443 (39%), Gaps = 51/443 (11%)
Query: 88 NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
+ LPP L+ +H+ K P+ V A++ + + FS+P L+++ E
Sbjct: 75 DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131
Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ TI + + LS++ +AL + + + KSFW PY+ L + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPYDIPT 251
+ ++ E+ L GSP + L I R+Y+ V +L + Y+
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYR 243
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
A + + +Q + Q + K L AL+PL +
Sbjct: 244 WAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPLWDFCNHQDGQFSTEFQLESRRTVC 303
Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
R + GE + ++Y G + ++ I+ GFVD +N +D L ++ L+
Sbjct: 304 QAGRDFGPGEQVFIFY-----------GTRTCAEQFIHNGFVDINNAHDALTLKVGLSKS 352
Query: 372 DPQYQDKRMVAQRNGKLSVQ------VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
DP + + + LS + F + AG + +L +LRL ++ S + +
Sbjct: 353 DPLAGQRATLLCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLRLFCMTKDSLDRWL 411
Query: 426 ISSLGPICPVSPC-MERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRV 480
S C +E V D+ + KAR L YP T D ML + +L +R+
Sbjct: 412 QSDNASNLMHEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLKMLEE-DLSSHRRM 470
Query: 481 ATQLVRMEKKMLNACLQVTADMI 503
L EK++L + ++ A I
Sbjct: 471 CVLLRLAEKRILLSAIECAAQRI 493
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 123/313 (39%), Gaps = 62/313 (19%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G I E + + R V AS+D+ + F +P + ++++E +L
Sbjct: 13 AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S ++ ERA+ +E L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
M GSL Y +D+ T E+ ++ A + L K +VPL L
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
A + +C A L D +++ +P +AGE I Y GP P S LL YG+
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDY-----------GPLPRSDLLRRYGY 281
Query: 353 VDEDNPYDRLVVE 365
V DN VVE
Sbjct: 282 VT-DNYAQYDVVE 293
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 132/320 (41%), Gaps = 43/320 (13%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK+ K LKEK + K RP + + ++ LQ G+ S+P ++T + V
Sbjct: 29 HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + T +S L L +L+ EK +KS W PY+ L + S
Sbjct: 89 L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + ++ + +F LF P D+ + F ++
Sbjct: 140 PVCLEQKIVNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQ 195
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 314
FK A+ + + V H Q+ +R L P L + + A +
Sbjct: 196 AFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKA-------GFN 248
Query: 315 RPYKAGESIVV----WYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
K E V YN F + GP N +LL+ YGFV DNP+ + V
Sbjct: 249 EKTKCYEITTVTQCHHYNEVFICY----GPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLL 304
Query: 371 EDPQYQDKRMVAQRNGKLSV 390
++ +DK QR KLS+
Sbjct: 305 KNVFPEDK----QRPKKLSI 320
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/454 (20%), Positives = 177/454 (38%), Gaps = 90/454 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A D+ A F++P + +E + I ++ +K L + L L +
Sbjct: 43 IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE +G S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155
Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R +EL + GS Y +D+ + E ++
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 370
+Y GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 265 NYY-----------GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 311
Query: 371 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 420
+ +++D ++ + G+++ + + D+ L+ S
Sbjct: 312 ILKQIRGEFEGEEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLK 367
Query: 421 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
++ V S P + +AVL + + AR YP ++SED+ +L NL + R+
Sbjct: 368 GIKKVQSDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARM 424
Query: 481 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
AT + EKK+L + ++ + + D P
Sbjct: 425 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 458
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 113/257 (43%), Gaps = 29/257 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G++
Sbjct: 64 ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGER 123
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PY+ + P+ W E E+ +L +P + + LE+ K E EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KTEVQELHT 171
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+ Q D + +T++ + A+ V + V+++ R LLA
Sbjct: 172 ESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDR----------LLA 221
Query: 294 YSSKCKA--MLAAVDDAVQLVVDRPY-KAGESIVVWYNISFTGHD--FKC-GPQPNSKLL 347
C L ++ + ++ V+ + K + N HD F C GP N +LL
Sbjct: 222 QQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLL 281
Query: 348 INYGFVDEDNPYDRLVV 364
+ YGFV +NP+ + V
Sbjct: 282 LEYGFVAANNPHRSVYV 298
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 92/387 (23%), Positives = 163/387 (42%), Gaps = 33/387 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E I + + +L + L+L+ E+ + +
Sbjct: 84 LVALKDISRNDVILQVPKRLWINPDAVAASE-IGRVCS--ELKPWLSVILFLIRERSR-E 139
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 140 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 192
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + L A V
Sbjct: 193 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 252
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
AY K A L + D L KAGE + + Y+++ + N++L ++Y
Sbjct: 253 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLN----------KSNAELALDY 300
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
GF++ + + ++ DP + DK VA+ NG F + R +LPY
Sbjct: 301 GFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPY 358
Query: 411 LRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
LRL + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 359 LRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQD 418
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
L + NL + +A + EK +L
Sbjct: 419 RE-LKEGNLDSRLAIAVGIREGEKMVL 444
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 122/260 (46%), Gaps = 38/260 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP S + + +L +++LL +++ +A LA+ L+ ++K G++S
Sbjct: 51 ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q G + S + WS++EL + S E +++ I++++ + V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ +P + + TF+ F A+ V+S + + +L+P L +
Sbjct: 163 -------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNH 207
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
+A++ +D ++ DR Y E +++ Y G N+ LL+++GF
Sbjct: 208 DGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRY-----------GKFSNATLLLDFGF 256
Query: 353 VDEDNPYDRLVVEAALNTED 372
N +++ VE +N D
Sbjct: 257 SLPYNIHEQ--VEIQINIPD 274
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 118/260 (45%), Gaps = 31/260 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DLQ G+ S+P+S ++T E VL + + T + +S L L +L+ E+
Sbjct: 62 LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
++S W PY+ L + P+ W E+E+ L +P + + LE+ +K + E
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTE- 172
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
W SL + +I T+ +T+ + A+ V + V H ++ L+ + + +
Sbjct: 173 --SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMA 229
Query: 289 PPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD--FKC-GPQPNS 344
P L L S + AA ++ + R N HD F C GP N
Sbjct: 230 PYLDLLNHSPAVQVEAAFNEERRCYEIRT-----------NSGCRKHDQAFICYGPHDNQ 278
Query: 345 KLLINYGFVDEDNPYDRLVV 364
+LL+ YGF+ +NP+ + V
Sbjct: 279 RLLLEYGFIAANNPHRSVYV 298
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 93/422 (22%), Positives = 172/422 (40%), Gaps = 49/422 (11%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A + L +G +P L + +E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SF+ P+I L + P+ WSE+EL L G+ + ++R++
Sbjct: 95 RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQLQRDFEN 147
Query: 231 LDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ ++ + +P +PT T E + A + S + + R L P
Sbjct: 148 IHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR-VLCP- 197
Query: 288 GPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
+ + + +L + D+ Q++ V + AG ++ + Y G
Sbjct: 198 AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISY-----------GQY 246
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-E 400
N+KLL +YGFV ++N + + DP + K+ V N Q + E
Sbjct: 247 SNAKLLFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFE 306
Query: 401 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 460
+ +L LR+ +++ E++ + +S E AV + L + + +LA +
Sbjct: 307 NDVDERLLATLRVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFAT 364
Query: 461 TLSEDEAMLTDYNLHPKKRVATQL-VRMEKKM--------LNACLQVTADMIMLLPDVTV 511
TL EDEA+L + R++ + VR+E K L QV A + + P T
Sbjct: 365 TLEEDEAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTT 424
Query: 512 SP 513
P
Sbjct: 425 RP 426
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 123/301 (40%), Gaps = 37/301 (12%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+E L WM ++G CK + ++ R + ++L+ GD+ +P L+V
Sbjct: 40 DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95
Query: 135 TLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + +L E + ++ K + + +L+ E+ +GK SFW PYI L +
Sbjct: 96 TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148
Query: 193 AVESPLLWSETELAYLTGSPT--KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+P S T+ A PT ++ + + + I+ + ++ F QY
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIFFS 206
Query: 251 TEAFTFEIF----KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--SKCKAMLAA 304
++F + F + ++ C K S FAL P LL +S ++ A
Sbjct: 207 LDSFVWAWFVINSRSVYIEPSGCEAFDPKAS--DDFALAPFLD-LLNHSPGAEVTAGFDP 263
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
V + ++ Y A + + + Y GP N LL+ YGFV NP+D +
Sbjct: 264 VSNCYRIKTLDSYHAYDQVFIHY-----------GPHDNVNLLLEYGFVIPSNPHDAVSF 312
Query: 365 E 365
E
Sbjct: 313 E 313
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/318 (26%), Positives = 135/318 (42%), Gaps = 70/318 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
V A D+ AG +VP +L++T E+ +ET +L+T+ L +EL+ L
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
++L+ E+++G S+W PYI +L QR R PL ++E EL L SP E +
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+ R+Y + ++QY E+ + A + S H + + RR
Sbjct: 282 RNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQRRR 323
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGE----SIVVW--YNISFTG 333
+P+ L Y C A A Q+VV R +A E I W N +F G
Sbjct: 324 AQAMPVRYNELTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGG 383
Query: 334 ---------HDFKC-GPQPNSK---LLINYG------------FVDEDNPYDRLVVEAAL 368
+C P +K +L++YG FV D+P D+ ++ A+
Sbjct: 384 ASSFYDTQTRSLECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILLAV 443
Query: 369 NTEDPQYQDKRMVAQRNG 386
+DP ++ K + Q G
Sbjct: 444 GKQDPLFKSKSTILQALG 461
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 156/390 (40%), Gaps = 66/390 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
VAA +D+ + FS+P S ++++E + + I T L L L ++YE G
Sbjct: 40 VAAKQDIAEHELLFSIPRSSILSVENSILSTEIPPT-TFALLGPWLSLILVMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E EL L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
ERA+ + +E + + M GSL Y +D+ A + + +
Sbjct: 152 VIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
A + L K +VPL L A + +C A L D +++ +P +AGE I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEI 263
Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDK 378
Y GP P S LL YG++ E+ YD + + A L ++ D + +K
Sbjct: 264 FNDY-----------GPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEK 312
Query: 379 RMVAQRNGKLSVQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 435
R+ ++ + + A +E++S L L + + E + + S G + P
Sbjct: 313 RIEYLDEQEIVDTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PK 370
Query: 436 SPCMERAVLDQLADYFKARLAGYPATLSED 465
+ M + L +AR+A YP TL +D
Sbjct: 371 AEKMTGKAAEILYKIVQARIAQYPTTLEQD 400
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 161/387 (41%), Gaps = 33/387 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E ++L + L+L+ E+ + +
Sbjct: 40 LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 95
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 96 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 148
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + L A V
Sbjct: 149 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 208
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
AY K A L + D L KAGE + + Y+++ + N++L ++Y
Sbjct: 209 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLN----------KSNAELALDY 256
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
GF++ + + ++ DP + DK VA+ NG F + R +LPY
Sbjct: 257 GFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPY 314
Query: 411 LRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
LRL + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 315 LRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQD 374
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
L + NL + +A + EK +L
Sbjct: 375 RE-LKEGNLDSRLAIAVGIREGEKMVL 400
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 69/312 (22%), Positives = 130/312 (41%), Gaps = 41/312 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ +P+ + +L L GSP L+ I R+Y ++ ++ P
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSND-----PA 240
Query: 248 D-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
I + FT+E ++ A ++ Q+ V S AL+PL S +
Sbjct: 241 SVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSALIPLWDMFNHRSGRLSTDFV 299
Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
+ D Y A E + ++Y G + N+ L++ GFV DN +D +
Sbjct: 300 KSSNVCVCYADGDYAADEQVYIFY-----------GVRTNADFLVHNGFVYPDNEHDAVK 348
Query: 364 VEAALNTEDPQY 375
+ ++ DP Y
Sbjct: 349 IRLGVSRSDPLY 360
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 65/257 (25%), Positives = 111/257 (43%), Gaps = 29/257 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 64 ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGDC 123
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PY+ + P+ W E E+ +L +P + + LE+ K E EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KTEVQELHT 171
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
S Q D + +T++ + A+ V + V+++ R LLA
Sbjct: 172 ESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDR----------LLA 221
Query: 294 YSSKCKA--MLAAVDDAVQLVVDRPY-KAGESIVVWYNISFTGHD--FKC-GPQPNSKLL 347
C L ++ + ++ V+ + K + N HD F C GP N +LL
Sbjct: 222 QQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLL 281
Query: 348 INYGFVDEDNPYDRLVV 364
+ YGFV +NP+ + V
Sbjct: 282 LEYGFVAANNPHRSVYV 298
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 161/387 (41%), Gaps = 33/387 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E ++L + L+L+ E+ + +
Sbjct: 36 LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 92 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 144
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + L A V
Sbjct: 145 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 204
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
AY K A L + D L KAGE + + Y+++ + N++L ++Y
Sbjct: 205 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLN----------KSNAELALDY 252
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
GF++ + + ++ DP + DK VA+ NG F + R +LPY
Sbjct: 253 GFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPY 310
Query: 411 LRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
LRL + T ++S+ + G + VS E + + + K+ LAGY T+ +D
Sbjct: 311 LRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQD 370
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
L + NL + +A + EK +L
Sbjct: 371 RE-LKEGNLDSRLAIAVGIREGEKMVL 396
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 117/286 (40%), Gaps = 54/286 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ASED++ + F +P S +++E + I ++ KL+ L LY+M K G
Sbjct: 42 VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
+ W PY L Q ++S ++W++ EL L GS +I G + +Y E
Sbjct: 97 MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQEKL 147
Query: 231 -------------LDT---VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
DT + G L Y +D P ++F+ + L
Sbjct: 148 KPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDEDIEHDDLY 202
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWYNISFTG 333
L + A+VPL L A++ C A L A DD + +P K GE + Y
Sbjct: 203 NEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYNTY------ 254
Query: 334 HDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 378
G PN L YG+V +E +D +VE +++ Y +K
Sbjct: 255 -----GELPNCDFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 110/446 (24%), Positives = 192/446 (43%), Gaps = 51/446 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L L +W+ + GLP K+ ++ + + + S+ + G +VP+S +T
Sbjct: 50 ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
++V+ + +I L+ L +AL+L++E+ + + W ++ + A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
PL WSE EL+ L G+ + + + + +Y EL+ LF + P ++
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQ 207
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAV 309
+ F A V+S V ALVPL L+ + ++ + LA A
Sbjct: 208 LDDFLWAVATVRSRV---HSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQ 263
Query: 310 QLVVD--RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY-DRLVVEA 366
LVV+ R Y GE + + + T D + + +S++L++YG +D D P D VV+
Sbjct: 264 ALVVEAQRDYAEGEVVTMDFGAPLTEEDQE---KLDSQVLLDYGALDADRPQADPGVVQG 320
Query: 367 ------ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 420
AL +D Y DK + + NG F + A E L D
Sbjct: 321 GFILSLALPEDDKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAF 380
Query: 421 EMQSVISSLG---PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
++ + + + PVS ERAV + + + +A L GY ++ +D L D P
Sbjct: 381 LLEPLFRNEAWGHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPG 438
Query: 478 KRVATQ-LVRM-EKKMLNACLQVTAD 501
R+ LVR+ EK+ L+A L D
Sbjct: 439 TRLEKAILVRLGEKETLDATLAFFED 464
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 82/380 (21%), Positives = 157/380 (41%), Gaps = 62/380 (16%)
Query: 81 LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L+ W+ + +G+ KV L+ P V A+ L+ G+ F +P S + E V
Sbjct: 29 LRIWLEEEHGVDMSKVDLQRSPLEG------LGVFANRRLEPGETLFMIPKSCCIYPELV 82
Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + + + KL+ E+ LA +L EK +G +S + P+I L
Sbjct: 83 FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PLLW++ E+ L G+ EIL E ++ + V G ++Q+ I TE
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKG--WKQFFQTIETE 191
Query: 253 AFTFEIF----KQAFVAVQS----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
T E F + AF +V S + L R + P L ++
Sbjct: 192 KMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSIT 247
Query: 305 VDDAVQLVVDRPY-----------KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
D A++ ++ + + GE + +Y G +PN +L YGFV
Sbjct: 248 FDTALERDNEKGFPVRVADKGKSIEEGEELFGFY-----------GDKPNWNMLTTYGFV 296
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLR 412
+ + +++ +DP + K + + G ++V Q+F + + + + ++ Y R
Sbjct: 297 SPNPKCQETTLSVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR--HDTDPMGPLINYFR 354
Query: 413 LGYVSDTSEMQSVISSLGPI 432
+ +S+ +++ V ++ G +
Sbjct: 355 IREISNEADLTKVQTNYGEM 374
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 98/208 (47%), Gaps = 29/208 (13%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + +KAGE I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 9 DDRCECVALQDFKAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 57
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D Y K V R G + VF +HA E + +L +LR+ +++ + +
Sbjct: 58 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHL 116
Query: 426 IS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDY 472
I +LG PVS E +L + +AR L Y T+SED+A L
Sbjct: 117 IGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQ 172
Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
+L +A +L EK++L ++ A
Sbjct: 173 DLTCNATMAIKLRLGEKEILEKAIKSAA 200
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 125/313 (39%), Gaps = 51/313 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARRFALV 285
+ G + Q Y+ FT F+ + V +S +++++ S F+++
Sbjct: 177 ILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDDFSIL 236
Query: 286 PLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
PL ++ + L + A +L+ + Y G+ + F + FK
Sbjct: 237 ---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQV-------FNNYGFKT-- 284
Query: 341 QPNSKLLINYGFV 353
NS+LL+ YGF+
Sbjct: 285 --NSELLLGYGFI 295
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 95/405 (23%), Positives = 159/405 (39%), Gaps = 70/405 (17%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL---LTT-----NKLSELA 158
HR + + A+ED++ G+ F+VP L++ + E L T N S
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKWLDTHGKSLNDSSGWL 184
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEI 217
L + LM+E Q K SFW Y+ + G PL W E E G P +I
Sbjct: 185 PLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDI 238
Query: 218 LERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 270
+ E I+ EY E ++ + GSL E ++ E FK+ AFV S
Sbjct: 239 IVDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFT 288
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+ S +VP+ +L + S A L +Q++ R K GE + +
Sbjct: 289 EDEESPS------MVPMA-DILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTF--- 338
Query: 331 FTGHDFKCGPQPNSKLLINYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDK 378
G N++LL YG+V+ N YD L++ + +DP K
Sbjct: 339 --------GKLGNTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAK 390
Query: 379 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS-- 436
+ R G V F + D++ +L++ + SD E++ ++ + P S
Sbjct: 391 INLLNRTGIAEVDAFFMFDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFY 449
Query: 437 -PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
+ + L + + K L ++ED+ + N + +K V
Sbjct: 450 HKLLRKLRLSKKTE--KNSLGMTVIDITEDDTEMDIENFNKRKNV 492
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/318 (23%), Positives = 123/318 (38%), Gaps = 61/318 (19%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
+ G + Q Y+ FT F+ + V +S ++++ L+P G
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER--------LLPEGT 228
Query: 290 ---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
PL ++ + L + A +L+ + Y G+ + F +
Sbjct: 229 KIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQV-------FNNYG 281
Query: 336 FKCGPQPNSKLLINYGFV 353
FK NS+LL+ YGF+
Sbjct: 282 FKT----NSELLLGYGFI 295
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 90/204 (44%), Gaps = 18/204 (8%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
L+P +L + K A V +V AG+ I Y GP
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNY-----------GP 293
Query: 341 QPNSKLLINYGFVDEDNPYDRLVV 364
+ N +L++NYGF +NP D +V
Sbjct: 294 RNNEQLMMNYGFCLPNNPCDYRIV 317
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/414 (23%), Positives = 165/414 (39%), Gaps = 57/414 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+K+ E L + K W+ N P + L +K + + A + ++ D S+
Sbjct: 34 TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85
Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
P ++++ + G + I E + + LS A+++MY K +KSFW PY+ L
Sbjct: 86 PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
+ + L +S+ EL L S K + R +GI+R Y S F
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHYE----------STFS 185
Query: 244 QYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYS-SKCKA 300
+ +P E ++ Q F SCV + SLA +VPL A SK K
Sbjct: 186 RLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDGGMVPLADMFNAEDRSKSKV 243
Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPY 359
+ D + GE I Y + P +S++L++YGF+ DE
Sbjct: 244 LPKVTDTTLDYYASDDIAEGEQIFTPYGV--------YKPLSSSQMLMDYGFIFDEGTVS 295
Query: 360 DRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
D + + + ++P K+ + + N ++ +VF + A D+L Y R+ +
Sbjct: 296 DNVAITVPVFHNDEPNLSTKQEILEENDIIN-EVFLLQKTDPLPA--DLLLYARVKNLIA 352
Query: 419 TSEMQSVISSLGP---ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
Q+ L P P++ E+ L L + L Y L D+ +L
Sbjct: 353 KECDQAKKHFLSPNTRNTPLNTRNEKVSLRFLENLIHRYLDSYGTNLESDKNLL 406
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 116/288 (40%), Gaps = 35/288 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L +P + T++ F A S + + S LVP+ L
Sbjct: 265 ------LCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 318
Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
S VD+A ++ + RP AGE + Y G P S LL YG
Sbjct: 319 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSY-----------GKHPGSHLLTFYG 367
Query: 352 FVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
F+ DNPYD + ++ D D+ + AQ + S Q H+ G
Sbjct: 368 FLPRGDNPYDVIPLDL-----DTSADDEDITAQSSATTS-QTTHMVRG 409
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 90/204 (44%), Gaps = 18/204 (8%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
L+P +L + K A V +V AG+ I Y GP
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNY-----------GP 283
Query: 341 QPNSKLLINYGFVDEDNPYDRLVV 364
+ N +L++NYGF +NP D +V
Sbjct: 284 RNNEQLMMNYGFCLPNNPCDYRIV 307
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 99/408 (24%), Positives = 164/408 (40%), Gaps = 64/408 (15%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
FS+P S V L + L + +L+E + LAL ++ + Q S W PY+
Sbjct: 55 FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-----------EGIKREYNEL 231
L Q + S + WS++ELA L S +I ++ +G++ E+
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
S+ Y +DIP + + A V +K L+ ++PL
Sbjct: 162 ---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEKTILS----MIPLA 214
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
L A + + A L ++ +++ +P GE I Y G P S LL
Sbjct: 215 DMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDY-----------GQLPRSDLLR 263
Query: 349 NYGFV-DEDNPYDRLVVEAAL----------NTEDPQY-QDKRMV----AQRNGKLSVQV 392
YG+V D + YD + A L + P+ QDK V A+R G
Sbjct: 264 RYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDESF 323
Query: 393 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 452
VH+ ++ +I D L + D S +++++ S + S LA +
Sbjct: 324 DLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAILLQ 383
Query: 453 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
AR Y T+ EDE +L + +L + +A Q+ EKK+L A ++ A
Sbjct: 384 ARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREAA 431
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/263 (25%), Positives = 106/263 (40%), Gaps = 45/263 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
+ AS+D+ + F +P S ++ N T ++L T KL EL+ L + +
Sbjct: 90 IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142
Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 215
MYE K + +S W Y L L + W++ EL++LT G
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197
Query: 216 EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+ R E+NE D + GS+ F P I +F EI
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
++ +++PL L A + KC A L D+++++ +P K GE + Y
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTY---- 312
Query: 332 TGHDFKCGPQPNSKLLINYGFVD 354
G PNS+LL YG+V+
Sbjct: 313 -------GELPNSELLRKYGYVE 328
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 97/440 (22%), Positives = 174/440 (39%), Gaps = 84/440 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A ++Q G+ F +P+ +V+ ++ N+ +A+ L L L + ++YE G+
Sbjct: 50 VVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKN--LGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
+S W Y + L + ++ + WSE E + L S + +I E+
Sbjct: 108 QSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLP 160
Query: 224 IKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R + +L DT + GSL Y +DI + ++
Sbjct: 161 LVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYL 220
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
L K +VPL L A + + A L + A+ + +P KAG+ I
Sbjct: 221 TDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIF 271
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQ 374
Y G P S LL YG+V DN VVE L N ED +
Sbjct: 272 NDY-----------GELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKE 319
Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLG 430
Y +++ Q ++ + + ++ + D+LP L D+ E+Q ++S
Sbjct: 320 YPQLKLLDQL--EILEDGYCILRPSPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNK 377
Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY-------NLHPKKRVATQ 483
P+ E + L D ++++ Y T+ ED+ +L + ++ +A Q
Sbjct: 378 HPKPILGAREARI---LLDAAQSKMGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQ 434
Query: 484 LVRMEKKMLNACLQVTADMI 503
+ EK++L A L + D +
Sbjct: 435 VRVGEKEILQALLMMLQDFL 454
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 123/289 (42%), Gaps = 48/289 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
+ A ED++ G+ +P++ ++T+ER + + +L E + LA +L + +
Sbjct: 25 LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82
Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
G +S + Y++ L R+ G L W E ++ L GSP++ ER +
Sbjct: 83 NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
E+ ++P P + AF + S ++ L + ALVP
Sbjct: 136 IEEIRA----------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPW 177
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+L + C A + V L DR YK GE + Y G +P+++LL
Sbjct: 178 A-DMLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASY-----------GQRPSAELL 225
Query: 348 INYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVA-QRNGKLSVQVF 393
I+YGF E +NP D + ++ D +Y D + A ++ G V+ F
Sbjct: 226 ISYGFAPEVGENPDDEYEITLGIDPND-RYADAKAAALEKIGLRPVESF 273
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 84/185 (45%), Gaps = 9/185 (4%)
Query: 315 RPYKAGESIVVWYNISFTGHDFKCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
R A + +VVW G + +PN +LL+ G + ++N D L A L D
Sbjct: 166 RAAGARKGVVVWDG---AGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADR 222
Query: 374 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 433
Y K V + G + + F V+A R +L YLRL V+D + + +
Sbjct: 223 YYMMKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADV 277
Query: 434 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
+S E +L L + RLA Y + ED + +L PK+R+A +L EK+++N
Sbjct: 278 ELSQMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIIN 337
Query: 494 ACLQV 498
A ++
Sbjct: 338 ATMEA 342
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 122/281 (43%), Gaps = 34/281 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
+Q G S+P S ++T VL + + L + K S L L ++L+ E+ +G+ S W
Sbjct: 68 IQPGGMLVSLPESCLLTTSTVL-HSYLGPFLKSWKPRPSSLVALCVFLVCERHRGEASDW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDTVW 235
PYI L + P +++T +A L +G +AE E+ EG++ Y + +
Sbjct: 127 FPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQHLY-AVHQDF 176
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS 295
FM+ +P P E T+E + A+ C ++ + V + R + GP A +
Sbjct: 177 FMSLQPVLSHP---PEEVLTYEALRWAW-----CSINTRSVFMDRPSSSFLSGPDNYALA 228
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGE----SIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
+ D V+ +R E S V Y+ +F + G N +LL+ YG
Sbjct: 229 PFLDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINY----GSHDNQRLLLEYG 284
Query: 352 FVDEDNPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKL 388
FV NP+ + VE L E D +K + NG L
Sbjct: 285 FVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/406 (22%), Positives = 165/406 (40%), Gaps = 49/406 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+++ + + +L + LS LALYL++ K
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R L + R +A + + +++ EL GS A + + +
Sbjct: 87 ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L +L Q+ P + FT E +K A ++ S + A +
Sbjct: 137 DDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSETASVRLVA 190
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
PL +L +S +C A D + ++ + Y+ G+ I ++Y G P
Sbjct: 191 PLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYY-----------GSVP 237
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N++LL YGFV DNP D + + P Y+ K + G S + K+
Sbjct: 238 NNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KD 294
Query: 403 AI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYP 459
+ +++L YLR+ + D S + + L G V+ E VL L D + L G+
Sbjct: 295 PLPNNVLRYLRIQRL-DESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFG 353
Query: 460 ATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
L + EA L DY A + E+++L + D++
Sbjct: 354 IPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVLTRAKKTAEDLL 399
>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
Length = 658
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 26/143 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
V+A ++ A ++PN L+++ +VL +E ++++ T+K +E CLALY
Sbjct: 48 VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
+ Y K QG KSFW PY+ +++ + W +L L E +
Sbjct: 107 IFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLIDEFMYIQ-- 155
Query: 224 IKREYNELDTVWFMAGSLFQQYP 246
+E+D W+ L +YP
Sbjct: 156 -----SEMDKSWYKFKGLMNKYP 173
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 109/261 (41%), Gaps = 29/261 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
+ AS +++ G+ VP+ V+ E VL E + + ++ E+ L + +M+E
Sbjct: 68 LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ +G +S W PY+ L PL W E L G+ ++L RA+
Sbjct: 128 RWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDA 181
Query: 228 YNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ +W + G ++P +P +E+++ A AV S L + A+V
Sbjct: 182 PTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQAMV 238
Query: 286 PLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P+ L + L + +Q++ R AG +V Y G N
Sbjct: 239 PVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNY-----------GELSN 287
Query: 344 SKLLINYGFVDEDNPYDRLVV 364
++LL YGFV+ N Y+ + V
Sbjct: 288 AELLRGYGFVERANRYNHIPV 308
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/405 (22%), Positives = 169/405 (41%), Gaps = 54/405 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGK 172
A+ DLQ +VP L++++ + + + L + LAL+++ E+ +
Sbjct: 26 ATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLCERLH-E 84
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L R + + L +S ++ L GSP+ E L++ GI ++Y
Sbjct: 85 DSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY---- 133
Query: 233 TVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
V+F +P + +FTF+ F+ A V + ++ S AL+ P+
Sbjct: 134 -VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALI----PM 188
Query: 292 LAYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ C DD+ V+ + +P +AG+ + ++Y G + N+ L
Sbjct: 189 WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFY-----------GRRNNADRL 237
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-D 406
+ GFV + D + ++ ++ D Y K + G L R E IS +
Sbjct: 238 FHNGFVYTEAEEDWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPISPE 296
Query: 407 MLPYLRLGYVSDTSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR---- 454
+ +LR+ + +T E++ ++ L +C + +L +F R
Sbjct: 297 LRIFLRV-FSMNTGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLI 355
Query: 455 LAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 497
L Y T EDEA+L+ D LH R +L E+ +L + L+
Sbjct: 356 LGQYKTTKQEDEALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 132/324 (40%), Gaps = 57/324 (17%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
L + W+ N + P ++ + EK + + A + ++ + SVP +++
Sbjct: 35 LNEFNKWLINNKVYKNPKIEIKVLEKYGRS--------IVAKQSIKKNEKLISVPKLIIM 86
Query: 135 T----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+ L NE I E + +S A++LMY K KSFW PY+ L ++
Sbjct: 87 SNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKE---- 140
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ + +SE EL L S K + R +GI+R YN T G + + P
Sbjct: 141 ---FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHYNSTFTRLSNRG-IAEFSPTSTQ 196
Query: 251 T---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL-------VPLGPPLLAYS-SKCK 299
T + +T E+F A SCV +R F+L VPL A SK K
Sbjct: 197 TLQQKGYTLELFTWAL----SCV-------WSRAFSLSDSDGGMVPLADMFNAEEISKSK 245
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNP 358
D + + GE I Y + P +S++L++YGFV D P
Sbjct: 246 VQPKVTDSTLDYYASDDIEIGEQIFTPYGV--------YKPLSSSQMLMDYGFVFDHGTP 297
Query: 359 YDRLVVEAAL-NTEDPQYQDKRMV 381
D + + + + ++P Q K+ +
Sbjct: 298 SDNVAISVPIFHPDEPNIQVKQSI 321
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/406 (22%), Positives = 165/406 (40%), Gaps = 49/406 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+E+ + + +L + + LS LA+YL++ +
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS A + + ++
Sbjct: 87 ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L L Q+ P + FT E +K A ++ S + VS LV
Sbjct: 137 DDYRQLLV------PLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDF-AVSGTTSVRLV 189
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
+L +S +C A D + ++ + Y+ G+ + ++Y G P
Sbjct: 190 APLADMLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQVFIYY-----------GSVP 237
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N++LL YGFV DNP D + + P Y+ K + G S + K+
Sbjct: 238 NNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KD 294
Query: 403 AI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYP 459
+ +++L YLR+ + D S + + L G VS E VL L D + L G+
Sbjct: 295 PLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGSLLEGFG 353
Query: 460 ATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
L + EA L DY A + E+++L + D++
Sbjct: 354 IPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVLTRAKRTAEDLL 399
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 97/471 (20%), Positives = 175/471 (37%), Gaps = 69/471 (14%)
Query: 65 AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
A S ++V E L + W+ G CKV + S + A++D++
Sbjct: 56 ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
G+ +P++L +T +L ++L + L+++L+ E S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELSN-PTSEWGP 168
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ L +Q + W E GSP + G R+Y ++
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY------CYLY 215
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLGP--PL 291
+ + +P +FT++ F A VQS Q A AL+P
Sbjct: 216 SMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFWDFCNH 271
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
+ SK + + + + +K GE + ++Y GP+ N++LL++ G
Sbjct: 272 SSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFY-----------GPRDNTQLLMHAG 320
Query: 352 FVDEDNPYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
F + N +D E L + ++ ++ +R + V V E +++P
Sbjct: 321 FATKTNLHDSYPFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIP 380
Query: 410 YLRLGYVSDTSEMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFK 452
+ R+ Y E +++ L P+ ++ E L K
Sbjct: 381 FYRI-YALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALK 439
Query: 453 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
+LA YP TL EDE L N +R EKK+L+ ++ +I
Sbjct: 440 GKLASYPTTLEEDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 131/334 (39%), Gaps = 45/334 (13%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + LQ G
Sbjct: 22 SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
S+P S +++ + V+ + +T K S L L +L+ EK G +S W PY+ L
Sbjct: 76 SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P+ W E E+ L P +A+ E+ ++ + + LF +
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSS-- 296
+I FT+ A+ V + V+L+ L R F+ P L P LL +S
Sbjct: 187 AVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEPDTCALAPYLDLLNHSPDV 241
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
+ KA ++V + E + + Y GP N +LL+ YGFV
Sbjct: 242 QVKAAFNEKTRCYEIVAVSSCRKHEEVFICY-----------GPHDNHRLLLEYGFVSTR 290
Query: 357 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
NP+ + V + + DK+M N K+S+
Sbjct: 291 NPHACVYVSRDILVKYLPSTDKQM----NKKISI 320
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 80/202 (39%), Gaps = 23/202 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + + P W E ++ +L G+ I E
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168
Query: 222 EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 276
IKREY + V F + Q Y FT F+ + + QS ++ +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWYNISF 331
AR + PL ++ D + QL+ Y+ G+ + F
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQV-------F 281
Query: 332 TGHDFKCGPQPNSKLLINYGFV 353
+ FK NS+LL+ YGF+
Sbjct: 282 NNYGFKT----NSELLLAYGFI 299
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 103/234 (44%), Gaps = 35/234 (14%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
A +LM + + FW PYIR L G+ + +PL + E E L +L G + A
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161
Query: 219 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 271
ER I +R Y L + F + Y +D+ A T I +AF A + S +
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
L+ + R L+PL + A + K K A D++ LVV AGE +
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEV------ 268
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
G+++ GP+ N +L++NYGF DNP + VV + P Q K Q
Sbjct: 269 ---GNNY--GPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQ 317
>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
Length = 689
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
+ A +D+ A +PNS ++++ RV + + ++L+ ++ ++ CLA++LM
Sbjct: 74 IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
E QG++SFW PYI ++ ES LL W + E+ L + E +
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183
Query: 224 IKREYNELDTV 234
I+ E+N+L +
Sbjct: 184 IEDEWNKLSKI 194
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 81/190 (42%), Gaps = 25/190 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
W NG + L+ P + AA +D+Q G VP +++T +G
Sbjct: 11 WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61
Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+ LL + K E L+++L+ E G SFW PYI L R + + L
Sbjct: 62 KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+S EL LTGS E L+ I R+Y + F L + PY + FT+++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDL 170
Query: 259 FKQAFVAVQS 268
++ A AV +
Sbjct: 171 YRWAVSAVMT 180
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 21/159 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ +P S ++T +I+ L K+ + ++ L+YE G
Sbjct: 59 VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ L L+V+ P+LW++ +L L G+ + + E E + YN+
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK-- 165
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
++ L +P F+ + FK+A SC+V
Sbjct: 166 ---YIKSKLIANHPDVFNEHVFSLDNFKRA-----SCLV 196
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/241 (23%), Positives = 108/241 (44%), Gaps = 45/241 (18%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
+VP+ +L ++ A + +D + +V RP KAGE I+ +Y GP
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYY-----------GPH 118
Query: 342 PNSKLLINYGFVDEDN--------PYDRLVVEAALNTE---------------DPQ-YQD 377
PNS+LL YG+V + P+D +V++ L + DP+ ++D
Sbjct: 119 PNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFED 176
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
++ + +G+ + + +E +++ L+ + ++++ G + P
Sbjct: 177 VFVLERDSGEPDSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKR 229
Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
+ +A + LA YP T EDEA+L NL ++R+A ++ EK++L LQ
Sbjct: 230 KRDEVYQHVVAAALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQ 289
Query: 498 V 498
+
Sbjct: 290 M 290
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 76/393 (19%), Positives = 167/393 (42%), Gaps = 66/393 (16%)
Query: 133 VVTLERVLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ +L NE I E+ + +N + LA+ L+ E + KKSFW P
Sbjct: 102 LIASRNILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFP 161
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI L + L W EL ++ GS + +K +Y ++
Sbjct: 162 YIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNK----- 211
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
+FQ+ P++ F ++ ++ + + S + LQ+ ++ L+P LL ++
Sbjct: 212 -EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQE---TKKVVLIPY-IDLLNHNPFS 266
Query: 299 KAMLAA----VDDAVQLVV--DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
+ ++ + D+ ++VV D+ + + + Y G + N +LL YGF
Sbjct: 267 SSFISYRKIPLSDSKEIVVYSDKNCNKFDQLYISY-----------GQKSNLELLNLYGF 315
Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 412
+ E NPYD +++ +++ +D +++K+ N K + + + + +M+ +++
Sbjct: 316 IAERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIK 372
Query: 413 L----GYVSDTSEMQSVISSLGPICPVSPC----MERAVLDQLADYFKARLAGYPATLSE 464
+ ++D + + I + + C +E+++ DY R L E
Sbjct: 373 ICLFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDYENLR----NIMLKE 428
Query: 465 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
+ ++D ++++ + +EKK+LN L+
Sbjct: 429 NLLHISD-----NQKISIKYNALEKKILNRFLE 456
>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Galdieria sulphuraria]
Length = 546
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 88/383 (22%), Positives = 146/383 (38%), Gaps = 63/383 (16%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
E +L++W+ NG+P +K KP HN A L+ G+ ++P
Sbjct: 71 EKTEELENWLFDNGVPS----IKGKPVLSPHN-----CRTFRAKIPLKLGEEVLAIPERF 121
Query: 133 VVTL---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T E++LG + LS+ +A L+ E + + SFW P+I L
Sbjct: 122 WLTKQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDEL 174
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-D 248
L+WS E YL S T +IL E + EL+T LF ++ Y
Sbjct: 175 HHF-----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT------ELFPKFLYPQ 223
Query: 249 IPTEAFTFEIFKQAFVAVQSCVVHLQKVS-----------LARRFALVPLGPPLLAYSSK 297
+ FT F A VQS ++ S L +++L+ Y
Sbjct: 224 YDVKYFTLPYFTWALSIVQSFGLYDIMDSCPLVIVPGLEWLTYKYSLITEESFFRQYFHI 283
Query: 298 CKAMLAAV------DDAVQLVVDRPYKAGESIVVWY--NISFTGHDFKCGPQPNSKLLIN 349
L V + +++ K GE + + Y N+S + G + ++
Sbjct: 284 SNVSLIRVGPFFTQERRLKITASEDLKVGEPVSLVYEGNVSLIDTFCRWGWK------LD 337
Query: 350 YGFVDEDN--PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
G +DE+ + A+ T D + DK + +Q F + KE + +
Sbjct: 338 LGALDEEQLLKMGSYEISFAVTTTDQFFDDKEDILDAQRLELLQTFELRYDMSKELLQRI 397
Query: 408 LPYLRLGYVSDTSE--MQSVISS 428
LP+LRL + D ++SV S
Sbjct: 398 LPFLRLICLKDKDSFILESVFRS 420
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 71/306 (23%), Positives = 118/306 (38%), Gaps = 68/306 (22%)
Query: 70 VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
VV + D D SWM NG S +EK H +A L G
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562
Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
VV L+ + ET+ L + ++ A L+ EK +G S W
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
YI L + + P+L + EL + P E+++ + I+ ++ L +V +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670
Query: 238 AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
AG+ F++Y + + + AFT +F A + ++ ++ + + P+
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG- 729
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
V++V R K GE + SF GP+ N L + YGFV
Sbjct: 730 ------------GKVEIVARRDIKKGEELFA----SF-------GPRANDNLFLYYGFVL 766
Query: 355 EDNPYD 360
+DNP+D
Sbjct: 767 KDNPFD 772
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 94/406 (23%), Positives = 155/406 (38%), Gaps = 56/406 (13%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
A + VP SL++ L E + + K + A LAL++++E ++ SF
Sbjct: 4 ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY L + V P+ W++ ++ L GSP A +L + + + + E
Sbjct: 63 WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTE----- 110
Query: 236 FMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ ++YP + + + +++ F+ + S + +VP+
Sbjct: 111 -HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPM 169
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
+ + DD Q V +PY GE + + Y C N
Sbjct: 170 ADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITY----------CT-DSNF 218
Query: 345 KLLINYGFVDEDN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
+LL Y + EDN D + E + R + QR L+ Q + V
Sbjct: 219 ELLKTYAMMVEDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV-- 274
Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 457
+ +E D++ LRL ++ T S+ PVS E V D +A K L+
Sbjct: 275 -KSEEFPLDLVQALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQ 331
Query: 458 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 502
YP T ED AML D L R+A R +K L V A+M
Sbjct: 332 YPITAQEDAAMLAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 117/262 (44%), Gaps = 25/262 (9%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
V + ++AGE I ++Y G + N++ +I+ GF ++N +DR+ ++ ++
Sbjct: 406 CVALQDFQAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSK 454
Query: 371 EDPQYQDKRMVAQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV-- 425
D Y K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 455 SDRLYAMKAEVLARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 513
Query: 426 ------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 478
I +LG PVS E + L D L Y T ED+A+L + +L +
Sbjct: 514 ENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPART 573
Query: 479 RVATQLVRMEKKMLNACLQVTA 500
++A +L EK++L +Q A
Sbjct: 574 KMAIKLRLGEKEILEKAVQSAA 595
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 113/242 (46%), Gaps = 46/242 (19%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
T++ + L+L+L+ EK +GK SFW PYIR L + +P+ ++E+EL L+
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282
Query: 211 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S + E+A +K+E +N+L+ F+ L + FTF+ F+ A+ +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329
Query: 267 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 317
++ ++ + L+ + LVP+ L+ +S KA ++ V PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQ 376
+ + + + Y G + N++L++ +GF +NP D + + + E PQ
Sbjct: 389 RKYDQVFISY-----------GFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQIN 437
Query: 377 DK 378
D+
Sbjct: 438 DE 439
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 92/396 (23%), Positives = 156/396 (39%), Gaps = 64/396 (16%)
Query: 48 RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
RR+ R+ +S + D L W+ NG V + + +E
Sbjct: 24 RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83
Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
A+ L G A VP SL +T E + ++ + + L L+ LA
Sbjct: 84 GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E+++G++S + Y++ L + SPL W+ EL+ + G+ ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192
Query: 222 EG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
G ++ Y L T +F ++ +D+ AF+ + F+ AF ++S
Sbjct: 193 AGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMA 242
Query: 276 VSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPY 317
ALVP G L+ +SS +A + AV DD V V DR
Sbjct: 243 PCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRAL 301
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
E + V YN T ++ +++GFVD P + ++ +DP D
Sbjct: 302 NVNEPLYVNYNPEGT----------DTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFD 351
Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
K V G F + A + + D+ +LRL
Sbjct: 352 KLDVLDVCGLGETPTFTLRAYSDPD--PDLRTFLRL 385
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 103/475 (21%), Positives = 188/475 (39%), Gaps = 108/475 (22%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
D + G ++ +++D ++++W + P + EK + +S D++
Sbjct: 12 DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
G+ S+P +++ +V + + L NK+ +L A LY Y +
Sbjct: 61 EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W + E+ + S L +S+ E L GSP +++ + K +
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPL 287
L SLF++ + A T++ K A+ V S ++ + +L F V L
Sbjct: 167 L-------KSLFKEQQFS--KCAMTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVL 217
Query: 288 GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
P L Y + + AA D A+++V +P K GE I + Y G Q
Sbjct: 218 AP-FLDYFNHAEDAQAAYDFDYDESAIKVVALQPIKKGEQIFLNY-----------GNQD 265
Query: 343 -NSKLLINYGFVDEDNPYDRLV---VEAALNT---EDPQYQDKRMVAQRNGKLS--VQVF 393
NS LLI+YGF+D+ + V VE LNT DPQ +K + + + + +++F
Sbjct: 266 CNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIPASDPQLIEKTELLTKAFEQNERMKLF 325
Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
E IS L Y ++L L
Sbjct: 326 KDSLTEELLKISKYLSYKNF----------------------------SLLPYLKSLIDM 357
Query: 454 RLAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEK----KMLNACLQVTAD 501
++ YP T+ ED A++ T++ ++ + ++R+++ K + A +QV D
Sbjct: 358 KMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSIIMRLQEKETLKEIGALIQVKID 412
>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 492
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 120/291 (41%), Gaps = 66/291 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
D++ M + + P P+ EA + + + A + + + ++KV
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209
Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+VPL L A + + A L DD++ + +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269
Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
GE I Y G P + LL YG+V DN VVE +L+
Sbjct: 270 VGEEIFNDY-----------GELPRADLLRRYGYV-TDNYAQYDVVELSLD 308
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 99/426 (23%), Positives = 168/426 (39%), Gaps = 59/426 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQG 171
V A DL AG+ VP SL++ +E + + +L +LS+ +A +L+YE +
Sbjct: 79 VFALRDLAAGETVLRVPLSLLLNVEHASAS-PLGGILDDFRLSDAEAMAFWLIYELTRPE 137
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S WLPY+ L QL + + E+ L SP R ++ ++ +
Sbjct: 138 RASPWLPYLESL--PASIKQLT----MFYDPFEMKRLQASPVAEFTSRRTVKMRNKFGKY 191
Query: 232 DTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQAFVAVQSC------VVHLQKVSLAR 280
+ + P + P E T + F A +AVQ V H R
Sbjct: 192 RE------QISKHRPAHLAEIEFPVELITVDDFLWA-MAVQFTRLITVQVKHPADGEWER 244
Query: 281 RFALVPLG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
LVPL P + +C L + + RP G+ ++ Y
Sbjct: 245 TKCLVPLADLLNTAPADQINVECATNLDSTH--FECATIRPVAEGQELLTPYG------- 295
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQDKRMVAQRNGKLS 389
N +L+++YG +NP D + + E A+ + M R +L
Sbjct: 296 -GAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQ 354
Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPICPVSPCMERAVLDQ 446
+ V KE +L + R+ YVS D S+++ V+ + ++P ER L+
Sbjct: 355 LPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHRAINPSNERRALEL 409
Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EKKMLNACLQVTADM 502
L + Y T+ EDE ML + + P +V R+ EK++L++ Q+
Sbjct: 410 LLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEKRILSSLWQLLDSA 469
Query: 503 IMLLPD 508
I LP+
Sbjct: 470 IEALPE 475
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 120/291 (41%), Gaps = 66/291 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
D++ M + + P P+ EA + + + A + + + ++KV
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209
Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+VPL L A + + A L DD++ + +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269
Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
GE I Y G P + LL YG+V DN VVE +L+
Sbjct: 270 VGEEIFNDY-----------GELPRADLLRRYGYV-TDNYAQYDVVELSLD 308
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 121/313 (38%), Gaps = 62/313 (19%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ K+G I E + + R V AS+++ + F +P + ++++E +L
Sbjct: 13 AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S +I E+A+ +E +
Sbjct: 121 EDELAELQASAVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
M GSL Y +D+ T E+ ++ A + L K +VPL L
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
A + +C A L D +++ +P +AGE I Y GP P S LL YG+
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDY-----------GPLPRSDLLRRYGY 281
Query: 353 VDEDNPYDRLVVE 365
V DN VVE
Sbjct: 282 VT-DNYAQYDVVE 293
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/253 (24%), Positives = 104/253 (41%), Gaps = 40/253 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK-- 172
AS ++AG+ + L++ + G + T ++LA + L Y K Q K
Sbjct: 25 ASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERY-KGQAKLQ 83
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 84 SSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQ 138
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ LF + + E FK + V S + +++ LV + P+L
Sbjct: 139 NALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLVMI--PML 182
Query: 293 AY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ +S K + + + DR Y + I W N G N++L
Sbjct: 183 DFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQI--WINY---------GDLSNAELA 231
Query: 348 INYGFVDEDNPYD 360
++YGF +NPYD
Sbjct: 232 LDYGFAVPENPYD 244
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/386 (22%), Positives = 162/386 (41%), Gaps = 31/386 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCIG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ EY L+
Sbjct: 138 NSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVENEYRRLE 190
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + A V
Sbjct: 191 EEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDH 250
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
AY K A L + D L KAG+ + + Y+++ + N++L ++Y
Sbjct: 251 --AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLN----------KSNAELALDY 298
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLP 409
GF++ + + + ++ DP + DK +A+ NG F + ++ + L
Sbjct: 299 GFIEPNADRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLR 358
Query: 410 YLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
+ LG +D ++S+ S G + PVS E + + + K LAGY T+ ED+
Sbjct: 359 LVALG-GTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQ 417
Query: 467 AMLTDYNLHPKKRVATQLVRMEKKML 492
L + L + +A + EK++L
Sbjct: 418 K-LKEAKLDSRHAIAVGIREGEKQLL 442
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 20/176 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +DL+ + +P V++ + +IA +L +L E ++ LMYE +G
Sbjct: 44 IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ + ++ P+LW + + YL G+ + ++E E ++ +Y E
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + +P FT E FK A V S ++ + +LVPL
Sbjct: 151 ----DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SLVPLA 199
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 86/387 (22%), Positives = 161/387 (41%), Gaps = 33/387 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ + L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCSG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ E+ L+
Sbjct: 138 DSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLE 190
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
+ + +P I + F F I + +AF +++ +V + L A V
Sbjct: 191 EEIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTD 248
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
AY K A L + D L KAG+ + + Y+++ + N++L +
Sbjct: 249 DH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLN----------KSNAELAL 296
Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
+YGF++ + + + ++ DP + DK +A+ NG F + R
Sbjct: 297 DYGFIEPNTDRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPY 356
Query: 409 PYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
L +D ++S+ S G + PVS E + + + K LAGY T+ ED
Sbjct: 357 LRLVALGGTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEED 416
Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
+ L + L + +A + EK +L
Sbjct: 417 QK-LKEAKLDSRHAIAVGIREGEKNLL 442
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 95/228 (41%), Gaps = 22/228 (9%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
AL+P+ + + A D + R +K GE + + Y GP+
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISY-----------GPRT 66
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
NS ++ GFV DN D + ++ D +++ + + SV F + G E
Sbjct: 67 NSDFFVHSGFVCMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TE 124
Query: 403 AISD-MLPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARL 455
ISD +L +LR+ + +E+ + S C + +E V L + +
Sbjct: 125 PISDTLLAFLRV-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLI 183
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
A YP TL ED +L + L K++A QL EK++L+ L+ I
Sbjct: 184 ANYPTTLKEDLELL-ETTLPQMKKMAVQLRVTEKRILSGALEYVEQWI 230
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 114/292 (39%), Gaps = 63/292 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
V A+++++ + F +P S V+++E +L E T + L L L ++YE G
Sbjct: 40 VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------------- 217
S W PY L + + + WSE ELA L S A+I
Sbjct: 98 DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANEAFLGQLV 150
Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
++AE ++ E N T+ GSL Y +DI
Sbjct: 151 PVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT------P 202
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
+ V + + +L + ++PL L A + +C A L +++ +P KAG
Sbjct: 203 RKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKALKPIKAG 260
Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE 371
E I Y GP P S LL YG+V E+ YD + V L +E
Sbjct: 261 EEIFNDY-----------GPLPRSDLLRRYGYVTENYAQYDVVEVPMELVSE 301
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 112/266 (42%), Gaps = 35/266 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EYN + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
P+ + F+ F A+ V V +L+ ++ P +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRPF-VDFCNH 172
Query: 295 SSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
A ++ DA ++ +R Y GE + + Y G + N+ L ++YGF
Sbjct: 173 DPVSHATVSHDTCKDAT-IIAERDYTKGEEVFISY-----------GKRSNAVLAVDYGF 220
Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDK 378
V +N D+ + + DP + K
Sbjct: 221 VLPNNLSDQAELWMEIPWNDPLREKK 246
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 98/421 (23%), Positives = 166/421 (39%), Gaps = 70/421 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-LALY-LMYEKKQ 170
+ A D+ + F++P V+ LG+ +L E+ C LAL ++ + Q
Sbjct: 42 LVAQSDIGEDEVLFTIPRDAVLNTTTALGSADNPAIL------EMPCWLALTAIILTEGQ 95
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY- 228
+ S W PY+ L + ++S + WSE+EL L S +I AE + E+
Sbjct: 96 QEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEHI 148
Query: 229 ------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-----CVVHLQKVS 277
N + S+ Y +DIP K+ +S +V +
Sbjct: 149 SPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPESPEDGDDLVSDNEEE 200
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
+++PL L A + A L ++ +++ +P GE I+ Y
Sbjct: 201 ENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDY---------- 250
Query: 338 CGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDP-------------QYQ 376
G P S LL YG++ D+ YD V E A+L+TE P + +
Sbjct: 251 -GQLPRSDLLRRYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLE 307
Query: 377 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 436
+ +AQR G H G + +I D L L + D + ++ +S + S
Sbjct: 308 QRVELAQREGVYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRS 367
Query: 437 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
V L ++R Y T+ D+A+L NL +KR+A ++ EK +L +
Sbjct: 368 KLATSLVGQILTKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAI 427
Query: 497 Q 497
Q
Sbjct: 428 Q 428
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 21/200 (10%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V G+R + + +L W+H G+ I + S + V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427
Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
G +P L++ + L + I +L+ ++ + L LY+++EK G S W
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
P+ L SPLL+ TEL L G+ E E +G++ + L
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHESL------ 534
Query: 238 AGSLFQQYPYDIPTEAFTFE 257
G L + YP PT+AFT+E
Sbjct: 535 -GPLAEAYPALFPTDAFTYE 553
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 94/204 (46%), Gaps = 21/204 (10%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 234 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 282
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------ 419
++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 283 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHL 341
Query: 420 ---SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
S + + + PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 342 LGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSV 401
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 402 RATMAIKLRLGEKEILEKAVKSAA 425
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/204 (24%), Positives = 96/204 (47%), Gaps = 21/204 (10%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 156 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 204
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D Y K V R G + VF +H+ E + +L +LR+ +++ + +
Sbjct: 205 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHL 263
Query: 426 --------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
I +LG PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 264 LGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSV 323
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 324 RATMAIKLRLGEKEILEKAVKSAA 347
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 77/159 (48%), Gaps = 18/159 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVV 271
F+A + Q P I TE E F A+ + C+V
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALGEKVLCIV 217
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 94/402 (23%), Positives = 164/402 (40%), Gaps = 66/402 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
A+ +L++ FS+P SLV+++ +++ +E+ T + + CL MYE+
Sbjct: 39 ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNE 230
S W Y+ + +S + WS+ EL L GS +I E AEG Y+
Sbjct: 94 PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSI 144
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------------ 278
L +F+ P EA++ ++ + + S H+
Sbjct: 145 LVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAA 198
Query: 279 ---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
+VP+ L A S A L DA+ + + AGE I YN
Sbjct: 199 YHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQIFNTYN------- 251
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFH 394
PN+ LL YG VDE N D + + A L +D +R L ++
Sbjct: 252 ----DPPNADLLRRYGHVDEVNLNDNVEISADLIGCKD---------LERVDWLLDRLDD 298
Query: 395 VHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
V+ + E + D + +++ + SE + + + P ++ A ++ + +
Sbjct: 299 VYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQM 354
Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 494
RLA Y +T+ EDE++L + + A LVR+ EK++L A
Sbjct: 355 RLAQYSSTIEEDESLLASSTMLTSRSRAALLVRLGEKRILAA 396
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 45/269 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
V A++D+ G+ VP+ V+ E +E + + TN E L L LM EK
Sbjct: 32 VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR-- 226
K GK S W Y+ L + PL W +L L G+ ++E+ G K
Sbjct: 92 KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGCKAMP 141
Query: 227 --------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
++N + + F+ + + P++ + + ++ A V+ S + +
Sbjct: 142 DRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGEDRFQ- 198
Query: 279 ARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
A+VP+ L + L A A++++ GE ++ Y
Sbjct: 199 ----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSY--------- 245
Query: 337 KCGPQPNSKLLINYGFVDED-NPYDRLVV 364
G PNS+LL YGFV+ D NP+D L V
Sbjct: 246 --GDLPNSELLRRYGFVETDPNPHDCLEV 272
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 95/228 (41%), Gaps = 22/228 (9%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
AL+P+ + + A D + R +K GE + + Y GP+
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISY-----------GPRT 66
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
NS ++ GFV +N D + ++ D +++ + + G SV F + G E
Sbjct: 67 NSDFFVHSGFVYMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TE 124
Query: 403 AISD-MLPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARL 455
ISD +L +LR+ + +E+ + S C + +E V L + +
Sbjct: 125 PISDTLLAFLRV-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLI 183
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
A YP TL ED +L + L K++A QL EK++L L+ I
Sbjct: 184 ANYPTTLKEDLELL-ETTLPQIKKMAVQLRVTEKRILLGALEYVEQWI 230
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/182 (29%), Positives = 77/182 (42%), Gaps = 29/182 (15%)
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E E++ L G+P +E + I+ +Y V +L YP DI + T + F
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPV---LQALTAAYPDDITPDLVTEDKFI 121
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 312
A S + ++ V A R LVP+ P ++ Y L A D+++L
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRY-----GRLDAATDSLRLR 176
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNT 370
R AGE + Y GP PN KLL+ YGF DNP+D + EA N
Sbjct: 177 AFRHCAAGEQCFLSY-----------GPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNE 225
Query: 371 ED 372
D
Sbjct: 226 GD 227
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/204 (24%), Positives = 96/204 (47%), Gaps = 21/204 (10%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 106 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 154
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D Y K V R G + VF +H+ E + +L +LR+ +++ + +
Sbjct: 155 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHL 213
Query: 426 --------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
I +LG PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 214 LGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSV 273
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 274 RATMAIKLRLGEKEILEKAVKSAA 297
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 107/258 (41%), Gaps = 42/258 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L E +G +S W YI L + +PL ++E + A+L + + E
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSL 278
R K EY E A SL + PTE FTF+++ A V C +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179
Query: 279 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWYNISFT 332
+ P+ PL+ + + K K + D +VQL+ G + Y
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNY----- 234
Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSV 390
GP+ N +LL+ YGF DNP+D + ++ A++ + P K + + + +LS
Sbjct: 235 ------GPKGNEELLMGYGFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSN 288
Query: 391 QVFHVHAGREKEAISDML 408
VF + +KE +L
Sbjct: 289 LVFFLPKSPDKEIFQKIL 306
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 90/206 (43%), Gaps = 24/206 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ +A S+ + Y + E++T++IF A+ + S + +
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDF 336
R P L + + ML VD + + + E+ + + +FTG +
Sbjct: 209 RVLDAYLSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEV 268
Query: 337 --KCGPQPNSKLLINYGFVDEDNPYD 360
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 147/357 (41%), Gaps = 47/357 (13%)
Query: 80 DLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
DL WM NG ++ ++E E R + A + + AG+ VP L++T
Sbjct: 28 DLLEWMIANGFELHVQLCVRE---FTETGRGL---ATLQKVTAGETFLRVPTCLLITTTT 81
Query: 139 VLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L + L+ ++ L+ + L L+L+ EK +G S W +I L ++ +P
Sbjct: 82 ALSSSLHGFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTP 134
Query: 198 LLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+ LA L + KAE + I+R + L + + +L +E FT+
Sbjct: 135 VFLGSKLLARLPETMCRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTW 190
Query: 257 EIFKQAFVAVQS-CVVHLQKVSLARRFA---LVPLGPPLLAYSSKCKAMLAAVDDA--VQ 310
+F A+ AV + C+ K F L P L + KA + + +
Sbjct: 191 HLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFE 248
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
+V + Y+ + + IS+ HD N KLL+ YGFV DNP D + + T
Sbjct: 249 IVTNNNYEPNDQVF----ISYGSHD-------NKKLLLEYGFVLADNPNDVVAI-----T 292
Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 427
+ Y ++ +Q+N + + EK+ ISD + G + + V+S
Sbjct: 293 KGHLY---KLNSQQNDTVLYFATKLSFLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346
>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
Length = 489
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/319 (24%), Positives = 122/319 (38%), Gaps = 65/319 (20%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSP---------TKAEILERAEGIKRE----YNELDTV------------- 234
S +EL L GS + ILE I R + +D V
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEESILETIAPIVRANPALFPPIDGVASYDGDAGTQALL 188
Query: 235 --WFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
GSL Y +DI P + + ++ + +VPL
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEE--------EEQSSKGMVPLADL 240
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
L A + + A L ++ + + +P KAGE I Y G P S LL Y
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDY-----------GEIPRSDLLRRY 289
Query: 351 GFVDEDNPYDRLVVEAALN 369
G+V DN VVE +L+
Sbjct: 290 GYV-TDNYAQYDVVELSLD 307
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 99/237 (41%), Gaps = 42/237 (17%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+VP+ L A A L + +++V +P AGE I Y G P
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTY-----------GDPP 308
Query: 343 NSKLLINYGFVD----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 392
NS LL YG VD E NP D + + A L R + G L V+V
Sbjct: 309 NSDLLRRYGHVDVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRV 363
Query: 393 -FHVHAGREK--------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
F + + E ++L ++RL +S T + + + + G + +E +
Sbjct: 364 DFWLEEADDDTFVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELEL 419
Query: 444 LDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 497
L + D K RL YP T+ EDE++L + NL KR A + EK++L LQ
Sbjct: 420 LPAIVDVLKERLKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476
>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
Length = 518
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 21/151 (13%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
R+ +S T++ + K+ ED G W G+ C L P
Sbjct: 9 RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68
Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
S H P +AA DL+ G+ VP S+++T + +L +E + + N S L
Sbjct: 69 HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128
Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDR 185
+ LA+ L+YE +GK S+W PY+ L R
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR 159
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 107/252 (42%), Gaps = 33/252 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +DL+ GD A +P S++++ E V + L + +S L L+ M EK
Sbjct: 178 ARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNCDSK 237
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L +S + L G+ EI++ + + +Y+EL
Sbjct: 238 FKI-YFDTLPEKFNTG-------LSFSIQAITMLDGTLLLEEIMQARQHLHAQYDEL--- 286
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F A L +P P E +T+E F A S + + R L+PL L
Sbjct: 287 -FPA--LCNNFPDIFPPELYTWEKFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFL--N 341
Query: 295 SSKCKAML--AAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
S C ++ VD ++++ + RP ++GE + Y G +S L+
Sbjct: 342 HSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECCLSY-----------GNFSSSHLITF 390
Query: 350 YGFVDE-DNPYD 360
YGF+ + DN YD
Sbjct: 391 YGFLPQGDNSYD 402
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 124/306 (40%), Gaps = 47/306 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
AA+ + AGD A ++P + T+ L G A + L E AL+L+ E+
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAYLTGSPTKAEILERAEG 223
G+ S W I L G G+ + +P+ W E A L G+P A+ + +E
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
+ R++ L F A ++ D+ P A+T + F++A A S + +Q
Sbjct: 306 LARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASPGEPAA 358
Query: 283 ALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
+P L P ++ YS D ++L V R AGE + V Y
Sbjct: 359 TCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSY------ 405
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQV 392
G + N++LL+ YGF NPYD + + L E R A L++
Sbjct: 406 -----GAKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLSP 460
Query: 393 FHVHAG 398
V AG
Sbjct: 461 HAVRAG 466
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 88/399 (22%), Positives = 153/399 (38%), Gaps = 64/399 (16%)
Query: 49 RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
R +R S+ V G + E L W+ + G +V+ + N+
Sbjct: 12 RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
R V A L+ G A +P + + R + + L + + +AL
Sbjct: 72 RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180
Query: 224 ----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQS-CVVHLQKV 276
++ Y EL +F +Y +D+ AF F+ AF ++S + L
Sbjct: 181 YDAYVRAVYEEL------KNGVFVEYASTFDVDG-AFDEASFRWAFGILRSRTMAPLDGA 233
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGE 321
++A LVP G L+ +SS A A + DR Y G
Sbjct: 234 NVA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGA 288
Query: 322 SIVVWYNISFTGHDFKCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
I V Y+ P+ +SK ++YGF+D NP + ++ +D DK
Sbjct: 289 EIFVNYD-----------PEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLD 337
Query: 381 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 419
V + G F + + + ++ +LRL + DT
Sbjct: 338 VLETQGLPEAPTFTLRPYSDPD--RELRTFLRLLHCKDT 374
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/200 (23%), Positives = 79/200 (39%), Gaps = 39/200 (19%)
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 244
+ +PL WS+ E L GS YN LD W M A L Q
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205
Query: 245 YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 300
+ D+P ++ + FK A+ + + + S R+ + P+ K
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265
Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
A +L + + GE I + Y+ + N++ L+ YGFV E NP+D
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYD-----------SKNNAEFLLQYGFVIESNPHD 314
Query: 361 RLVVEAALNTEDPQYQDKRM 380
+ + A++ + P Y+DK +
Sbjct: 315 YVGIAASIGNDQPFYRDKSL 334
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 93/204 (45%), Gaps = 21/204 (10%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + ++AG+ I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 234 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 282
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------ 419
++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 283 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 341
Query: 420 ---SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
S + + + PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 342 LGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSV 401
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 402 RATMAIKLRLGEKEILEKAVKSAA 425
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 151/386 (39%), Gaps = 73/386 (18%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 276
R + + +EL + + P +I A F A AV S + V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 326
+ A++ PL+ + A +++ + ++V R + G +I +
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QR 384
Y GP N LL++YGFV DNP+DR+ L + ++ RM+A R
Sbjct: 266 Y-----------GPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSR 310
Query: 385 NGK------LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EM 422
G S QV V G +E +L LR+ + E
Sbjct: 311 TGSPPFSSPASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLER 370
Query: 423 QSVIS--SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
+ ++S + G VS E VL L + T+ EDEA L+D +L R+
Sbjct: 371 RELVSLQAWGVESMVSSDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRI 430
Query: 481 ATQLVRMEKKMLNACLQVTADMIMLL 506
A Q +K+++ L+ +M L
Sbjct: 431 AVQFRLTKKRLVVRVLESLKKRLMDL 456
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 148/375 (39%), Gaps = 60/375 (16%)
Query: 97 LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
LK++ + RP + + ++ LQ D S+P ++T + VL + + E +
Sbjct: 39 LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97
Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
K +S L L +L+ EK G+KS W PY+ L + P+ E ++
Sbjct: 98 MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVS 149
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L P + + E+ + Y + LF + I F + + A+ +
Sbjct: 150 LLPEPLRKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTIN 205
Query: 268 SCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKA 319
+ +++ K S + F+L P L P LL +S + KA + ++ + K
Sbjct: 206 TRTIYM-KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKK 264
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
E + + Y GP N +LL+ YGFV DNP+ + V +A + DK
Sbjct: 265 YEEVFICY-----------GPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK- 312
Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 439
QRN K+S+ H D+L L G+ + + + + L C
Sbjct: 313 ---QRNAKVSILKDH-----------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCW 358
Query: 440 ERAVLDQLADYFKAR 454
R + L D AR
Sbjct: 359 RRTL---LGDVISAR 370
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 93/228 (40%), Gaps = 22/228 (9%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
AL+P+ + + A D + R ++ GE I + Y GP+
Sbjct: 31 ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISY-----------GPRT 79
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
NS ++ GFV DN D + ++ D +++ + + SV F + G E
Sbjct: 80 NSDFFVHSGFVYMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TE 137
Query: 403 AISDM-LPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARL 455
ISDM L +LR+ + +E+ + S C + +E V L + +
Sbjct: 138 PISDMLLAFLRV-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLI 196
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
A YP TL ED +L + L K++ QL EKK+L L+ I
Sbjct: 197 ANYPTTLKEDLELL-ETTLPQVKKMTVQLRVTEKKILLGALEYVEQWI 243
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 112/265 (42%), Gaps = 43/265 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYE-KK 169
V A ED++ + F VP + ++ +E ++ E+ + + L + L++E K
Sbjct: 41 VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
G+KS W PY++ L ++ QL + W++ EL L G+ E+ E
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENV 155
Query: 222 EGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
I K E D+ W + S+ Y +D+ + + E K+ +
Sbjct: 156 VDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDN 211
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
+ V + ++PL L + + KC A L + +++ + K GE I YNI
Sbjct: 212 ESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQI---YNIY- 265
Query: 332 TGHDFKCGPQPNSKLLINYGFVDED 356
G PNS++L YG+++ D
Sbjct: 266 -------GDHPNSEILRRYGYIEPD 283
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 115/288 (39%), Gaps = 35/288 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P L+++ E + +E L N ++ L L+ M E+
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L +P + T++ F A S + + S LVP+ L
Sbjct: 296 ------LCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 349
Query: 295 SSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
S VD+A + + + RP AGE + Y G P S L+ YG
Sbjct: 350 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSY-----------GKHPGSHLVTFYG 398
Query: 352 FVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
F+ DNPYD + ++ D D+ + AQ + S Q H+ G
Sbjct: 399 FLPRGDNPYDVIPLDL-----DTSVDDEDIAAQSSATTS-QTTHMVRG 440
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 130/309 (42%), Gaps = 41/309 (13%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 255
P+ W+E E+ L GS ++ ER + I+ +Y G + YP P + T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
E FK A + V S L ++ R ALVP L Y + + D
Sbjct: 259 LEEFKWARMCVCSRNFGLD-INGLRTSALVPYADMLNHYRPR----------ETKWTYDN 307
Query: 316 PYKAGESIVVWYNISFTGHDFKC-GPQPNSKLLINYGFVDEDNPY------DRLVVEAAL 368
+ G +I + I + G + N + L+NYGF E+N + + + L
Sbjct: 308 -NRGGFTITTLHRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRL 366
Query: 369 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VI 426
+ DP Q K + +G +V + G + +A+ L LR+ V+D +EM + +
Sbjct: 367 DARDPLRQKKARFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMY 423
Query: 427 SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQ 483
++ + P+S E A +++L L YP TL ED A L + L P +R A
Sbjct: 424 RTVKDVRFPLSVRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALI 483
Query: 484 LVRMEKKML 492
V EK +L
Sbjct: 484 QVYGEKVVL 492
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 94/476 (19%), Positives = 184/476 (38%), Gaps = 74/476 (15%)
Query: 59 SSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASED 118
S L S+ + + + +L W+ KV ++ K +E +R + AS+
Sbjct: 21 SESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQF 74
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSFW 176
++ G+ VP + ++LE V + I +++ N + + + + + ++ + + SFW
Sbjct: 75 IRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSFW 134
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PYI L + P + + A L GSPT ++ + + + EY+ L
Sbjct: 135 KPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNLKE--- 185
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----------ALV 285
A FQ+Y Y T+ F V + ++++R F LV
Sbjct: 186 -AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPVYIGENEQQQLLV 226
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWYNISFTGHDFKCGPQPN 343
PL + + + DA + R + GE + Y G N
Sbjct: 227 PLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNY-----------GQWSN 274
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
+NYGF NP ++ + L+ D + +M + G + + +
Sbjct: 275 KYFFMNYGFASLTNPMNQFDFDICLDRNDRMF---KMKVELTGGNICWGNRLVNETDHDT 331
Query: 404 ISDMLPYLRLGYVS---DTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYFKAR 454
L +R +S D +++ + + P +P +E+A L D +
Sbjct: 332 FRQSLATVRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLLVSE 391
Query: 455 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 510
LA + +T+ +D+ L D + +R L EK+++ ++V DM++ + D T
Sbjct: 392 LANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/195 (23%), Positives = 87/195 (44%), Gaps = 20/195 (10%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE W+H+NG C+ + + R + A++ L+ + +P L++
Sbjct: 18 EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71
Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
T++ ++ + + + +L+ LA++LM EK + +KSFW PYI L +
Sbjct: 72 TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
P ++E + L S + + + +EY EL + M LF +A
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLFPD-----QEDA 178
Query: 254 FTFEIFKQAFVAVQS 268
F F+ FK A+ A+++
Sbjct: 179 FNFKDFKWAWSAIKT 193
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 82/205 (40%), Gaps = 29/205 (14%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + A+ P W E ++AYL G+ I E
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHL 273
+K E+ + L F A + Q Y FT F+ + V A Q L
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWYN 328
F+++ PL ++ A D A QLV Y+ GE + YN
Sbjct: 225 PPGCQLDDFSIL---QPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQV---YN 278
Query: 329 ISFTGHDFKCGPQPNSKLLINYGFV 353
G + NS+LL+ YGF+
Sbjct: 279 --------NYGLKTNSELLLAYGFI 295
>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
Length = 1709
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 37/148 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
+AA +D+ ++PN L+++ ++V G + +++ N+ EL C
Sbjct: 954 IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013
Query: 160 -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
LALYLMYEK +G++SFW PY EL+++ + L WS ELA S
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLF 242
I +E N+ + ++F+ S+
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082
>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 489
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 125/319 (39%), Gaps = 65/319 (20%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
S +EL L GS + I+E RA +G+ + T +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEGSIIESIAPIVRANPALFPPIDGVASYDGDAGTQALL 188
Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
GSL Y +DI P + + ++ + + +VPL
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGDRDGEDGYLTDEEEEQSSK--------GMVPLADL 240
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
L A + + A L ++ + + +P K+GE I Y G P S LL Y
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKSGEEIFNDY-----------GEIPRSDLLRRY 289
Query: 351 GFVDEDNPYDRLVVEAALN 369
G+V DN VVE +L+
Sbjct: 290 GYV-TDNYAQYDVVELSLD 307
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 75/164 (45%), Gaps = 25/164 (15%)
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED----NPYDRLVVEAALNTED 372
++ GE +++ Y ++ + N +L ++YGFV+ + + D + ++ D
Sbjct: 136 FRTGEQVLMQYGMN----------KSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESD 185
Query: 373 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSV 425
P + DK +A+ NG + F + G + ML +LRL + T SV
Sbjct: 186 PFFADKLDIAELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSV 243
Query: 426 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
L PVS E A+ + D ++ L+GY T+ EDEA+L
Sbjct: 244 WEHLS--LPVSQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 79/340 (23%), Positives = 132/340 (38%), Gaps = 70/340 (20%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+L L SW ++G K+ L++ + + L G+ S+P SL +T+
Sbjct: 27 ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ VL AL+LM E+++G SFW Y+R L V++
Sbjct: 82 DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114
Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL W +E E L G T +L R +++++ E L + +P +
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG 166
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPL-----GPPLLAYSSKCKAM 301
TFE + A ++ S Q+ S R A+VP+ P A S
Sbjct: 167 --GVTFEDYLWAMSSIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIE 224
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC--------GPQPNSKLLINYGFV 353
A + + + + P +A ++ V G D+ G N+KLL +YGFV
Sbjct: 225 FQAQEGGLSMGIGEPGRARGTLRV-----SAGRDYAAREQFFILYGRYSNAKLLYSYGFV 279
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
NPY L + DP + K+ + + + Q +
Sbjct: 280 LASNPYGGLDYWVRVPQTDPGFAWKQALLDEHPLTAAQAY 319
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 90/200 (45%), Gaps = 35/200 (17%)
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 376
Y+ GE I ++Y G + N+ L++ GFV DN + + +LN + Q++
Sbjct: 74 YRKGEQIFIYY-----------GNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFE 122
Query: 377 DKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVIS 427
++ + ++ G S F+V G IS ++L + R LG+ +QS +
Sbjct: 123 QRKQLLEKLGLASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL- 181
Query: 428 SLGPICPVSPCMERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPK 477
L P C P +E ++ +++ Y RL TL +DEA+L + L
Sbjct: 182 -LEPDC---PGLEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHI 237
Query: 478 KRVATQLVRMEKKMLNACLQ 497
K + Q +EKK+L+ L+
Sbjct: 238 KSMLVQFRVVEKKILSEALE 257
>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
Length = 583
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 39/260 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 319
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
+L + +P P E +T E F A S + + R L+P+
Sbjct: 320 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 376
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
P ++ Y + + ++++ V +P GE + Y G +S L
Sbjct: 377 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSY-----------GNFSSSHL 420
Query: 347 LINYGFVDE-DNPYDRLVVE 365
+ YGF+ + DN YD + +E
Sbjct: 421 VTFYGFIPQGDNLYDTIPLE 440
>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
Length = 1233
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 88/386 (22%), Positives = 145/386 (37%), Gaps = 93/386 (24%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H A+ D+ G+ VP SL +T V G + E+L + SEL LAL+LM E+ +
Sbjct: 871 HGFVAARDVGQGEVLLQVPGSLAITAVDV-GKDAQLEVLARGR-SELVGLALWLMQERAK 928
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
A +P+LW + E L GSP E R + +++E+
Sbjct: 929 ----------------------ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQ 966
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLG 288
++ + + YP + E QAF+ S V+ H + A+ FAL+PL
Sbjct: 967 DIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVLAHAAYLPKAQCFALLPLV 1018
Query: 289 PPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
L S A+L +AV +V R G+ + ++
Sbjct: 1019 GGLCRTGSSSGALLDYDLEREAVTVVAQR--TPGQEVALYC------------------- 1057
Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
L + A+L D Y KR + + G F + ++ A
Sbjct: 1058 ---------------LFMAASLVAADRLYTTKREILEELGLGVKAEFPIF--EDRLATQQ 1100
Query: 407 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
++ + E ++I SP E +L L + R+ Y +D
Sbjct: 1101 LINF----------EQDTII---------SPENEYEILQLLMGDLRDRIQAYATEFDDDI 1141
Query: 467 AMLTDYNLHPKKRVATQLVRMEKKML 492
L +L P++R+A QL EK++L
Sbjct: 1142 KDLQRTDLTPRQRLAAQLRLGEKRIL 1167
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/255 (23%), Positives = 110/255 (43%), Gaps = 39/255 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S++++ E V ++ L + +S L L+ M E+
Sbjct: 190 ATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERHNCNSK 249
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
+ Y L ++ G ++ S+ L + EI++ E ++ +Y+EL
Sbjct: 250 SKI-YFDTLPKEFNTGLSFGVDAIMASDGTLLF-------DEIMQAKEHLRVQYDEL--- 298
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
+L YP P E +T+E F A S + ++ + R L+P
Sbjct: 299 ---VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKFLDGKLRTCLIPIAGFLNHS 355
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
L P ++ Y + ++ + ++ + RP + GE + Y +F+G + L
Sbjct: 356 LHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSYG-NFSG----------AHL 399
Query: 347 LINYGFVDE-DNPYD 360
+ YGF+ + DN YD
Sbjct: 400 ITFYGFLPQGDNRYD 414
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 101/255 (39%), Gaps = 40/255 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L + ++ + L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G + + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDELI-- 282
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
L + + P E +T+E + A S + ++ + L+P+
Sbjct: 283 -----PLLSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 337
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
P ++ Y C +++ V RP GE + Y G +S L
Sbjct: 338 IYPHIVKYGKVCVET-----SSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHL 381
Query: 347 LINYGFVDE-DNPYD 360
L YGF+ + DNPYD
Sbjct: 382 LTFYGFLPKGDNPYD 396
>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
Length = 417
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 93/410 (22%), Positives = 174/410 (42%), Gaps = 59/410 (14%)
Query: 76 EDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+DL K WM G+ P ++ E + + A+ ++ GD VP ++
Sbjct: 9 DDLVTFKQWMDDEGIYLNPSLDIVKLEDYGRS--------IIANTLIKEGDVLIRVPRNV 60
Query: 133 VVT---LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
+++ +E + E I ++ +N+ + A+YLMY K S+W Y L +Q
Sbjct: 61 MMSRTGIELHIPKE-IRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQ- 117
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ + + + E+ L S + R GI+R YN ++ SL ++
Sbjct: 118 ------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYN---VIFKKLSSLNDEFK- 167
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
+ +TFE+FK A + S L + +VPL A K K+ +
Sbjct: 168 ---KKEYTFELFKWALSCIWSRAFSLS----SDDGGMVPLADMFNAIE-KAKSKVRPDSR 219
Query: 308 AVQLV--VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
A QL+ + + GE + Y + T G N+++L++YGF D+P + ++
Sbjct: 220 ADQLIYYASKDIERGEQVFTPYGVYKT-----IG---NAQMLMDYGFA-FDDPSEGDTIQ 270
Query: 366 AALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 423
L+ ++D Y D ++ + V+ F++ + + ++L Y R+ + + +E+Q
Sbjct: 271 LTLDNFSDDELYIDTKIDLLEQLDI-VREFNL---KRNQLPQELLIYARVKNLKE-NELQ 325
Query: 424 ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
+ PVS E+ L L++Y L Y TLS+D +L
Sbjct: 326 LAKEHYRNDDNRNKPVSRRNEKTALRYLSNYLSRYLDSYETTLSDDLELL 375
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 97/240 (40%), Gaps = 18/240 (7%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+S + L+P+ + + K A D V VV AG+ I Y
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNY--- 247
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
GP+ N +L++NYGF NP D +V P Y K Q L+V
Sbjct: 248 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 96/218 (44%), Gaps = 31/218 (14%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ +G K+ E PS+ + I A +D+ + + S+P L++T + L +
Sbjct: 74 AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
I ++ N S+ L + LM+E+ +G+ SF+ PY+ L R ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TE 252
+E L++L E+ R + Y L AG LF + P D +
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGAD 242
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPL 287
++ E F+ A++ +Q+ + RR AL+PL
Sbjct: 243 PYSLENFRFAWLTIQA-------RAFGRRLPYSALIPL 273
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 101/256 (39%), Gaps = 31/256 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A D G+ +P TL L ++ + + + + +AL++ E+ +G+K+
Sbjct: 26 ALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKA 84
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
++ L R + ++ WSE EL LTG+ E + E K +Y L
Sbjct: 85 TRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKK 137
Query: 235 W--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
G +++ D +E + A + S L R A+VP +
Sbjct: 138 MEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAMVPT-FDIF 189
Query: 293 AYSSKC----KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
+S+K L A + V + YKAGE + Y G NSKLL
Sbjct: 190 NHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYG---------SGEAANSKLLT 240
Query: 349 NYGFVDEDNPYDRLVV 364
YGF +DNPY+ L V
Sbjct: 241 WYGFCIDDNPYEELDV 256
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 122/294 (41%), Gaps = 50/294 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + IA L L + +MYE+ +GK
Sbjct: 10 VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L Q P LWSE E+ L G+ + E +K ++ E
Sbjct: 66 GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 117
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
L ++ P + P + FTFE +++A +S V ++ + +VPL
Sbjct: 118 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 167
Query: 291 LLAYSSKCKA-----MLAA-------------VDDAVQLVVDRPYKAGESI---VVWYNI 329
++ K A ML A +DD + R + +S+ V+ ++
Sbjct: 168 --LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKDV 225
Query: 330 SFTGHDFKC-GPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 380
+ F G N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 226 AAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKKRV 279
>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
Length = 566
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 39/260 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 302
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
+L + +P P E +T E F A S + + R L+P+
Sbjct: 303 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 359
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
P ++ Y + + ++++ V +P GE + Y G +S L
Sbjct: 360 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSY-----------GNFSSSHL 403
Query: 347 LINYGFVDE-DNPYDRLVVE 365
+ YGF+ + DN YD + +E
Sbjct: 404 VTFYGFIPQGDNLYDTIPLE 423
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 117/262 (44%), Gaps = 49/262 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ED++AG+ SVP + ++T NE++ +L T+ LS LAL+L+ + K
Sbjct: 1 MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S+W PY L + S L L++L S K E +++ + I +Y
Sbjct: 54 QSWWKPYTDLLPMHFNTMPVNYPSEL------LSHLPNS-LKQETMQQKDNIHTDY---- 102
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
V + +Q P DI T E FK A++ V + +H+ + + L P L
Sbjct: 103 -VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLD 156
Query: 292 -LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWYNISFTGHDFKCGPQPN 343
L ++++ K ++ + R YK GE + + Y GP N
Sbjct: 157 FLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINY-----------GPHDN 200
Query: 344 SKLLINYGFVDEDNPYDRLVVE 365
+L YGFV +N Y+ ++++
Sbjct: 201 LAMLKEYGFVLNENIYNFVLLD 222
>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 692
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 86/206 (41%), Gaps = 24/206 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E+++++IF A+ + S + +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDF 336
R P L + + ML VD + + + E+ + + +FTG +
Sbjct: 209 RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268
Query: 337 --KCGPQPNSKLLINYGFVDEDNPYD 360
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 57/263 (21%), Positives = 106/263 (40%), Gaps = 29/263 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQG 171
+ ++ ++ + VP ++ ++ + + + + L++ L L+++YEK +
Sbjct: 767 IVTTKKVEENEVIIKVPRKFLINVQVAREHPILGRIFEEFSGLNDDTILFLFVIYEK-EN 825
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW P+ L + + ++ TEL L G+ AE L+ +K +
Sbjct: 826 PNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----VKEHLQSI 874
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ F L +QYP P F++E F A S + L K+ LVP+ +
Sbjct: 875 RDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCLVPMADMI 931
Query: 292 LAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
+ + + D ++V I + Y G N +L +
Sbjct: 932 NHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHY-----------GALQNRELALY 980
Query: 350 YGFVDEDNPYDRLVVEAALNTED 372
YGFV +DNPYD +++ L ED
Sbjct: 981 YGFVIQDNPYDSMLIGFDLPDED 1003
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 95/429 (22%), Positives = 166/429 (38%), Gaps = 69/429 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ F++P ++ T+E + + L + + LS LA+Y+++ +
Sbjct: 52 FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + +++ EL GS A I+ +Y L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
L Q+ P + FT E +K A V S + + + L FA
Sbjct: 161 VV------RLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA--- 211
Query: 287 LGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
+L +SS+ + A + + ++ + Y+AG+ V + + G PN+
Sbjct: 212 ---DMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYGS------IPNN 262
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
+LL YGFV NP D + + P ++ KR + G S + +
Sbjct: 263 RLLRLYGFVMPGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISI-------TL 315
Query: 405 SDMLPYLRLGYV----SDTSEMQSVISSLGPICP----VSPCMERAVLDQLADYFKARLA 456
+D LP LGY+ SD S++ S+ I P +S E VL L + F L
Sbjct: 316 TDPLPKNVLGYLRIQRSDESDLASIARQ--RIDPKYEKISDSNEVEVLQSLIESFCGLLD 373
Query: 457 GYPATLSEDEAMLTDYNLHPKKR---VATQLVRMEKKMLNACLQVTADMIMLLPDVTVS- 512
+ L E L + ++P + A + E+++L + DM+ + + +
Sbjct: 374 SFGTQLESLEKQLAE-GVYPSRGNAWAAAHVSLGEQQVLRLARKRAEDMLAAVESGSGNE 432
Query: 513 ----PCPAP 517
P PAP
Sbjct: 433 KGSLPAPAP 441
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 104/241 (43%), Gaps = 40/241 (16%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEIL 218
+LM + + ++ FW PYI+ L G + +PLL+ E +LA+L + A
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 268
R + K Y + A S+ Q Y +D+ A T I +AF A + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ LQ R ++ L P + A + K K A D++ LVV +AG+ +
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEV--- 299
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
G+++ GP+ N +L++NYGF DNP + VV + P Q K Q
Sbjct: 300 ------GNNY--GPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCS 351
Query: 387 K 387
K
Sbjct: 352 K 352
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 103/253 (40%), Gaps = 19/253 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
V L GD ++P SL++T VL + I + L +LS L ++L+ E+
Sbjct: 8 VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G SFW PY+ L + L W+ E+ L TK + + +N
Sbjct: 67 KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
L + L +Q P AFT+++FK A+ +V + V++ + + L P
Sbjct: 119 RLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQ---PQNSVLSPDEE 171
Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
A + + VD V D K+ + + + GP N KLL+
Sbjct: 172 DKSALAPFLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLE 231
Query: 350 YGFVDEDNPYDRL 362
YGF NP++ +
Sbjct: 232 YGFTLPCNPHNNI 244
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 110/482 (22%), Positives = 191/482 (39%), Gaps = 79/482 (16%)
Query: 83 SWMHKNGLPPC-KVILKEKPSHNEKHRP--IH---YVAASEDLQAGDAAFSVPNSLVVTL 136
+W+ NG K+ L+ + N R +H +A + FS+P L+++
Sbjct: 17 AWLRANGATVSPKLTLQATAAFNADSRTQVLHRRVIASAEAGFDKEEELFSIPRKLLLSA 76
Query: 137 ERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
+IAELL NK A L + +MYE K SFW PY+ L
Sbjct: 77 ----STSSIAELLLENKKEACALVGWMPLVVAMMYEIT-NKDSFWRPYLDLLPE------ 125
Query: 192 LAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKRE----------------------Y 228
+++P+ W++ +L L G+ T + + E AE I E Y
Sbjct: 126 -TLDTPMFWNDDDLELLEGTSTLSHLGKEDAETIFTEQIVPFMKLHPTHFDLKVHNMALY 184
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + +V MA S + D E E A C ++ + + A+VPL
Sbjct: 185 HRVASV-IMAYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRM-EKIAMVPLA 242
Query: 289 PPLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+L + + C A L + + P AG + Y G NS+LL
Sbjct: 243 D-MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGHELYNTY-----------GDLSNSELL 290
Query: 348 INYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----RE 400
YGF+D+ N ++ + + + E + ++ M A + FH+ A +E
Sbjct: 291 RKYGFIDDVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQE 350
Query: 401 KEAISDML-----PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
EA +L L + D E++S +++L V+ C R V + L + + R
Sbjct: 351 LEASIALLFQSPKQVRALRALDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRA 404
Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 515
Y T EDE L + +L ++++A ++ E+ +L+ + + + P + P
Sbjct: 405 EEYTTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEP 464
Query: 516 AP 517
AP
Sbjct: 465 AP 466
>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
Length = 692
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 86/206 (41%), Gaps = 24/206 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E+++++IF A+ + S + +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDF 336
R P L + + ML VD + + + E+ + + +FTG +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268
Query: 337 --KCGPQPNSKLLINYGFVDEDNPYD 360
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 92/204 (45%), Gaps = 21/204 (10%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + ++ GE I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 234 DDRCECVALQDFRPGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 282
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------ 419
++ D Y K V R G + VF +H + + +L +LR+ +++
Sbjct: 283 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHL 341
Query: 420 ---SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
S + + + PVS E + L D L Y T+ ED+++L +L
Sbjct: 342 LGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSV 401
Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
+ +A +L EK++L ++ A
Sbjct: 402 RATMAIKLRLGEKEILERAVRSAA 425
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 73/160 (45%), Gaps = 18/160 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 157/397 (39%), Gaps = 67/397 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A D+Q G+ F +P +V+ ++ NE +A+ L L L + ++YE G+
Sbjct: 50 VVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADEL--KNLGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
+S W Y + L + ++ + WS EL+ L S + +I E+
Sbjct: 108 RSNWNQYFQVLPTK-------FDTLMFWSGEELSQLQASAVIHKIGKKDAEEDIFEKIIP 160
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEA--------------FTFEIFKQAFVAVQSC 269
+ R + +L F + Y D +A + F+I K +
Sbjct: 161 LVRSHPDL----FPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEGE 216
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
+L +VPL L A + + A L D A+ + +P K G+ I Y
Sbjct: 217 DGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDY-- 274
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKR 379
G P S LL YG+V DN VVE L N E +Y +
Sbjct: 275 ---------GELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLK 324
Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPV 435
++ + ++ + + +++++D+LP L + E+Q + S P P+
Sbjct: 325 LLHEL--EILEDGYCILRPSAEDSLTDILPDELLALLKSLTLEREELQRLQSKQKPPKPI 382
Query: 436 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 472
E + L D K++L+ Y T+ +D+A+L +
Sbjct: 383 LAAREARI---LLDSVKSKLSQYGTTVEQDKAILQQF 416
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/407 (20%), Positives = 159/407 (39%), Gaps = 67/407 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
+ A+ +L+ G+ VP ++T E ++ + +++ + N LS L++ L+YE +
Sbjct: 51 LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110
Query: 171 GKKSFWLPYI----RELDRQRGRGQ-----LAVESPLLWSETELAYLTGSPTKAEILERA 221
KKSFW PY+ R+ D G L VE + +E A +A L +
Sbjct: 111 EKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKE 170
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+K ++ W A + +P ++ V L
Sbjct: 171 LELKPKFRSFQ-AWLWASATISSRTLHVPWDS----------AGCLCPVGDLFNYDAPGD 219
Query: 282 FALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWYNI 329
++ P GP L+ + + ++ V L R Y+ GE +++ Y
Sbjct: 220 YSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCY-- 277
Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGK 387
G N +LL +YGF+ E+N D++ + E +L + + + ++GK
Sbjct: 278 ---------GTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGK 328
Query: 388 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 447
LS ++ LRL + + +SV+ + +S E V+ +
Sbjct: 329 LSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQISVKNEILVMKWM 373
Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 494
++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 374 SEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415
>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 488
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 91/440 (20%), Positives = 169/440 (38%), Gaps = 78/440 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKRE- 227
G S W PY + L ++ + W++ EL L GS I + A E R+
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158
Query: 228 -------------------YNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVKNDEAEEVEGEGG 218
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 219 YVTDD------EERQLPK--GMIPLADLLNADADRNNACLFQEDGYLAMKSIKSIRKGEE 270
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQ 374
I Y G P ++LL YG+V ++ P + A L + P
Sbjct: 271 IFNDY-----------GELPRAELLRRYGYVTDNYAQYDEAEVPIQTICKVAGLKSSTPG 319
Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 434
+ R+ + ++ + + +++ LP L ++ + L
Sbjct: 320 PDEPRLEFLDDLEVLDDGYGIPRPDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSK 379
Query: 435 V-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML-----TDYNLHPKK----RVATQL 484
V P + A L + + L YP T+++D+ +L + PK ++A Q+
Sbjct: 380 VPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANNQGSTSPKSAGRLKMALQV 439
Query: 485 VRMEKKMLNACLQVTADMIM 504
+ EK++LNA L D I+
Sbjct: 440 RKGEKEILNAVLSELEDFIV 459
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 87/378 (23%), Positives = 154/378 (40%), Gaps = 57/378 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 277
R + + EL + + + + + A + + A V+ ++ +H K+
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWYNISFTGH 334
A L+ + ++ + L+ A D + +++V R + G +I + Y
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNY------- 266
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK----- 387
GP N LL++YGFV DNP+DR+ L + ++ RM+A R G
Sbjct: 267 ----GPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSS 318
Query: 388 -LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--S 428
S QV V G +E +L LR+ + E + ++S +
Sbjct: 319 PASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQA 378
Query: 429 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 488
G VS E VL L + T+ EDEA L+D +L R+A Q +
Sbjct: 379 WGVESMVSSDNEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTK 438
Query: 489 KKMLNACLQVTADMIMLL 506
K+++ L+ +M L
Sbjct: 439 KRLVVRVLESLKKRLMDL 456
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 17/150 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL G +P + V++++ N IA++L +++ L + +MYE GK
Sbjct: 4 VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW Y+ EL ++ PL W+E E + L G+ + E E + ++
Sbjct: 60 ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFET-- 110
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L +Q+ + ++FT E F+ A
Sbjct: 111 ----HVPPLVEQHADRLRADSFTLESFRVA 136
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 93/403 (23%), Positives = 156/403 (38%), Gaps = 73/403 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ + F++P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILER 220
G +S W Y + L R+ ++ + W+ +EL L GS ILE
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADESILEM 157
Query: 221 AEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAFTFEIF 259
I R + L + +A GSL Y +DI P +
Sbjct: 158 IAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEDEEGDG 217
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ ++ + ++ L++ +VPL L A + + A L ++A+ + +P
Sbjct: 218 EGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAIKPIAK 269
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTE 371
GE I Y G P + LL YG+V D PYD + V A L+
Sbjct: 270 GEEIFNDY-----------GEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGLSDS 318
Query: 372 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVISSLG 430
DP+ Q L +E + ++D+LP L + + T + +
Sbjct: 319 DPEKQPPLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQQRS 378
Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 473
P P A LA + + A Y T+++D+ +L+ N
Sbjct: 379 KNKPPKPAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLN 421
>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
Length = 660
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 92/439 (20%), Positives = 176/439 (40%), Gaps = 89/439 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
+ A+ D+ A F++P + ++ +E R+L G AE L A L L
Sbjct: 41 IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++ E G++S+W PYI L + ++P+ W++ EL L G+ AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153
Query: 223 ------------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
I +L + GS Y +D+ + +
Sbjct: 154 SDEMLRTHVLPIVTQNPTAFCPKGAIPLNEEDLLALAHRIGSTIMSYAFDLDDDKEESDA 213
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD---R 315
++ +V + + L +VP+ L A + A V+ +L V
Sbjct: 214 EEEGWVEDRDGLTML---------GMVPMADVLNANAD----FNAHVNHGEKLEVTSLRS 260
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYD---------RLVVE 365
+AG I+ +Y GP P+S+LL YG+V E + YD R +
Sbjct: 261 DIRAGTEILNYY-----------GPLPSSELLRRYGYVTPEHHRYDVAEVSWELVRSTLV 309
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHV---HAGREKEAISDMLPYLRLGYVSDTSE- 421
A L D Q V + G+ ++ + V +G + + P ++ SE
Sbjct: 310 AHLELSDGILQ---AVETQLGEDELEDYFVLERDSGEPSDEGRLVQPPQTCEVPTELSEQ 366
Query: 422 MQSVISSLGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
+++++ +L P + + R + + + +L+ Y ++ EDE +L + +L +
Sbjct: 367 LKTILKALKKQQPELIGSAVRRDEILHAVIGEALNRKLSEYATSVEEDEELLENSSLTKR 426
Query: 478 KRVATQLVRMEKKMLNACL 496
R+A ++ EK++L+ L
Sbjct: 427 HRLAIEVRLGEKRLLHELL 445
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 120/307 (39%), Gaps = 73/307 (23%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FS+P ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------------ 217
QG+KS W PY R L + ++ + WS EL L S +I
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQASTIVEKIGRSNAEESIRDS 158
Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFE 257
L EGI + L V + GSL Y +DI + E
Sbjct: 159 IAPILAKRPDLFPPPPGLASWEGIAGDA-ALIQVGHVMGSLIMAYAFDIEKAEDDDDEGE 217
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
+ ++++ L K +VPL L A + + A L + A+ + +P
Sbjct: 218 VNDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP-YDRL------VVEAA-LN 369
+ G+ I Y G P + LL YG+V ++ YD L + EAA L
Sbjct: 271 QKGDEIFNDY-----------GEIPRADLLRRYGYVTDNYAVYDVLELSLETICEAAGLA 319
Query: 370 TEDPQYQ 376
DP+ Q
Sbjct: 320 NADPESQ 326
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/344 (22%), Positives = 136/344 (39%), Gaps = 76/344 (22%)
Query: 97 LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
L++ SH EK + +Y A A+ D++ G+ VP ++TLE + G +
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213
Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
+ L +S + + L+ Y+M EK++ + S W YI L + P+ ++E E
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRKPE-SQWQIYIDILPKNFSN------FPIFFTEEER 266
Query: 206 AYLTGSPTKAEILERAEGIKREYN---------------ELDTVWFMAGSLFQQYPYDIP 250
+L GSP +ILE+ E IK +Y+ E + M S + I
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYDLICKEVPEYVQFPIREYSEIRMMVSSRI----FGIQ 322
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E + FVA + H + P K ++ A++D
Sbjct: 323 IEG----VKTDGFVAYADMLNHKR-----------PRQTSWTYTDEKQGFIIEAMEDI-- 365
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
+ GE + Y G + NS+ +NYGF++ +N + + ++ +T
Sbjct: 366 -------QRGEQVYDSY-----------GKKCNSRFFLNYGFINLNNDANEVPIKVYYHT 407
Query: 371 EDPQYQDKR-MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
+D Q K+ M+ + + F V E + + +LR
Sbjct: 408 DDQLKQVKQDMIVDHS---EFKKFRVVENLEDRVMQEFFSWLRF 448
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 104/254 (40%), Gaps = 33/254 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE + G+ A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ L +P + T++ F A S +V L L+ LVP+
Sbjct: 294 PL------LCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLST--CLVPVAGL 345
Query: 291 LLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
L S VD+A ++ + RP AGE + Y G P S L+
Sbjct: 346 LNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSY-----------GKHPGSHLV 394
Query: 348 INYGFVDE-DNPYD 360
YGF+ DNPYD
Sbjct: 395 TFYGFLPRGDNPYD 408
>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
[Ostreococcus tauri]
Length = 446
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 100/417 (23%), Positives = 164/417 (39%), Gaps = 64/417 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL-ACLALYLMYEK-KQ 170
VA + D+ G+ +VP V+ + T+ L+ + L LA +++ E
Sbjct: 25 VATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLDGILAAHVLREAFGL 84
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G++S + P++R L + ++ + W E EL L GS A + + EY+
Sbjct: 85 GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 282
LD F D P EAF TFE F A V S + L+ S +
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185
Query: 283 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
++ + P+L A S K A +AV++ +K + Y+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDT-------- 237
Query: 338 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
+P+ L+ YGF+ E NP + + V L+ D + K + +R+G +
Sbjct: 238 ---KPSQYFLLQYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWK 294
Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFK 452
R + D+L R+ D SE+ S S + + +AVL +
Sbjct: 295 VRGLD--YDLLAAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLI 348
Query: 453 ARLAGYPATLSEDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 504
L GY TL ED + + +N PKKR L+RM +K L +AD +
Sbjct: 349 TSLDGYGTTLGEDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 108/281 (38%), Gaps = 66/281 (23%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
+ A D+ A F++P +S++ T L NE + E T++
Sbjct: 49 IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L L L+YE QG S W PY+ L A ++P+ WS TELA L S
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161
Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
++ E A+ + R D V+F G Q+ D FE+ + A+ +
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFELAHRMGSAIMAY 214
Query: 270 VVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
L+K A +VP+ +L ++ A + D++
Sbjct: 215 AFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAYINHGADSLTAT 273
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
R KAGE I+ +Y GP PN +LL YG+V
Sbjct: 274 ALRTIKAGEEILNYY-----------GPLPNGELLRRYGYV 303
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 104/440 (23%), Positives = 167/440 (37%), Gaps = 83/440 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +D+ + FS+P S ++ +E + + I T L L L ++YE G
Sbjct: 40 VVAKQDIAEHELLFSIPRSSILGVENSILSTEIPPA-TFAHLGPWLSLILIMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E ELA L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
ERA+ +E + + M GSL Y +D+ A + + +
Sbjct: 152 VIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
A + L K +VPL L A +C A L D +++ +P +AG+ I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEI 263
Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTE 371
Y GP P S LL YG++ D YD L + + E
Sbjct: 264 FNDY-----------GPLPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEE 312
Query: 372 DPQYQDKRMVAQRNGKLSVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 430
+Y D++ + ++ + F + +E++S L L + E + + S G
Sbjct: 313 RIEYLDEQEIVDTGYDIAASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKG 366
Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQL 484
+ P + M L +AR+A YP TL +D E ++RVA +
Sbjct: 367 RL-PKAEKMTGKAAKFLYKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARA 425
Query: 485 VRM-EKKMLNACLQVTADMI 503
VR+ EKK+L + AD I
Sbjct: 426 VRIGEKKLLVQTEERLADKI 445
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 20/202 (9%)
Query: 78 LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
LG+ WM G+PP ++L + E + + G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ L A ++E +AL L+ E+ G SFW PYI L
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTE 252
V L+W E +LA L SP A + E+ L+ + A S +F
Sbjct: 227 VNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMRARSDVFD-------PS 279
Query: 253 AFTFEIFKQAFVAVQSCVVHLQ 274
FTFE + AF+ + S + ++
Sbjct: 280 VFTFEAYLWAFINIFSRAIRVK 301
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 94/403 (23%), Positives = 148/403 (36%), Gaps = 78/403 (19%)
Query: 81 LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
LKS+M HK K LKE+ + RP + + ++ LQAGD S+
Sbjct: 17 LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76
Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
P ++T L LG + L L L C L + EK
Sbjct: 77 PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G+KS W PY+ L + A P E ++ L P + + E+ I+ +
Sbjct: 137 AGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQELFQ 188
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--- 286
+ LF + +I F F + A+ V + +++ K F+L P
Sbjct: 189 SSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEPDVY 243
Query: 287 -LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
L P LL +S + KA + ++ D K + +++ Y GP
Sbjct: 244 ALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICY-----------GPH 292
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
N +LL+ YGFV DNP+ + V A + DK QR K+S+ H
Sbjct: 293 DNQRLLLEYGFVATDNPHSSVYVSADTLLKYFSSLDK----QREAKVSILKDH------- 341
Query: 402 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 444
D L L G+ + + + + L C R +L
Sbjct: 342 ----DFLENLTFGWEGPSWRLLTALKVLSLAADEFACWRRILL 380
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 147/384 (38%), Gaps = 65/384 (16%)
Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
DA + LV + ER + L E LAL L+YE+++G KS W P+I
Sbjct: 64 DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHIAL 123
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTK--------------AEILERAEGIKREY 228
L + L WSE ELA L GS +EI++++ E
Sbjct: 124 LP------ATPPHALLRWSEAELAELAGSDALELANRWRSQVSSDFSEIVDKSRAAVEES 177
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + A ++P+ + E F++ + ++ + VS++R+ A
Sbjct: 178 DPGKQL-SAAVKASLRFPW-LDLEGFSWAV----------SMIWSRCVSVSRKGA----- 220
Query: 289 PPLLAY--------SSKCKAMLAAVDDAVQLVVDR---PYKAGESIVVWYNISFTGHDFK 337
PP+ A+ DDA V R K G+ + + Y+
Sbjct: 221 PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYD--------- 271
Query: 338 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
PN+ LL+ YGF + + + A L+ E P Y+ KR + KL + A
Sbjct: 272 --GLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALE---KLGLGATADGA 326
Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 457
+ A D LP RL ++ ++ + L + S RA L A LA
Sbjct: 327 APFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEATARAAAGDLVAACDALLAA 383
Query: 458 YPATLSEDEAMLTDYNLHPKKRVA 481
Y + ED A L D P+ R+A
Sbjct: 384 YRGSEDEDAAALADPATPPRLRLA 407
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 75/294 (25%), Positives = 118/294 (40%), Gaps = 68/294 (23%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FSVP ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA---EGIKR 226
QG+KS W PY R L + ++ + WS EL L S T E + R+ E I+
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157
Query: 227 ----------------------EYNELDT----VWFMAGSLFQQYPYDI---PTEAFTFE 257
E + D V + GSL Y +DI + E
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
++++ L K +VPL L A + + A L + A+ + +P
Sbjct: 218 ANDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNT 370
+ GE I Y G P + LL YG+V ++ YD V+E +L T
Sbjct: 271 QQGEEIFNDY-----------GEIPRADLLRRYGYVTDNYAVYD--VLELSLET 311
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 91/237 (38%), Gaps = 46/237 (19%)
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T ER L + +L N + +L+ E+ +GK+SFW PYI L +L++
Sbjct: 90 TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143
Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIP 250
PLLW +ET L GSP I R + + ++ L T F+ G
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG---------- 193
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLA 303
E + V S V+ + SL LVP L SS +
Sbjct: 194 ------ETLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCL 247
Query: 304 AVDD---AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
D L + Y G+ + Y + TG S+L ++YGFVD++N
Sbjct: 248 IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTG----------SQLFLDYGFVDDEN 294
>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
Length = 543
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 99/250 (39%), Gaps = 30/250 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
VD +++ V RP GE + Y G +S LL YG
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHLLTFYG 393
Query: 352 FVDE-DNPYD 360
F+ + DNPYD
Sbjct: 394 FLPKGDNPYD 403
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 82/419 (19%), Positives = 160/419 (38%), Gaps = 52/419 (12%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELA 158
+E HR + A++ ++ G+ +P + ++LE V L N + ++ K + +
Sbjct: 63 QSEGHRTLR---ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQI--NYKPNNIQ 117
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
+ + ++ + K SFW PYI L + P + + A L GSPT ++
Sbjct: 118 TYFVNHLLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVI 171
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+ + K EY L A FQ+Y Y T++ F + + S +Q
Sbjct: 172 NQRKVFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEK 220
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWYNISFTGHDF 336
++ LVPL + + + DA + R + GE + Y
Sbjct: 221 EQQQLLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNY--------- 270
Query: 337 KCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVF 393
G N +NYGF NP ++ ++ LN D + K + + N G V
Sbjct: 271 --GQWSNKYFFMNYGFASLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNET 328
Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQL 447
R+ A + + ++ + D +++ + + P + +E+A L
Sbjct: 329 DHDTFRQSLA---TVRFTQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKAL 385
Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 506
L + +T+ +DE L D ++ L EK+++ + + M+ ++
Sbjct: 386 KGILVTELGNFASTIEDDERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 89/204 (43%), Gaps = 25/204 (12%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 90 LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147
Query: 222 EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
+ +KRE+ E + + L + Y+ F+ F+ + V ++ L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207
Query: 274 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
+ VS+ L+PL G +A + + A A QL V R ++ G+ I Y+
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYS- 266
Query: 330 SFTGHDFKCGPQPNSKLLINYGFV 353
P+ N++LL+ YGF+
Sbjct: 267 ----------PKTNAELLLGYGFM 280
>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 543
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 99/250 (39%), Gaps = 30/250 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
VD +++ V RP GE + Y G +S LL YG
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHLLTFYG 393
Query: 352 FVDE-DNPYD 360
F+ + DNPYD
Sbjct: 394 FLPKGDNPYD 403
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 91/408 (22%), Positives = 163/408 (39%), Gaps = 65/408 (15%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
AG+ +VP S+ + + EL+ +SE LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168
Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
Y+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVPL----- 287
F Q D + FT+++F+ A V + V+L + + AL+PL
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMAN 272
Query: 288 -GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
P A ++C A A ++ ++ + R S+ ++ I + G + ++
Sbjct: 273 HAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIF--IVY-------GTRTDA 323
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
+ L++ GFV NP+ + L P Y+++ + + G + F RE A
Sbjct: 324 EFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAA 383
Query: 405 SDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVLDQ 446
+ P + L VS + + + + + P C ER
Sbjct: 384 TTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER----W 439
Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 494
LA K L YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 440 LATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486
>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 485
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 75/356 (21%), Positives = 125/356 (35%), Gaps = 83/356 (23%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ L+++++ +
Sbjct: 24 LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
+ L GS +I + A DT+ L Q P+ P
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPHHFPPR------------- 173
Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
P PPL + S+ + A ++ G SI++
Sbjct: 174 --------------------PNMPPLNSPDSQNALLCLA------------HRMG-SIIM 200
Query: 326 WYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 385
Y K Y DED P +V A + D Q + R+ +
Sbjct: 201 AYAFDIE----KADEADEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEG 256
Query: 386 GKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 436
+ + ++H+G E LP R GYV+D V+ SL IC V+
Sbjct: 257 SFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDGICKVA 312
>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 536
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 99/250 (39%), Gaps = 30/250 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
VD +++ V RP GE + Y G +S LL YG
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHLLTFYG 393
Query: 352 FVDE-DNPYD 360
F+ + DNPYD
Sbjct: 394 FLPKGDNPYD 403
>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
Length = 472
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/335 (20%), Positives = 134/335 (40%), Gaps = 68/335 (20%)
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
++ L + E E+ YL GSP +I+ E + Y++L F + + +
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220
Query: 255 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 306
++ F+ A + + +++ S ++ L P+ PP++ Y + A +D
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280
Query: 307 -DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF----VDEDNPYDR 361
+V + + K GE I V Y+ + G D LL++YG+ +D+ + +
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCGSD----------LLVDYGYLPNQIDDKSCVNV 330
Query: 362 LVVE--AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 414
L+ E +N +DP DK +++ ++ KL +
Sbjct: 331 LMEELLETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------- 367
Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDY 472
M S+ L I + ++L+ L ++ YP T+ +D+ L +Y
Sbjct: 368 -------MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEY 420
Query: 473 N-LHPKKRVATQLVRMEKKMLNAC---LQVTADMI 503
N L + ++A L EK++L+ LQ D I
Sbjct: 421 NQLSARSKLAFNLAFQEKQILSNVYTKLQENIDTI 455
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 80/370 (21%), Positives = 154/370 (41%), Gaps = 62/370 (16%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+ L + +M+E +G +S W PY+ L + E+P+ WS EL L S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595
Query: 218 LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 264
+A+ + + ++ V +F Q Y D + ++ F+ +
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ ++ +VP+ +L ++ A + DDA+ + R KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYD------RLV---VEAALNTEDPQ 374
+Y GP PNS+LL YG+V + + YD +LV + A+L + Q
Sbjct: 714 NYY-----------GPHPNSELLRRYGYVTPKHSRYDVVELPWKLVENSLAASLGLSEQQ 762
Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-----MQSVISSL 429
+ R + V RE + + + SD E ++ ++ ++
Sbjct: 763 LDNAREYLDMDEIEDTFVL----DRESDEPNPDGTFTGSARFSDIPEDLRDQLKLLLKAV 818
Query: 430 GPICPVSPCMER-------AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 482
+ P S +R +VL + D ++ YP T+ EDE +L+ ++ ++R A
Sbjct: 819 RKVDPSSVADKRKRDEIQHSVLIKALDALASQ---YPTTIPEDERILSGSDISERQRAAV 875
Query: 483 QLVRMEKKML 492
+ EK+++
Sbjct: 876 TVRLGEKRLI 885
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 33/200 (16%)
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
E+ ++ + LF + + E F + V S + +++ LV
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQL---VVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
+ P+L + + A A + L VV E+ +W N CG N
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWIN---------CGDLSN 652
Query: 344 SKLLINYGFVDEDNPYDRLV 363
++L ++YGF +N YD ++
Sbjct: 653 AELALDYGFTVPENRYDEVM 672
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 93/204 (45%), Gaps = 22/204 (10%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
+R V EE W+H+NG C+ + + E R + A++ L+ +
Sbjct: 19 TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72
Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+P L++T++ ++ + +A + ++L+ LA++LM EK + +KSFW PYI L
Sbjct: 73 VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P ++E + L S + + + +E+ EL + M LF
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNS-LRGKAKAKKYECHKEFMELAPFFKMLADLFPD 183
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQS 268
+AF F+ FK A+ A+++
Sbjct: 184 -----QEDAFNFKDFKWAWSAIKT 202
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 128/331 (38%), Gaps = 52/331 (15%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +G V +KE S +++ + AS+D+ +P++++++ V
Sbjct: 36 LVDWGRMHGANIENVEIKETASDDDR-KLTRGAYASKDIPPNSEICFIPSTILLSESDVR 94
Query: 141 GNETIAELLT--------TNKLSE---------LACLALYLMYEKKQ-GKKSFWLPYIRE 182
+E +LT K+S+ L +A +++++ S WLPY+
Sbjct: 95 ASEIGKAILTYIDEHQDAKQKISDKIKHPHAEILLAMAAFIVHQVSLPTADSHWLPYLAS 154
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYNELDTVWFMAGSL 241
L + PL+W+ + L G + ++ ER E I+ N V G
Sbjct: 155 LPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTKVVENACGHY 204
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
F PT A T + + A ++ S K SL + + S C +
Sbjct: 205 F-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWIGLSEIC--L 255
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC---------GPQPNSKLLINYGF 352
+D R Y+ E + +SF D C GP+ N LL NYGF
Sbjct: 256 FPILD---MFNHKRGYRV-EWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSNYGF 311
Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
V E+NP D V L EDP Y K+ V +
Sbjct: 312 VIENNPEDYFKVFLGLQQEDPLYTAKKAVLE 342
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 69/156 (44%), Gaps = 16/156 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ +++DL+ + +P +++++ +I+ +LT + A+ L+YE G+
Sbjct: 82 IISNKDLKVNNIVAKIPKDIILSIH----TSSISNILTKYTMERNIATAIALIYEASIGE 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W YI L L V+ P+LW + L G+ + I + I Y ++
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADI- 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+ L + +P E F+FE FK A V S
Sbjct: 190 ----VESLLIKNHPEYFSKEIFSFENFKIANSIVSS 221
>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 119/294 (40%), Gaps = 72/294 (24%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I + EG +
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKIGK--EGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT----EAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
D++ M + + P P+ ++ E A + + + L +A F
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGLAHIMGSL---IMAYAFDIE 206
Query: 283 ---------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+VPL L A + + A L +D++ + +
Sbjct: 207 KAEDEDDEDNDEEEGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEEDSLVMKAIK 266
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
P AGE I Y G P + LL YG+V DN VVE +L+
Sbjct: 267 PIHAGEEIFNDY-----------GELPRADLLRRYGYV-TDNYAHYDVVELSLD 308
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 65/322 (20%), Positives = 125/322 (38%), Gaps = 46/322 (14%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE+ +L W+ + G KV + + + A+ + G+ +P + ++
Sbjct: 24 EEEYDELVDWLKQCGATVDKVAVDHFNGMGQG------LKATAEAAPGETLLRIPEACML 77
Query: 135 --------TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
TL + ++T+ +L+ L+ + + + SFW PYI L
Sbjct: 78 SEESARRSTLGAYMDSDTMLKLMPNVTLA-------FHLLLELHDLDSFWRPYIACL--- 127
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQY 245
++ PL W +L L GS E + + + R+Y L + + A +
Sbjct: 128 ----PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCF 183
Query: 246 P--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVPLGPPLLAYSSKCK 299
P + EAFTFE ++ A V + + + + AL+PL + +
Sbjct: 184 PLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS 243
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
+ + ++ V P K G I +WY G + N + L++ GF +
Sbjct: 244 TQFDSERECLEFVCPAPAKPGSQITMWY-----------GDRNNGQFLLHQGFFFAGHAN 292
Query: 360 DRLVVEAALNTEDPQYQDKRMV 381
D + V +L+ D Y+ K ++
Sbjct: 293 DYVNVPFSLDETDSLYKIKALL 314
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 66/144 (45%), Gaps = 18/144 (12%)
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 314
E FK +F + S +V L S+ + ALVP +L +S + L + D
Sbjct: 2 ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58
Query: 315 RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTED 372
RPY+ GE + + Y G + N +LL++YGFV + NP D + + +L D
Sbjct: 59 RPYQPGEQVFISY-----------GKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSD 107
Query: 373 PQYQDKRMVAQRNGKLSVQVFHVH 396
Y++K + ++ G Q F +
Sbjct: 108 GSYKEKLELLKKYGLSGSQCFPIR 131
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 93/214 (43%), Gaps = 38/214 (17%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + QG+ FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 97 FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151
Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 270
E +K Y + V + + FQ +Y +D I + AF+ ++ +AF +
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+ VS+ L+P L+ + K A V +V AG+ I Y
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNY--- 257
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
GP+ N +L++NYGF DNP D +V
Sbjct: 258 --------GPRNNEQLMMNYGFCLPDNPCDYRIV 283
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 92/439 (20%), Positives = 165/439 (37%), Gaps = 78/439 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
G S W PY + L ++ + W++ EL L GS I + A
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158
Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
G YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLAMKSIKSIRKGEE 270
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQ 374
I Y G P ++LL YG+V D YD V A L + P
Sbjct: 271 IFNDY-----------GELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPG 319
Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 434
+ R+ + ++ + + +++ LP L ++ + L
Sbjct: 320 PDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSK 379
Query: 435 V-SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT---DYNLHP------KKRVATQL 484
V P + A L + + L YP T+++D+ +L +Y + ++A Q+
Sbjct: 380 VPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQV 439
Query: 485 VRMEKKMLNACLQVTADMI 503
+ EK++LNA L D I
Sbjct: 440 RKGEKEILNAVLSELEDFI 458
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 118/282 (41%), Gaps = 48/282 (17%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L T+ LS LA+Y+++ + + K +
Sbjct: 34 FKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFVRSR-KSGYD 92
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
P R +A S + + E +L G+ + + I+ +Y L
Sbjct: 93 GP----------RNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRIEDDYRGL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
V M G QYP P + FT E +K A V S + + + L FA
Sbjct: 143 --VVRMLG----QYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193
Query: 287 LGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
+L +SS+ K + A + ++ + Y+AG+ + + Y GP PN+
Sbjct: 194 ---DMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINY-----------GPMPNN 239
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
+LL YGFV NP D + A + P ++ K+ + G
Sbjct: 240 RLLRLYGFVVPGNPNDSYDLVLATHPMAPFFKQKQKLWASAG 281
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
A LA+ LM + G + W Y L AV+S ++WS+ EL L GS +
Sbjct: 47 ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ R + KREY+ L F A L + P EA++F++F+ A+ V
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 321
+AR F L L +C A+L +D DA + VV+R A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184
>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 699
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 93/228 (40%), Gaps = 18/228 (7%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+S + L+P+ + + K A D + VV AG+ I Y
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY--- 273
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
GP+ N +L++NYGF NP D +V P Y K
Sbjct: 274 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 93/439 (21%), Positives = 166/439 (37%), Gaps = 78/439 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--------TKAEILERA- 221
G S W PY + L ++ + W++ EL L GS T E+ R
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSTAEEVFLRDL 158
Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
G YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLSMKSIKSIRKGEE 270
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQ 374
I Y G P ++LL YG+V D YD V A L + P
Sbjct: 271 IFNDY-----------GELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPG 319
Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 434
+ R+ + ++ + + +++ LP L ++ + L
Sbjct: 320 PDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSK 379
Query: 435 V-SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT---DYNLHP------KKRVATQL 484
V P + A L + + L YP T+++D+ +L +Y + ++A Q+
Sbjct: 380 VPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQV 439
Query: 485 VRMEKKMLNACLQVTADMI 503
+ EK++LNA L D I
Sbjct: 440 RKGEKEILNAVLSELEDFI 458
>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
Length = 653
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
AA +++ D VP +++T+ER L + + A + + + L ++L+YE
Sbjct: 56 AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++G +SFW PY +D G L P WS+ + L S K +I + + + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167
Query: 228 YNEL 231
++ L
Sbjct: 168 WDML 171
>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
Length = 762
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 118/316 (37%), Gaps = 71/316 (22%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
GS VV+ + +W+ G K+ +K H R + AA E G+
Sbjct: 11 GSSAVVTS------EFVAWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETL 57
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPY 179
SVP +L++T+++ +A L + + LAL+L ++ + W PY
Sbjct: 58 LSVPEALLLTVDKASRRADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPY 113
Query: 180 IRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG 239
+ R P W + A L GSP +++ R + I+R+ L
Sbjct: 114 RNVISRS------VSHLPCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDR 167
Query: 240 SLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQK-VSLA-RRFALVPLGPPLLAYS 295
F + + AF F + F + + H ++ V A R A V
Sbjct: 168 QAFAFAEAQVLSRAFAFNGTRAMVPFADLMNTARHHERHVDFAFERGAFV---------- 217
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE 355
+ AV R AGE + Y GP+ N++ L+NYGF
Sbjct: 218 ------MRAV---------RRGAAGEPVTDSY-----------GPKSNARYLLNYGFAMA 251
Query: 356 DN--PYDRLVVEAALN 369
DN RL+ +AAL+
Sbjct: 252 DNRDEAGRLLDDAALD 267
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 87/405 (21%), Positives = 157/405 (38%), Gaps = 47/405 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T E + + +L + + LS LA++L++ K
Sbjct: 912 VKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFVKS- 970
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS + + ++
Sbjct: 971 ----------RTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLHTLTTQLEQRVQ 1020
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ +L L Q+ P + FT E +K A + S + +
Sbjct: 1021 DDFRQLLV------QLLSQHRDLFPLDQFTIEDYKWALCTIWSRAMDFAVSDTTSVRLVA 1074
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
PL +L +S +C A D + ++ + Y+ G+ I ++Y G P
Sbjct: 1075 PLAD-MLNHSLDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYY-----------GSVP 1121
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N++LL YGFV DNP D + + P Y+ K + G S + A +
Sbjct: 1122 NNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTIPLTA--KHP 1179
Query: 403 AISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPA 460
++L YLR + D +++ + L G V+ E VL L D + L G+
Sbjct: 1180 LPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFGI 1238
Query: 461 TLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
L + EA L Y A Q+ E+ +L + DM+
Sbjct: 1239 PLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRAKKTAEDML 1283
>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
1015]
Length = 668
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 93/228 (40%), Gaps = 18/228 (7%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+S + L+P+ + + K A D + VV AG+ I Y
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY--- 247
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
GP+ N +L++NYGF NP D +V P Y K
Sbjct: 248 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 96/463 (20%), Positives = 185/463 (39%), Gaps = 68/463 (14%)
Query: 71 VSKKEEDLGDLKSWM-HKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+S EDL + W+ +KN L P I+ P + A+ +++ + +
Sbjct: 202 ISTTPEDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRS------MVANTNIKKDEILVEI 255
Query: 129 PNSLVVTLERVLGN--ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
P +++T + ++ N I + + K+S A+ ++Y + S+W Y+ L +Q
Sbjct: 256 PKGIMMTPKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ 314
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN-------------ELDT 233
+ + ++ E+ L SP R G+ R Y+ E D+
Sbjct: 315 -------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDS 367
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
S + + +T + FK A V S L + +VPL A
Sbjct: 368 TKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSLSE----EDGGMVPLADMFNA 422
Query: 294 YS----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
+ SK ++A ++ + +AGE I Y + T + ++L++
Sbjct: 423 DTVISRSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKT--------LGSGQMLMD 474
Query: 350 YGFVDED-NPYDRLVVEAA-LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
YGF+ ED + D +V A + +P Y KR + Q NG + + F + + + ++
Sbjct: 475 YGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNKLAKEL 530
Query: 408 LPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
+ R+ ++ ++ + + S ++P E+A L L++ L Y T+ +D
Sbjct: 531 FLFARIKSINKKESDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQD 590
Query: 466 EAML-------TDYNLHPKKRVAT----QLVRMEKKMLNACLQ 497
+L T+ N H T +L MEK +LN+ L+
Sbjct: 591 NQILKEIEKDKTNTN-HSSVTFNTINAIKLRLMEKNILNSFLK 632
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 63/265 (23%), Positives = 112/265 (42%), Gaps = 33/265 (12%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
+ IK+ E T S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELE 215
Query: 278 LARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 216 NERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY------ 266
Query: 337 KCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 267 --GEHPNSELLRRYGYVEWDGSKYD 289
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 98/223 (43%), Gaps = 39/223 (17%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
LL +SS+ +++ +++V ++ + GE++V+ Y GP N LL++Y
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNY-----------GPLSNDILLLDY 166
Query: 351 GFVDEDNPYDR--------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVF 393
GFV NP DR LV + +++ +DP ++ + N G S Q+
Sbjct: 167 GFVMPKNPNDRVELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM- 225
Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLAD 449
V G + +L +R+ + D E+ V + + P+ ER + L
Sbjct: 226 -VTLGGTELVEGRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIG 284
Query: 450 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
LA +P + ED++ L ++ R+A Q ++K++L
Sbjct: 285 LGMLALASFPTEIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 144/380 (37%), Gaps = 62/380 (16%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ L L L ++YE QG+ S W Y+ L + A E+P+ W+ EL L G
Sbjct: 100 SEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEG 152
Query: 211 SPTKAEILERAEGIK--RE--------------------YNELD--TVWFMAGSLFQQYP 246
+ E + + E + RE NE D ++ GS Y
Sbjct: 153 TSLTTEKIGKKESDRMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
+D+ E E + ++ + SL +VP+ +L +++ A + D
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGD 262
Query: 307 DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRL--- 362
+ AG I+ +Y GP P+S+LL YG+V E + YD
Sbjct: 263 QLQVTSLRESIPAGSEILNYY-----------GPLPSSELLRRYGYVTSEHHRYDVAEIS 311
Query: 363 --VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS----DMLPYLRLGYV 416
+V AL E +D +R + ++ F V E S P LR
Sbjct: 312 WSLVRTALAEELKLSEDTIADIERKLESELEEFFVIERDAGEPSSYGTLTQPPVLREIST 371
Query: 417 SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
+ ++ + +L P E L + RL YP + +DE++L+ L
Sbjct: 372 ELEEQTKAFLKALKKRDPKRKRSETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSK 431
Query: 477 KKRVATQLVRMEKKMLNACL 496
+ R+A ++ EK++L L
Sbjct: 432 RHRMAVEVRLGEKRLLQEAL 451
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 15/118 (12%)
Query: 252 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 308
E F+ F+ A++ V S CV + L + F LVPL + +SS C D A
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180
Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEA 366
+L + RP K GE IV + GHD + L YGF++ NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIV----FEYGGHD-------QATLWAEYGFIESSNPHERIDLTA 227
>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
Length = 625
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 93/228 (40%), Gaps = 18/228 (7%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 29 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 84 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
+S + L+P+ + + K A D + VV AG+ I Y
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY--- 199
Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
GP+ N +L++NYGF NP D +V P Y K
Sbjct: 200 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 87/227 (38%), Gaps = 43/227 (18%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
+ +L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 97 VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFK 260
L GSP I R + + ++ L T F+ G E
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLS 197
Query: 261 QAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQ 310
+ V S V+ + SL LVP L SS + D
Sbjct: 198 EGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTAS 257
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
L + Y G+ + Y + TG S+L ++YGFVD++N
Sbjct: 258 LEAHKSYSKGDEVFDSYGPALTG----------SQLFLDYGFVDDEN 294
>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
nidulans FGSC A4]
Length = 484
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 109/288 (37%), Gaps = 60/288 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
V A D+ + F++P LV++ N + +LL+ + +L L L +++E Q
Sbjct: 50 VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
G KS W PY + L + ++ + WS EL L GS +I ++
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGSAVVEKIGKQGAEESILKLI 158
Query: 222 --------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQA 262
G+ ++ + GSL Y +DI T E +
Sbjct: 159 IPVVRANPALFPPINGLASYDGDVGAQALLGLAHTMGSLIMAYAFDIETPENEDEREGED 218
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
K +VPL L A + + A L ++++ + +P +AGE
Sbjct: 219 GYLTDEEEEQSSK-------GMVPLADMLNADAYRNNARLFQEEESLVMKAIKPIRAGEE 271
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
I Y G P S LL YG+V DN V+E +L+T
Sbjct: 272 IFNDY-----------GEIPRSDLLRRYGYV-TDNYASYDVIELSLDT 307
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 99/232 (42%), Gaps = 46/232 (19%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A + L++ K+QG +S P+I +L G PL WS+ +LA L A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190
Query: 217 ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 258
+ E+ +RE+ L DT+ W+ G + + + P T
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 316
+ V + VV L + SL +++A+ PL L ++S ++ ++ D+ +V R
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302
Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
+K GE + + Y G Q N L+ YGF + DNP D V+ L
Sbjct: 303 FKKGEQVFITY-----------GAQSNDSLMQYYGFAEADNPQDTYVISDVL 343
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 95/208 (45%), Gaps = 29/208 (13%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
DD + V + + AGE I ++Y G + N++ +I+ GF ++N +DR+ ++
Sbjct: 42 DDRCECVALQDFTAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 90
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D Y K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 91 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 149
Query: 426 IS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDY 472
I +LG PVS E +L + +AR L Y T+ ED++ L
Sbjct: 150 IGDHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETP 205
Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
+L +A +L EK++L ++ A
Sbjct: 206 DLTFHATMAIKLRLGEKEILEKAVKSAA 233
>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
distachyon]
Length = 532
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 115/275 (41%), Gaps = 41/275 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE++ G A +P SL+++ E + ++ L N ++ L L+ M E+
Sbjct: 178 MVASENIGVGHIALEIPESLIISEELLCQSDMFLALKDLNSITTETMLLLWSMRERHNPS 237
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F + + L G L + LA L G+ E+++ + + ++Y+EL
Sbjct: 238 SNFKM-FFETLPSNFNTG-------LNFGIGALAALEGTLLFDELMQARQHLHQQYDELF 289
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ L ++P + +T++ F A S + + S L+P+
Sbjct: 290 PM------LCTKFPEIFTQDIYTWDNFLWACELWYSNSMMVVLSSGKLTTCLIPVAGLLN 343
Query: 289 ----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
P +L Y +A +++ + RP KAG+ + Y G S
Sbjct: 344 HSVYPHILNYGRVDQAT-----KSLKFPLSRPCKAGQQCFLSY-----------GKHSGS 387
Query: 345 KLLINYGFV-DEDNPYDR--LVVEAALNTEDPQYQ 376
L+ YGF+ EDNPYD L ++ +++ ED Q
Sbjct: 388 HLITFYGFLPREDNPYDVVPLDLDMSVDEEDGTAQ 422
>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
Length = 488
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 124/356 (34%), Gaps = 83/356 (23%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ LV++++ +
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
L GS +I + A DT+ L Q P
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANP------------------- 167
Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
R F P PPL + S+ + A ++ G SI++
Sbjct: 168 --------------RHFPPRPNMPPLNSSDSQNALLCLA------------HRMG-SIIM 200
Query: 326 WYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 385
Y K Y DED P +V A + D Q + R+ +
Sbjct: 201 AYAFDIE----KTDEVDEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEG 256
Query: 386 GKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 436
+ + ++H+G E LP R GYV+D V+ SL IC V+
Sbjct: 257 SFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDSICKVA 312
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 96/222 (43%), Gaps = 32/222 (14%)
Query: 153 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 212
+L E L+L+L+ EK + ++S W +IR + + ++ WSE +A L P
Sbjct: 19 QLCERQLLSLHLLVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKP 71
Query: 213 TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
A R + E+++L + W + + + +F+ E + A
Sbjct: 72 FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131
Query: 265 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 318
V + H ++ S + R V P+L + + A +AA DA+ ++ R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191
Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
GE +++ Y G N+ LL +YGFV EDNP D
Sbjct: 192 EGEQVMIHY-----------GNWSNAGLLEHYGFVLEDNPLD 222
>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
Length = 761
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G+KSF+ PY++ + + + + WS+ EL Y+ E I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152
Query: 225 KREYNELDTVWFMAGSLFQQY 245
RE +L +W A +F ++
Sbjct: 153 VRE--DLKDLWNQAKEIFNEF 171
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 66/294 (22%), Positives = 120/294 (40%), Gaps = 39/294 (13%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LK++ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 30 HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + +T +S L L +L+ EK G KS W PY+ L + +
Sbjct: 90 I-RSYLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E E+ L P + + E+ +++ + + SLF + D+ F +
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYH 196
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--Q 310
F A+ + + V H QK L+ + L P LL +S + A ++ +
Sbjct: 197 AFLWAWCTINTRTVYMKHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYE 256
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
+ K E + + Y GP N +LL+ YGFV +NP+ + +
Sbjct: 257 IRTTSHCKKFEELFICY-----------GPHDNHRLLLEYGFVASNNPHSAVYI 299
>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 477
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 108/280 (38%), Gaps = 62/280 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A++DL + FS+P + ++T NET L N EL L L +++E
Sbjct: 45 VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
+G KS W PY L +S + WS+ EL L GS +I + +
Sbjct: 97 LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKESADSTFTE 149
Query: 222 -------------EGIKREYNELDTVWFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAV 266
+ R +EL ++ GS Y +D+ P +
Sbjct: 150 QLIPLIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEE 209
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ + L K ++PL L A + A L DD V + +AGE +
Sbjct: 210 EESAISLPK-------GMIPLADMLNANADHNNAKLFYQDDKVVMKTLHAVRAGEELF-- 260
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE 365
+DF GP P S LL YG+V D+ YD VVE
Sbjct: 261 -------NDF--GPLPRSDLLRRYGYVTDQYAKYD--VVE 289
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 110/280 (39%), Gaps = 44/280 (15%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +NG + E S++++ R +H A+ DL+ + VP ++T+E +G
Sbjct: 37 WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89
Query: 144 TIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
T EL L L+++ +++ +F+ PY L P+
Sbjct: 90 TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDIL------PSTLSNMPI 142
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W EL +L GS +I ER IK +Y + +W P I + T E
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTLEE 190
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPY 317
FK A + V S + V+ AR A+VP L + + K A + +
Sbjct: 191 FKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKI 249
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
G I Y G + N + L+NYGF EDN
Sbjct: 250 SVGSQIYDSY-----------GQKCNHRFLLNYGFAIEDN 278
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
A +P +L VT V +E +A L EL LAL+LM E+++G++S W P++ L
Sbjct: 2 ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59
Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYN 229
A SP+LW E + L SPT E R +++E++
Sbjct: 60 PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD 99
Score = 38.5 bits (88), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 90/219 (41%), Gaps = 24/219 (10%)
Query: 297 KCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWYNISFTGHDFKCGP--------Q 341
+C+A AA+ ++ R + G+ + +W + G +
Sbjct: 86 ECRARRAALQQEWDVIAQRIATGDARRFSGGDELKLWITLGSPGWGGTSDKLLMAIYDGR 145
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
PN +L + G V++DN D L V L D + K+ + + G VQ F + R
Sbjct: 146 PNGELAMATGRVEDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR-- 203
Query: 402 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP-- 459
+ +L YLRL ++D + + V S ++P E VL L + RL Y
Sbjct: 204 -MPTQLLAYLRLARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGM 260
Query: 460 --ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
+ ED +L L ++R+A +L + EK +L L
Sbjct: 261 HMGSAEEDVKLLQRPGLTAQERLAARLRKAEKAILQGTL 299
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 92/219 (42%), Gaps = 39/219 (17%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 839 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 884
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 321
+P L + +AML DD ++V DR + GE
Sbjct: 885 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 82/170 (48%), Gaps = 20/170 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
+ A+ +++ GD VP L ++ E + +G++ E L K+ L ++L+ E +
Sbjct: 98 LIANTEIKEGDEFIKVPLKLFMSQETAFKSIGDKVSREPLF--KMLPNMLLVIHLIQETQ 155
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ ++SFW PYIR L + + ++ L ++ E L GSP +LE E I N
Sbjct: 156 KQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE--ESINTYRN 202
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
L F+ F + P + T FT+E F+Q +A +V L K LA
Sbjct: 203 TLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 149/365 (40%), Gaps = 66/365 (18%)
Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+T+ +++ A +AL+L++E Q +KS W P++ L R VE+PLLW+ ELA
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L GS I R +K + D ++ L Q++P P E F E + A V
Sbjct: 242 LEGSNL---IGFRDAVLKGWTTQRDALF---PKLTQKFPSLFPEEHFRTERWAWAMAIVW 295
Query: 268 SCVVHLQKVSLARRFALVPLGP----------PLL-------------------AYSSKC 298
S V + R A+ P G PL
Sbjct: 296 SRAA---DVPVPRPEAIFPSGDDKSRELRVIVPLFDMINHGYDHAPVTPGGVKGGGGEGR 352
Query: 299 KAMLAAVDDAVQLVVD-RPYKAGESIVVWYNISFTGHD----FKCGPQPNSKLLINYGFV 353
+ VDD+ L+ P + + + + F G + F G +P+ +L+ YGFV
Sbjct: 353 EKGGVGVDDSPALIPSWDPSR--RMVAIRAGVPFPGPNYEVRFNYGAKPSQHVLLQYGFV 410
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYL 411
+NP + + V + D + + + R +LS + F + R +D+L
Sbjct: 411 PMNNPDESVEVAMHAGSRD-KLKSLKSELLRTHELSPRERNFQFYPRRLD---ADLLAAT 466
Query: 412 RLGYVS----DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 467
R+ +S +++ S + P+ + RA+L + A L YP TL ED+
Sbjct: 467 RVQMMSELEINSAAAISAAVAGAPVSDRNEAKTRAMLLKAAHDM---LRRYPTTLWEDKK 523
Query: 468 MLTDY 472
++ ++
Sbjct: 524 IVQEH 528
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 70/294 (23%), Positives = 120/294 (40%), Gaps = 39/294 (13%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LKE+ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 29 HKLEFIELRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T T +S L L +L+ E G KS W PY+ L + +
Sbjct: 89 IKS-YLGDYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E ++ L P K + E+ ++ + + SLF + D+ F +
Sbjct: 140 LVCLEPQVVRLLPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTE---DVK-HIFHYH 195
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVV 313
F A+ + + V H QK L+ + L P L L S + AA ++ +
Sbjct: 196 AFLWAWCTINTRTVYMKHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYE 255
Query: 314 DRPY---KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
R K E + + Y GP N +LL+ YGFV +NP+ + V
Sbjct: 256 IRTTSSCKKYEELFICY-----------GPHDNHRLLLEYGFVAINNPHSAVYV 298
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 130/299 (43%), Gaps = 47/299 (15%)
Query: 113 VAASEDLQAGDA------AFSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
+ A DL++ +A ++P+ LV++ E V + +LL + S +
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202
Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
LYL+ + Q + + W YIR L R + P +W+E E L G+ +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + + +E++ L V + +P+ E+ + +V V + +
Sbjct: 256 ALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRC 307
Query: 276 VSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWYNISF 331
+ L R A+VP G ++ +SSK A D D V L++ P ++GE + + Y +
Sbjct: 308 LELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDA- 365
Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
+P S++L +YGF+D +N D+L + +DP + K ++ L++
Sbjct: 366 ---------KPASEMLFSYGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 155/397 (39%), Gaps = 81/397 (20%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A +A L++ K+QG +S P+I +L G P+LW E ++A L A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187
Query: 217 ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ E+ ++ Y +L D W M S + + P T + +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245
Query: 262 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 294
A +A + V+ L QK A +++A+ PL L +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304
Query: 295 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
SS ++ +A D+ +V R +K GE + + Y G Q N L+ YGF
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISY-----------GAQSNDSLMQYYGF 353
Query: 353 VDEDNPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
+ +NP D V+ L T R+ A + L+ + V R S+ L
Sbjct: 354 AEANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQA 412
Query: 411 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 470
+R +D SE + +SS SP E + + +A+ + L ++L ED A+L+
Sbjct: 413 VRFLLAAD-SEAGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLS 469
Query: 471 DYNLHPKKR---------VATQLVRMEKK-MLNACLQ 497
R VA R+EKK +L A LQ
Sbjct: 470 STGASAGGRKGGTAAAAAVAAVAFRVEKKRLLTAVLQ 506
>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
Length = 424
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 79/357 (22%), Positives = 140/357 (39%), Gaps = 64/357 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK---LSELACLALYLMYEKKQG 171
A+ + +G+ +P L+++ + + + + N+ + LAL+L+ E
Sbjct: 43 AAAAVASGEPMLCIPRRLLISEDLCWRDPQLGRVFQDNRDVFTRDDPVLALFLVRELLLA 102
Query: 172 KKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTKAEILER-AEGIKR 226
+SF+ PY LAV ES W++ EL L ER + R
Sbjct: 103 DRSFFHPY------------LAVLPYPESVQDWTQAELGELHD--------ERLVDAAAR 142
Query: 227 EYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
+E+D + Q +YP + P +TF+ FK A+ +Q+ + RR
Sbjct: 143 RTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA-------RTFGRRLPWT 195
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
ALVP L + K D+ + R Y +G + F + G +
Sbjct: 196 ALVPFADCLNHTNVATKYDFDVNDNGLF----RLYPSGATSFAQGAEVFNSY----GRRS 247
Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
N +LL++YGF DN +D + VE + P R KL V R+
Sbjct: 248 NFQLLLDYGFALPDNEWDYVDVEIGKDRAGP----------RGRKLRFMKRVVRIDRQS- 296
Query: 403 AISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
++ ++ P L ++D SE + +S +C + +++ +AD+ A
Sbjct: 297 SLDELFPPSFLAGLADPVPDEEQSEAAAELSERTALCDALEWLRSILIETIADWGTA 353
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 101/415 (24%), Positives = 174/415 (41%), Gaps = 65/415 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + N + + +L+ E LA+ L+YEK
Sbjct: 47 VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G KS W +I L R + L + EL L GS + E + +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159
Query: 229 NELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 282
L + LF+ P D+ E F+ E +K A + S V +A++
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVAKQSFK 214
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWYNISFTGHDFKCG 339
A+VP+ +L + + + M D + Q LV + + AG + + Y G
Sbjct: 215 AMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINY-----------G 261
Query: 340 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVH 396
P N KLL YGFV NP+D VE L ++ +Q+K + NG H
Sbjct: 262 PLSNHKLLALYGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DH 312
Query: 397 AGREKEAISD-----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
A E ++D +L R+ + E + + + +S E+ L +L
Sbjct: 313 ATNPFELVADESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIY 372
Query: 450 YFKARLAGYPATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 498
+ L +P ++ ED+ +L TD NL+ +R+A + R +K +L+ + +
Sbjct: 373 TLEKMLESFPTSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 122/296 (41%), Gaps = 56/296 (18%)
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE------LLTTNKLSELACLALYLM 165
++ A+ED++ + F +P ++ VL + ++E +L + L + ++
Sbjct: 42 FILATEDIKTDELLFEIPRESILN---VLTSSLVSEYPAWENILLDGDVGHWEGLIICML 98
Query: 166 YEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAE 216
+E K K S W PY L L + W+ EL L G+ +
Sbjct: 99 FEIKVKKNMSKWAPYFDVLPESTDLNSL-----MYWTAEELEALKPSLVLDRIGNDGAHQ 153
Query: 217 ILERAEGIKREYNELDTV-----------WFMAGSLFQQYPYDI---PTEAFTFEIFKQA 262
+ E+ + R + + +V + S+ Y +D+ PT A E +
Sbjct: 154 MHEKVMELIRTFEKDHSVDLSFGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEV 213
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V+ V + + +++PL L + ++KC A L +D++++ KAGE
Sbjct: 214 EEDVEQTVRNEGSLK-----SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQ 268
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYD--RLVVEAALNTEDPQY 375
+ YNI G PN+++L YG+V+ E + YD L +E + T QY
Sbjct: 269 V---YNIY--------GNHPNAEILRRYGYVEWEGSKYDFGELPLEVIIETLHEQY 313
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 72/323 (22%), Positives = 134/323 (41%), Gaps = 47/323 (14%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +G + E PS +E + A D+ GD +P++L+++ +
Sbjct: 26 WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81
Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ L T L LALY+M E + ++SF+ PY+R L ES LL
Sbjct: 82 KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ +L T + ++ ++L + RE T+ F++ S YP + +TFE+F
Sbjct: 142 DHKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDF 187
Query: 262 AFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DR 315
A+ +Q+ + +R ALVP L + + K + + +
Sbjct: 188 AWRTIQA-------RAFGKRLKSSALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNN 240
Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTED 372
Y ++ Y G + N LL++YGF DN +D V +L + +
Sbjct: 241 RYPRNSEVLNSY-----------GRRANDNLLLDYGFAMLDNEWDAAEVICSLPPSHDQS 289
Query: 373 PQYQDKRMVAQRNGKLSVQVFHV 395
P + ++ + +G+ +V++ V
Sbjct: 290 PLDRRRKACLRASGQHTVRILRV 312
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 60/275 (21%), Positives = 107/275 (38%), Gaps = 54/275 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET---IAELLTTNKLSEL------ACLALYLM 165
A+ED+ G+ S+P LV+T E +E +A L + L A L YL+
Sbjct: 32 ATEDILPGEELCSIPVRLVLTTEIARKSEVGRLVAAHLNAVQGERLRVSAGRAILCAYLI 91
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
+++ + +FW PY+R L + R + ++ +L G+ + E+ + I+
Sbjct: 92 HQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQIR 139
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ + +L +P P + FT++ F F A S R F
Sbjct: 140 ESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSS-----------RSFPQT 182
Query: 286 PLGPPLLAYSSKCKAM-LAAVDDAVQLVVDRPYKAGESIVVWY------NISFTGHDF-- 336
+ P S+ L +D+ + +D + W + F D
Sbjct: 183 LVQQPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVE 242
Query: 337 -------KCGPQPNSKLLINYGFVDEDNPYDRLVV 364
GP+ N +LL+ YGF EDN D +++
Sbjct: 243 KGTEAFNNYGPKGNEELLMGYGFCIEDNEQDYVMI 277
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V A++ + AG A ++P L++T++ L + E L L E L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555
Query: 171 GKKSFWLPYI 180
G+ S W P++
Sbjct: 556 GQASPWYPFL 565
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 76/373 (20%), Positives = 143/373 (38%), Gaps = 49/373 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++D+ A VP + + E++ E + T + LA L+ EK +G S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L + + L ++ ++ L G+ + L + I R+Y +L
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------FALVPL 287
+ S + +E+++ A V + +L +A + AL+P
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNSPISALIPC 285
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
K + + ++ KAG ++Y G +PN+ LL
Sbjct: 286 WDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYY-----------GDRPNADLL 334
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----E 402
++ GFVD +N D + + L+ D +A++ +L ++ H G + E
Sbjct: 335 VHNGFVDPNNNKDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPE 387
Query: 403 AIS-DMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
IS +L ++R+ +S S+++ + L C + +E FK L
Sbjct: 388 YISGQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLL 447
Query: 456 AGYPATLSEDEAM 468
ATL E + +
Sbjct: 448 GVLEATLREADEL 460
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 108/289 (37%), Gaps = 62/289 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A + G+ FS+P LV++ E N + LL+ + L EL L L ++YE
Sbjct: 50 VVAQSAIVEGEELFSIPRDLVLSTE----NSKLKSLLSQD-LGELGPWLSLMLVMIYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 221
++S W PY R ++ + WS EL L GS +I +
Sbjct: 105 LREQSAWAPYYRIFPEN-------FDTLMFWSPAELQELQGSAIVDKIGRQGAEESILQM 157
Query: 222 ---------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQ 261
+G+ E T + GSL Y +DI + +
Sbjct: 158 IAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHVMGSLIMAYAFDIEKVNDEDDEDNE 217
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
+ Q +VPL L A + + A L +D++ + +P AG+
Sbjct: 218 GEDGYMTDEEEDQSSK-----GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGD 272
Query: 322 SIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 369
I Y G P S LL YG+V D PYD V+EA+L+
Sbjct: 273 EIFNDY-----------GELPRSDLLRRYGYVTDNYAPYD--VIEASLD 308
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 123/322 (38%), Gaps = 66/322 (20%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G+ I E + + R V A++D+ + F +P + ++++E +L
Sbjct: 13 AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A T L L L ++YE G S W PY L + + + WS
Sbjct: 70 TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120
Query: 202 ETELAYLTGSPTKAEI-------------------------------LERAEGIKREYNE 230
E ELA L S +I +RAE ++ E N
Sbjct: 121 EDELAELQASAVLNKIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNV 180
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
L + GSL Y +D+ A + + + A + L K ++PL
Sbjct: 181 L--LMHKMGSLIMAYAFDVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADM 230
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
L A + A L + +++ +P +AGE + Y GP P S LL Y
Sbjct: 231 LNADADCNNARLFYEEKYLEMKALKPIRAGEEVFNDY-----------GPLPRSDLLRRY 279
Query: 351 GFV-DEDNPYDRLVVEAALNTE 371
G+V D YD + + L TE
Sbjct: 280 GYVTDNYAQYDVVEINMDLVTE 301
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 92/421 (21%), Positives = 161/421 (38%), Gaps = 100/421 (23%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL---VVTLE 137
+ +W+ ++G V +K P + + +S++ GD S+P L + L
Sbjct: 4 ITTWVKEHGGHIDGVYVKNFPVYGNG------LCSSKEFHEGDTLLSIPYHLQLNTIELH 57
Query: 138 RVL-----GNET--IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
V G E + E N+ E + + LYL K +K F PYI L
Sbjct: 58 NVFESMVPGFEVPRLGEG-AKNRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT----- 110
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL +SE EL L G+ ++L E K +L + +L QYP
Sbjct: 111 --TFSCPLSYSENELKMLKGT----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP---- 157
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV---PLGP-----PLLAYSSKCKAML 302
T F+ F Q V +V +R F ++ P+G P +S+
Sbjct: 158 TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYPDPIGDVASLIPFADFSNH----- 205
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ--------PNSKLLINYGFVD 354
+ V V +R + +++ CG Q PN K+L+ YGFV
Sbjct: 206 -NTETKVTYVSNRQTQT-------FSLQTNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVI 257
Query: 355 EDNPYDRLVV-----EAALNTEDPQYQDKRM--------------------VAQRNGKLS 389
+NPYD +++ E + + ++ +M + Q +
Sbjct: 258 SENPYDEVLLRINFKERHFEKQVEESEESKMEVENKENERMEVEEEDNEDEITQILKREG 317
Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC-MERAVLDQ 446
V F + REKE +D+L LR+ +S + ++ + L + P++ R++++Q
Sbjct: 318 VDRFDYYLTREKELPTDLLRVLRIVNLSLVEANQYSQALLDLSYVSPINEIKATRSLMEQ 377
Query: 447 L 447
+
Sbjct: 378 I 378
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 94/414 (22%), Positives = 151/414 (36%), Gaps = 64/414 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +SW+ GL ++L+ R + AS L G+ +P+ LV+T ER
Sbjct: 24 EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
G ++ LL LA ++ + W PY+ L ++ G + L
Sbjct: 78 AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPG-------TLLD 124
Query: 200 WSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W E+ L GSP + + EL+ + G P +P E
Sbjct: 125 WPAKEVQQLLRGSPLQRLADSITSAASASWRELEPL-IAQGRADGLVPEHVPLSKGDLEW 183
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---LAYSSKC----------------K 299
AF + S + L S L P L ++ C
Sbjct: 184 ---AFGVLLSRCIRLP--SRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGD 238
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV--DEDN 357
A A+ L DRPY AG+ + V Y GP+ + +LL++YGF N
Sbjct: 239 RAGGATKGALVLRADRPYAAGQQVYVSY-----------GPKSSGELLLSYGFCPPPASN 287
Query: 358 PYDRLVVEAALNTE-DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLG 414
P+ + A++ + DP K R+G S F + E + L +L R
Sbjct: 288 PHQDCRLRVAVDRQGDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPK 347
Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 466
+T E+ SV+ G P+ + V L L++ A L YP ++ D+
Sbjct: 348 VAQETFELASVLFESGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 43/201 (21%), Positives = 96/201 (47%), Gaps = 21/201 (10%)
Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKC----GPQPNSKLLINYGFVDEDNPYDRLVVE 365
Q+ D ++GE +++Y++ FT + + G + +++ L+ GFV NP++ + +
Sbjct: 32 QVTTDFDPESGE--LIFYSMEFTPKNTQILMDYGKRTSAEFLMFSGFVPATNPHNNVRIV 89
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
++ D + + + S + H+ + ++SD + + R+ +V D+ ++ +
Sbjct: 90 LGVSKSDQLSSKREQLLELIALQSPLILHITG--DLSSLSDAIAFARV-FVMDSDQLDAH 146
Query: 426 IS---------SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
+S P+CP P ++A+ L F+ ++ Y +SEDE NL P
Sbjct: 147 LSMTTSALHALRTSPLCPGDPIDDQAIA-FLIMRFELLVSAYGPMVSEDEVGYE--NLTP 203
Query: 477 KKRVATQLVRMEKKMLNACLQ 497
+R +L E ++L +C++
Sbjct: 204 IQRYCERLRVQEVQILRSCIE 224
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 86/411 (20%), Positives = 156/411 (37%), Gaps = 59/411 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D++AG+ SVP L+ + E + + L N + L + L+ EK +G S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEELLPEKQ---RQLFRNFPTHLK-VTYTLIMEKLRGADS 261
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 262 PWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARLYASMYKC 314
Query: 235 WFMA---------GSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
FM +LF Y Y++ A + +Q V Q + ++ A
Sbjct: 315 AFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDAANLPIS---A 371
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
L+P S K + ++ YK+GE ++Y G + N
Sbjct: 372 LIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYY-----------GDRSN 420
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAG 398
+ L++ GFVD NP D + + L+ D + + ++ +R +L V H
Sbjct: 421 ADRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHIS 480
Query: 399 REKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 452
E +L ++R+ +S S+++ + L C + +E L K
Sbjct: 481 GE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLK 534
Query: 453 ARLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
L ATL E EA+ + + Q R+E+++L+ LQ
Sbjct: 535 LLLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 83/403 (20%), Positives = 162/403 (40%), Gaps = 45/403 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P+ L T++ + + L + + LS LA+++++ +
Sbjct: 28 VKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
++S + ++R + S + +++ EL G+ + + I+ +Y +
Sbjct: 87 -RESGYDGLRSHVERLPA----SYSSSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRD 141
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
L + QYP P + FT +K A AV S + Q + L P
Sbjct: 142 LVV------RVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD- 194
Query: 291 LLAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+L +SS +C A+ D + ++ + Y+AG+ + + Y G PN +LL
Sbjct: 195 MLNHSSESKQCHVYDASSGD-LSVLAGKDYEAGDQVYIHY-----------GSIPNHRLL 242
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
YGF+ NP D + A + P ++ K+ + G S + ++D
Sbjct: 243 RLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWALAGLDSTCTISL-------TLTDP 295
Query: 408 LP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
LP YLR+ + D S++ S+ +S E VL L + + L + L
Sbjct: 296 LPKNVIRYLRIQRL-DESDLASIALGQAADEKISNSNEVQVLQSLVESIASLLGSFGTRL 354
Query: 463 SEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
+ E L Y + A + E+++L + D++
Sbjct: 355 EKLEEQLATGVYPVGGNAWAAAHVSLGEQRVLKLAKKKAEDLL 397
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 94/227 (41%), Gaps = 35/227 (15%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+VPL L A S A L + + + + GE I YN P
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQI---YNT--------YADPP 308
Query: 343 NSKLLINYGFVDEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 392
N+ LL YG VD++NP+D RL E +L+ DPQ Q+ + K +++V
Sbjct: 309 NADLLRRYGHVDDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEV 367
Query: 393 FHV--------HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
+ + RE KE + D L + +S E Q+ S G + P M +
Sbjct: 368 SDIDEIFMLPTKSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPI 424
Query: 444 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 490
RL Y T+ D+ +L D +L ++++ + VR+ +K
Sbjct: 425 AQLAIQILSNRLNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 61/282 (21%), Positives = 110/282 (39%), Gaps = 48/282 (17%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + L + + LS LA Y+++ +
Sbjct: 34 FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + ++E EL TG+ A + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
L Q+ P FT E +K A V S + + + L FA
Sbjct: 143 VV------RLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193
Query: 287 LGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
+L +SS + A + + ++ + YKAG+ + ++Y G PN+
Sbjct: 194 ---DMLNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIYY-----------GSIPNN 239
Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
+LL YGF+ NP D + + P ++ K + + G
Sbjct: 240 RLLRLYGFIIPSNPNDNYELVLETHPMAPFFEQKHKLWESAG 281
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 80/197 (40%), Gaps = 32/197 (16%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + +SE EL G+ + + IK +Y +L A LF Q+P P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160
Query: 256 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 302
E +K A V S + + L P +L +SS +C A
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
D + + + Y+ G+ + ++Y GP PN++LL YGFV DNP D
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYY-----------GPIPNNRLLRLYGFVIPDNPNDSY 267
Query: 363 VVEAALNTEDPQYQDKR 379
+ + P Y+ K+
Sbjct: 268 DLVLTTHPMAPFYEQKQ 284
>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
Length = 504
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 90/419 (21%), Positives = 160/419 (38%), Gaps = 76/419 (18%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
AG+ +VP S+ + + EL+ +SE LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168
Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
Y+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCVVHLQKV--------SLARR 281
F Q D + FT+++F+ AV + + KV +
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEFDGMDHT 272
Query: 282 FALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
AL+PL P A ++C A A ++ ++ + R S+ ++
Sbjct: 273 LALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIF------- 325
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
G + +++ L++ GFV NP+ + L P Y+++ + + G + F
Sbjct: 326 --IVYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTF 383
Query: 394 HVHAGREKEAISDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSP 437
RE A + P + L VS + + + + + P
Sbjct: 384 SFGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAEL 443
Query: 438 C--MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 494
C ER LA K L YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 444 CARTER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 497
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 71/330 (21%), Positives = 127/330 (38%), Gaps = 71/330 (21%)
Query: 71 VSKKEEDLGDLKS---WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
++K + L L+S W+H NG+ + +K+ + I ++ + G+
Sbjct: 548 TAEKLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIF---STRQIHKGECLVK 604
Query: 128 VPNSLVVTLERVLGNETIA-----ELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYI 180
+P L+ +L N+T A ++ +N L ++ + + + + ++ S W PY
Sbjct: 605 IP------LKLILSNDTSAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYF 658
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L R P+L S +LA TG+ E++ + R+Y L +
Sbjct: 659 DLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQHI------ 705
Query: 241 LFQQYPY-DIPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL------- 287
F+ P IP FTF F A V + C + + + L+PL
Sbjct: 706 -FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHK 764
Query: 288 -GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
G + +K C +A+ D GE I + Y G + N
Sbjct: 765 PGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHY-----------GKRSNQ 803
Query: 345 KLLINYGFVDEDN-PYDRLVVEAALNTEDP 373
++L+ GFVD N YD + + ++ DP
Sbjct: 804 EMLLYSGFVDPTNIEYDHIKLSVSIPQSDP 833
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 109/274 (39%), Gaps = 57/274 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV------------------LGNETIAELLTTNKL 154
V A E ++AG+ VP ++ +E+ +G++ I + T L
Sbjct: 4 VRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTAAL 63
Query: 155 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
C YE+ W P +R L + S L W+ +L + G
Sbjct: 64 DAFEC-GDASAYEE-------WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLA 108
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHL 273
+ E +K +Y+ L F A L +Q P P AF + F++A+ S + +
Sbjct: 109 NRLREYRRSVKVQYDAL----FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKV 162
Query: 274 QK-VSLARRFALVP----LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYN 328
Q SL R +VP L A+S + ++ A +L + R GE+I + Y
Sbjct: 163 QDPDSLQIREVIVPGVFLCNHSLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISY- 220
Query: 329 ISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
G N+ LL+ YGF E+NPYDR+
Sbjct: 221 ----------GRLDNADLLMFYGFSLENNPYDRV 244
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 39/238 (16%)
Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
S+P S ++T+E +G T I + T+ L A L +Y+++++K G+ SF+ PY
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
+ L + P+ W+ EL L GS +I +RAE IK +Y + ++ G
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICSIAPEFGD 252
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CK 299
+ T E F+ A + V S L ++ R ALVP L + K
Sbjct: 253 I------------ATLEEFQWARMIVCSRNFGLL-INGHRTSALVPHADMLNHLRPRETK 299
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
+ + + + GE + F + KC N + L+NYGF E N
Sbjct: 300 WTFSEESQSFTITTLQEIGMGEQV-------FDSYGQKC----NHRFLLNYGFCVERN 346
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 73/330 (22%), Positives = 136/330 (41%), Gaps = 50/330 (15%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
L+ W+++ G +I P+++ + A++ ++A ++ S+P ++
Sbjct: 37 LRRWLNERGFTSQSLI------------PVNFHDTGRGLMATQTIKAKNSVISLPEECLL 84
Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T VL + +A+ + +S L L +L+ E+ G+ S W PYI L +
Sbjct: 85 TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PL + + + L S K + ++ E + ++ T + LF Q PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190
Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 308
F+ + + A+ +V + V H Q L+R + L P L + C +
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAP-YLDLLNHCPNV------Q 243
Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY-----DRLV 363
V+ ++ + E V F GP N +LL+ YGFV NP+ D
Sbjct: 244 VEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLET 303
Query: 364 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
++ L+ +D Q ++K + + N L F
Sbjct: 304 LKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 60/272 (22%), Positives = 113/272 (41%), Gaps = 42/272 (15%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
LF Q+P P + FT E +K A V S + LA ++ L P
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193
Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+L ++S+ K + + ++ + Y+AG+ + + Y GP PNS+LL
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGDQVFINY-----------GPVPNSRLL 242
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
YGFV NP D + + + + P ++ K+
Sbjct: 243 RLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 59/276 (21%), Positives = 110/276 (39%), Gaps = 38/276 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++L T++ + + L + + L+ LA+Y+++ +
Sbjct: 28 VKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLAVYVLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
RE R +A S + ++E EL G+ + + I+
Sbjct: 87 ----------RESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRIE 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L + P P FT +K A V S + + + L
Sbjct: 137 DDYKDL------IARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMDFELYDGSSMRLLA 190
Query: 286 PLGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P +L +SS+ K + A + ++ + Y+AG+ + + Y G PN
Sbjct: 191 PFAD-MLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHY-----------GSIPN 238
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
S+LL YGFV DNP D + A + P ++ K+
Sbjct: 239 SRLLRLYGFVIPDNPNDSYDLVLATHPMAPFFEQKQ 274
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 46/210 (21%), Positives = 87/210 (41%), Gaps = 24/210 (11%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+++ E+ L + +MYEK +G++S W PY+ + PL W E L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232
Query: 212 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 269
+++ + + ++ +W + Q++P ++P +++++ A AV S
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWY 327
L A+VP+ L + + L A + ++ R GE +V Y
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349
Query: 328 NISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
G N++LL YGFV+ N
Sbjct: 350 -----------GELSNAELLRGYGFVEARN 368
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D W+ K G + E + E R V A +++ G +VP L+++
Sbjct: 5 DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58
Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ AE LL TNK + LA +L++E +G++SFW PY+ L RQ
Sbjct: 59 KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 60/272 (22%), Positives = 112/272 (41%), Gaps = 42/272 (15%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
LF Q+P P + FT E +K A V S + LA ++ L P
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193
Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+L ++S+ K + + + + Y+AG+ + + Y GP PNS+LL
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINY-----------GPVPNSRLL 242
Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
YGFV NP D + + + + P ++ K+
Sbjct: 243 RLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 70/317 (22%), Positives = 120/317 (37%), Gaps = 56/317 (17%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P +LV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N ++ +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
+ W++ EL L+GS +L + E N L + + +P +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSP 181
Query: 251 TE----------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
A+ F+I K + ++ +VPL L A
Sbjct: 182 EGKAALLSLAHRMGSLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSKGMVPLADLLNAD 241
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV- 353
+ + A L D + + +P + GE I Y G P + LL YG+V
Sbjct: 242 TDRNNARLFQEDCYLSMRSIKPIRKGEEIFNDY-----------GELPRADLLRRYGYVT 290
Query: 354 DEDNPYDRLVVEAALNT 370
D YD VE ++ T
Sbjct: 291 DNYAQYDE--VEISMRT 305
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 47.0 bits (110), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 104/474 (21%), Positives = 174/474 (36%), Gaps = 127/474 (26%)
Query: 118 DLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSEL----ACLALYLMYEK 168
D+ G F++P +L ++ L +LG L K EL A L L +M+E+
Sbjct: 37 DIPEGHTLFTLPRNLTLSTRTSALPGLLG-------LDEWKQHELHIGWAGLILCMMWEE 89
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI----------- 217
QG S W Y+ L + ++P+ WS +L L G+ +I
Sbjct: 90 AQGASSRWSTYLASLPS-------SFDTPMFWSPDDLEELKGTSVVDKIGRDGAEEDYRS 142
Query: 218 -----------LERAEGIKREYNELDTVWFMAGSLF------------------------ 242
L E + R Y+ L+ M +
Sbjct: 143 KVVPTLQSRPDLFAPEALSRHYS-LENYHLMGSRILSRSFSVERWEGHAADKQEDSASSP 201
Query: 243 -----QQYPYDIPTEAFTFE-------IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ D+ TEA T + + +FV + A+VP+
Sbjct: 202 VADTGRDEAMDVDTEAVTATAPEAEDGVDEPSFVVDDENDSDDEDEEDPANVAMVPMADM 261
Query: 291 LLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L A Y S+ + +D ++++ +P GE I Y G PNS LL
Sbjct: 262 LNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTY-----------GDPPNSDLLRR 309
Query: 350 YGFVDE--------DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSV 390
YG VD NP D +VE A + + Q A+R G V
Sbjct: 310 YGHVDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDV 367
Query: 391 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 450
+ + E +++P+ RL S + ++ S P V + ++L +A+
Sbjct: 368 FILETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANA 419
Query: 451 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
+ RLA YP ++ ED+ +LT+ +L+ K V +L EK++L+ L + +
Sbjct: 420 LERRLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 13/88 (14%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
R AL+P+ L ++ C + +A + DR Y+AGE + Y G
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSY-----------GA 639
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAAL 368
N LL YGFV DNP+D+L ++ L
Sbjct: 640 HSNDFLLAEYGFVLPDNPWDQLCLDKVL 667
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 84/382 (21%), Positives = 144/382 (37%), Gaps = 76/382 (19%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ +VP+ L A +++ A + + + KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDK 378
G+ I+ +Y GP P S+LL YG+V E + YD + V L E
Sbjct: 263 GDQILNYY-----------GPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE------- 304
Query: 379 RMVAQRNGKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSE--------MQSVISSL 429
V + LS + + V + + E I D R D+ E +V+ +
Sbjct: 305 --VIVSSLSLSAEAWKQVESQIDDEEIEDYFVIER-----DSGEPGPDGRFTAPAVLREV 357
Query: 430 GP---------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLT 470
P + V + D+ +A+ K RLA YP ++ DE +L
Sbjct: 358 SPELVEQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLA 417
Query: 471 DYNLHPKKRVATQLVRMEKKML 492
+ +L ++R+A + EKK+L
Sbjct: 418 EADLPARRRMAVVVRLGEKKLL 439
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 48/217 (22%), Positives = 83/217 (38%), Gaps = 46/217 (21%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R S L + + +L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
++ EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268
Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
H+ GP N +L+ YGF DNP D
Sbjct: 269 ----------HN-NYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 82/401 (20%), Positives = 150/401 (37%), Gaps = 52/401 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 34 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF Q+ P + F+ E +K A V S + Q L P +
Sbjct: 143 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 195
Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +SS+ K + + ++ + Y+ G+ + + Y G PNS+LL
Sbjct: 196 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINY-----------GSVPNSRLLRL 244
Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
YGFV NP D + + + + P Y+ K + G S + ++D LP
Sbjct: 245 YGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLP 297
Query: 410 -----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
YLR+ + + + VS E +L L + F L G+ L +
Sbjct: 298 KNVLRYLRIQRADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEK 357
Query: 465 DEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
E L Y+ A + E+++L + D++
Sbjct: 358 LEEQLAQGVYSPGGNAWAAAHVSLGEQRVLRLAKKRAEDLL 398
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 46.6 bits (109), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 77/365 (21%), Positives = 137/365 (37%), Gaps = 50/365 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 43 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 95
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 96 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 151
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF Q+ P + F+ E +K A V S + Q L P +
Sbjct: 152 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 204
Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +SS+ K + + ++ + Y+ G+ + + Y G PNS+LL
Sbjct: 205 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINY-----------GSVPNSRLLRL 253
Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
YGFV NP D + + + + P Y+ K + G S + ++D LP
Sbjct: 254 YGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLP 306
Query: 410 -----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
YLR+ + + + VS E +L L + F L G+ L +
Sbjct: 307 KNVLRYLRIQRADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEK 366
Query: 465 DEAML 469
E L
Sbjct: 367 LEEQL 371
>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
Length = 484
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ G +P S + + N +IA LL +++ + L + +YE K
Sbjct: 40 VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W PY++ + G L V P WSETE L GS
Sbjct: 96 EKSHWFPYLKSIRIYNDDG-LLVLPPSHWSETEKLLLKGS 134
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 5 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 58
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 59 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 114
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 115 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 162
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 163 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 205
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 63/293 (21%), Positives = 124/293 (42%), Gaps = 57/293 (19%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWL 177
++ G +P ++++ + VL + + + +L+ + + +L+Y+K G+ SFW
Sbjct: 65 IKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQAITTFLIYQKHIGETSFWK 124
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + P+ + E + YL S +A I + + + Y EL +
Sbjct: 125 PYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQECIKSYEELKPFFPS 176
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF--------ALVPL 287
L + FTF+ ++ A+ V +S V + ++ R +LVP+
Sbjct: 177 LEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTSLVPM 231
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ------ 341
LL +S + L++ + K G+ +Y ++ D+K G Q
Sbjct: 232 -VDLLNHSHSART---------GLLIKKSCKNGD---YFYTVT-AEDDYKRGDQVLFCYR 277
Query: 342 --PNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQDKRMV 381
N LL+NYGFV DN D + ++E EDP+++ ++++
Sbjct: 278 RADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVL 330
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 115/292 (39%), Gaps = 70/292 (23%)
Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
V A+E++ GD FS+P + ++T + T + EL ++ + + L +L
Sbjct: 45 VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
E+++G S W PYIREL +PL WS E L G+ R G
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT--------RLGGA 149
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
+ F +L Q +P AF + Q ++ + + +SLA+
Sbjct: 150 TK---------FHDCALLQLTEVCVP--AFIAILRAQLILSANTKAIASGAISLAQD--- 195
Query: 285 VPLGPPLLAYSSKCKA---------------------MLAAVDDAVQLVVDRPYKAGESI 323
L P LA+S C + ML DA Q+ AG+ +
Sbjct: 196 -ALSPDRLAWSHSCVSSRAFSLFLNGQRTIALVPLGDMLDHSPDA-QIEWRTDDTAGQFL 253
Query: 324 VVWYN------ISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
++ ++ I F + G + N +L++ YGF + + + L V A++
Sbjct: 254 IISHDRLPAGSIMFNNY----GAKSNEELILGYGFFMKSSVLETLYVRLAVD 301
>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
Length = 607
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 59/238 (24%), Positives = 95/238 (39%), Gaps = 41/238 (17%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+A + +L+ A LAL++++E +S Y+ L G+ +V PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+A L G+PT +L RA+ + + L +P E F + A
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAAL---------FGSGGGGGVPLEKFAW-----ALS 239
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG-ESI 323
+V S ++ +A +P G LL + VD +L + G E
Sbjct: 240 SVLSRAASGDRMP----YAFLP-GVDLLNHGG--------VDANCELSAVKLAPGGNEEN 286
Query: 324 VVWYNISFTG-HDFKCGPQ---------PNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
V W ++ T D G Q N +LL YGF N +DR +E L +
Sbjct: 287 VTWGDVEVTCVKDTPAGEQLTISYGDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344
>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
Length = 386
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 108/254 (42%), Gaps = 22/254 (8%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSF 175
+++ GD S+P S ++T VL N + + + K LS L L ++L+ E+ +G+ S
Sbjct: 65 NVKPGDMLISLPESCLLTTSTVL-NSYLGSFIKSWKPHLSPLLALCVFLVCERHRGEASD 123
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L + + P +++ +A L S + + E+ E + RE + + +
Sbjct: 124 WFPYIDVLPK-------SYTCPAYFTDEVMALLPPS-VQRKAREQREAV-REIHSSNKAF 174
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAY 294
F + P + + T+E + A+ +V + V L + R V P L
Sbjct: 175 FRSLQPVLTQPAE---DVLTYEALRWAWCSVNTRSVFMLHSSNDFLRGQDVYALAPFLDL 231
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
+ C D V+ + K E V + + G N +L++ YGFV
Sbjct: 232 LNHCP------DVQVKASFNEETKCYEIRSVSRMLQYQQAFINYGSHDNQRLMLEYGFVA 285
Query: 355 EDNPYDRLVVEAAL 368
NP+ + V+ L
Sbjct: 286 PCNPHSVVYVDKDL 299
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 46.2 bits (108), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 108/266 (40%), Gaps = 23/266 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + LS L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 297
LF + I F++ F A+ V + V+++ RR P A +
Sbjct: 180 LRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYMK----PRRRRCFSAEPDTYALAPY 231
Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD--FKC-GPQPNSKLLINYGFVD 354
+ + D V+ + + E V S H+ F C GP N +LL+ YGFV
Sbjct: 232 LDLLNHSPDVQVRAGFNEETRCYEIRTV---SSCRKHEEVFICYGPHDNQRLLLEYGFVS 288
Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRM 380
NP+ + V + + DK+M
Sbjct: 289 IHNPHACVYVSKDILVKYLPSTDKQM 314
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 70/154 (45%), Gaps = 17/154 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + G+ VP L++ + L ++ LL +L + C+ L LM E S
Sbjct: 32 ASKRISPGETFLKVPRHLLLGPHQ-LRASSLDRLLEGWQLPD--CMLLLLMCESVN-SSS 87
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F+ PY+ L V++P+ WS+ E L GSP ++ + R + E
Sbjct: 88 FFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE---- 136
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
M +F +YP P F++E ++ A+ ++S
Sbjct: 137 --MKDKVFDKYPDRFPPLLFSYERYQWAYSILRS 168
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 75/192 (39%), Gaps = 33/192 (17%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PY+R L R + + + W EL + S E +ER E +E++ +
Sbjct: 38 KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF + E F A S +V + +R +L+P
Sbjct: 93 KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFA-DF 137
Query: 292 LAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L + ++L D +++ DR Y GE ++V Y G N+ L +N
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRY-----------GKYSNAMLALN 186
Query: 350 YGFVDEDNPYDR 361
+GF N YD+
Sbjct: 187 FGFTLPRNIYDQ 198
>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 59/267 (22%), Positives = 109/267 (40%), Gaps = 39/267 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++DL+ GD A +P S++++ E V ++ L + ++ L L+ M E+
Sbjct: 149 ATKDLKVGDIALEIPVSIIISEEHVHKSDMYHILEKIDGITSETMLLLWSMKERHNCSSK 208
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L + + L G+ EI++ E ++ +Y+EL
Sbjct: 209 FKI-YFDTLPEEFKTG-------LSFGVDAIMALDGTLLLEEIMQAKEHLRVQYDEL--- 257
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
L + YP E +T+E F A S + + V R L+P+
Sbjct: 258 ---VPPLCKNYPDVFLPELYTWEQFLWACELWYSNSMKVMFVDGKLRTCLIPIAGFLNHS 314
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
P ++ Y + + + ++ + RP GE + Y G +S L
Sbjct: 315 LYPHIVHYGK-----VDSATNTLKFPLTRPCCFGEQCCLSY-----------GNFSSSHL 358
Query: 347 LINYGFVDE-DNPYDRLVVEAALNTED 372
+ YGF+ + DNP D + ++ + D
Sbjct: 359 ITFYGFMPQGDNPCDVIPLDIDVGDAD 385
>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 674
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 89/212 (41%), Gaps = 34/212 (16%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 94 FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 272
K +Y T AG QY +D+ + AF+ ++ +A V+ +
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
VS+ L+P L+ + K A V VV +G+ I Y
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNY----- 254
Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
GP+ N +L++NYGF DNP D +V
Sbjct: 255 ------GPRNNEQLMMNYGFCLPDNPCDYRIV 280
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 45.8 bits (107), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 119/313 (38%), Gaps = 50/313 (15%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
A VS L L +W+ GL ++ +P N+ Y+ AS ++A
Sbjct: 156 AAAQLAAVSTPRGALARLTAWIDNAGL---EINSNARPGLNDVDE--LYLFASNPIEAAT 210
Query: 124 AAFSVPNSLVV--TLERVLGNETIAELLTTNKLSELA------CLALYLMYEKKQGKKSF 175
+VP LV+ T R L N I L + ++ LA+ L+YE + KS
Sbjct: 211 LVATVPAPLVMFETYLRTLENPMI--LAIDRRFKTMSVPDPSYALAMALLYESYE-PKSM 267
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDT 233
W +I L + ++S + WS E L P K +ILER +++ YN
Sbjct: 268 WREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA--- 315
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLL 292
L +P+ +++E+FK A++ V S + + L PL L
Sbjct: 316 ---TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLH 372
Query: 293 AYSSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
+ L + V L R K GE +V G PN +LL
Sbjct: 373 HDPVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLV-----------RHIGELPNHQLL 421
Query: 348 INYGFVDEDNPYD 360
+ +G NPY+
Sbjct: 422 LRFGLAMPRNPYE 434
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 45.8 bits (107), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 91/226 (40%), Gaps = 30/226 (13%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+VP+ L A A L D +Q++ +P GE I Y G P
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTY-----------GDPP 1108
Query: 343 NSKLLINYGFVDE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQV 392
NS LL YG+VD +P D + + A E + YQD+ G V
Sbjct: 1109 NSDLLRRYGYVDALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFV 1168
Query: 393 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 452
V ++ D + L + + + + S P P ++ + L +
Sbjct: 1169 LDV-----TYSVPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQ 1220
Query: 453 ARLAGYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 497
RLA YP +L+E E ML N L+ K+R A + E+++L+ L+
Sbjct: 1221 KRLAMYPISLTEQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266
>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
Length = 440
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 74/328 (22%), Positives = 128/328 (39%), Gaps = 44/328 (13%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F+ SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFT----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDTTLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 HFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
E+ + ++ + LF + I F++ + A+ AV + V++++
Sbjct: 160 WEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMKRPP 215
Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
+ L+ L P LL +S + A ++ + + G+ V
Sbjct: 216 LLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEV------- 268
Query: 333 GHDFKC-GPQPNSKLLINYGFVDEDNPY 359
F C GP N +LL+ YGFV NP+
Sbjct: 269 ---FICYGPHDNHRLLLEYGFVCVSNPH 293
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
troglodytes]
gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
Length = 307
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 102/261 (39%), Gaps = 33/261 (12%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S
Sbjct: 67 SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
LF + I F++ A+ V + V+L Q+ L+ L P
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233
Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
LL +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFL 282
Query: 349 NYGFVDEDNPYDRLVVEAALN 369
YGFV NP+ + V N
Sbjct: 283 EYGFVSVHNPHACVYVSRGWN 303
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
dendrobatidis JAM81]
Length = 607
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L LK W +N + + ++ + N R V A + L+ GD ++P +++++
Sbjct: 4 LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N +A++L L L + LM+E+ G+KS W YI+ L L P
Sbjct: 60 ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108
Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
L W + + A L G+ A +LE + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++ LQ G+ +P L ++ + ++ L N+L + +AL+LM E+ +
Sbjct: 39 VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
SF+ P+I+ L Q + P+ W++++ A L G+
Sbjct: 99 DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEGT 130
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 85/197 (43%), Gaps = 19/197 (9%)
Query: 339 GPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 396
G ++KLL +YGFV +E N ++ + + + DP ++ K+ + + N Q +
Sbjct: 247 GSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAATRDQTYDFR 306
Query: 397 AGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
+ + + L R+ +S + E + S VS E AV + D +
Sbjct: 307 GTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVRNELAVYAAIHDVCEK 362
Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACLQVTADMIMLL----- 506
RLA +P +L +D L + ++ R + VRME KK+L + ++ + LL
Sbjct: 363 RLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEWRNLLENDSN 422
Query: 507 ---PDVTVSPCPAPYAP 520
PDVT P P
Sbjct: 423 IYPPDVTRQQQPQLSMP 439
>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
Length = 439
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 73/328 (22%), Positives = 127/328 (38%), Gaps = 44/328 (13%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFR----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDATLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 RFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
E+ ++ ++ + LF + I F++ + A+ V + V++++
Sbjct: 160 WEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMKRPP 215
Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
+ L+ L P LL +S + A ++ + + G+ V
Sbjct: 216 QLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEV------- 268
Query: 333 GHDFKC-GPQPNSKLLINYGFVDEDNPY 359
F C GP N +LL+ YGFV NP+
Sbjct: 269 ---FICYGPHDNHRLLLEYGFVSVSNPH 293
>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 701
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + G+ V + P R A+ D+ GD SVP ++T E + +
Sbjct: 37 WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90
Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ + L +++S E L L ++ E +G +S + PY+ L R A
Sbjct: 91 ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145
Query: 196 SPLLWSETELAYLTGS 211
SPL WS ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161
>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 45.8 bits (107), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 62/270 (22%), Positives = 105/270 (38%), Gaps = 42/270 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS------ELACLALYLMY 166
V A+ DL+ G+ SVP+ V+T++ + + E + + L + +M
Sbjct: 57 VRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEFVGDGDDEAPSPRLDKELLVIAVMC 116
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------PTKAEILE 219
E GK S W Y+ + G S L W + ++ L G+ E L+
Sbjct: 117 EMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDETLD 172
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKVSL 278
+ + + T++F LF + + EA A VA S + ++
Sbjct: 173 LPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEIQ- 228
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWYNISFTGH 334
A+VP +L ++ C+A + D +Q++ R K GE + Y
Sbjct: 229 ----AMVPFW-DMLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTY------- 276
Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
GP N++LL YGFV NP+ V
Sbjct: 277 ----GPLRNAELLRRYGFVLPRNPHGGTTV 302
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 63/285 (22%), Positives = 111/285 (38%), Gaps = 46/285 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 66 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L WS E L S T+ + + +Y W + +L + + ++F
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 231
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
F+ A V S + R A PLL +++ + +
Sbjct: 232 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 282
Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
++++ DR G+ IV+ Y GP+ + L ++GFV
Sbjct: 283 LKVIADRDTDKGQEIVISY-----------GPKSGIEFLEDHGFV 316
>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
Length = 307
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 102/261 (39%), Gaps = 33/261 (12%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S
Sbjct: 67 SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
LF + I F++ A+ V + V+L Q+ L+ L P
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233
Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
LL +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFL 282
Query: 349 NYGFVDEDNPYDRLVVEAALN 369
YGFV NP+ + V N
Sbjct: 283 EYGFVSVHNPHACVYVSRGWN 303
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 45.4 bits (106), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 45/267 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-------------LGNETIAELLTTNKLSELAC 159
A +D+ G+ ++P+ +++ ERV + +++I+ LSE
Sbjct: 75 TTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNK 134
Query: 160 LALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
+ +Y MY K K+SFW PY + + SP+ WSE EL L GS
Sbjct: 135 VLMYFLLQMYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNM 185
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ ++ Y+EL +F+ +AFT + + A S V+ L +
Sbjct: 186 ARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQ 239
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTG 333
+ +PL + S+ K + A + DR G + Y
Sbjct: 240 TGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESY------ 293
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYD 360
G + N + L+ GFV EDNP D
Sbjct: 294 -----GNKSNYEYLLYNGFVMEDNPND 315
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 45.4 bits (106), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 63/285 (22%), Positives = 111/285 (38%), Gaps = 46/285 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 60 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L WS E L S T+ + + +Y W + +L + + ++F
Sbjct: 172 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 225
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
F+ A V S + R A PLL +++ + +
Sbjct: 226 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 276
Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
++++ DR G+ IV+ Y GP+ + L ++GFV
Sbjct: 277 LKVIADRDTDKGQEIVISY-----------GPKSGIEFLEDHGFV 310
>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
Length = 475
Score = 45.4 bits (106), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 120/301 (39%), Gaps = 58/301 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
K S W PY+ L + ++ E E L ++ E ++ +
Sbjct: 115 KTSRWHPYLMHLPH-------TYDVLAMFGEFEKHAL-------QVDEAMWVTEKAMLKA 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ W A SL Q + + FTF+ + A + S +H + L P+G L
Sbjct: 161 KSEWKEAHSLMQDLMF--KPQFFTFKAWVWAAATISSRTLH---IPWDEAGCLCPVG-DL 214
Query: 292 LAYSSKC--KAMLAAVDDAVQL---------------------VVDRPYKAGESIVVWYN 328
Y + + + +D A QL YK G+ +++ Y
Sbjct: 215 FNYDAPGIEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCY- 273
Query: 329 ISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNG 386
G N +LL +YGF+ ++NP D++ + E AL + + + + NG
Sbjct: 274 ----------GTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSS-TSWSKESLYIHHNG 322
Query: 387 K 387
K
Sbjct: 323 K 323
>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
Length = 468
Score = 45.4 bits (106), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 81/210 (38%), Gaps = 40/210 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E + K S+W PYI L + A+ P +W E ++ L + + E
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
IK+EY L V F + Q Y FT F+ + + Q H+
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217
Query: 278 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
L P G PLL +S+ + L VD QL+ + Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNL-EVDGTCQLICNNAYQPGQQV 272
Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
Y G + NS+LL+ YGF+
Sbjct: 273 FNNY-----------GLKSNSELLLGYGFI 291
>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
Length = 766
Score = 45.4 bits (106), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 67/164 (40%), Gaps = 23/164 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
V A ED++ +A VP L++T+E + I + NK E L +++
Sbjct: 27 VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE ++G +SFW PY+ L VE WS++++ + E I
Sbjct: 87 LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+ E+ E++ L YP + +F + F V S
Sbjct: 139 EIEWKEIEL-------LLLHYPQHFSSALIDKHLFMRIFAQVCS 175
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 123/319 (38%), Gaps = 62/319 (19%)
Query: 81 LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
+ WM NG+ C++ +PS N ++ A ++ Q P L +T
Sbjct: 18 FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71
Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
VL + + +L+ ++ + + L+L+ E+ +G+ SFW PY+ L + G
Sbjct: 72 VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125
Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+PL +SE EL+ L G+ +A + G+ LD A S+F +IP
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDR----ANSVFWTRALNIPCP- 179
Query: 254 FTFEIFKQAFVAVQSCVVH----------LQKVSLARRFALVPLGPPLLAYSSKCKAM-- 301
F F H V + + L P + + KA+
Sbjct: 180 ---HSFNNRFAVDLDSTTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVAL 236
Query: 302 ---------LAAVDDAVQLV--VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
+ V +++ LV +D G I + Y G + N +LL Y
Sbjct: 237 WEVDGPEGSVTGVPNSMYLVTGLDVVISNGSEIFISY-----------GNKSNEELLYLY 285
Query: 351 GFVDEDNPYDRLVVEAALN 369
GFV +NP D L+V + +
Sbjct: 286 GFVLVENPDDYLMVRSTIG 304
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 45.1 bits (105), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 112/281 (39%), Gaps = 46/281 (16%)
Query: 84 WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
W+ K+G L V+ P E+ + A++ ++ G + ++P SL +T L
Sbjct: 18 WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71
Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ IA+ + E +AL +++E+ QG+ S P+I L ++ G+L E PL
Sbjct: 72 SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W E +L S T+ G + +E D W ++ + F ++P P + F F
Sbjct: 127 WGEADLTLADASSTRG-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDF 179
Query: 260 KQAFVAVQSCVVHLQK-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ A S + V A +L + P + + + AV L
Sbjct: 180 RWAVGVALSRSFFVDGELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLR 233
Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
+ Y+ GE V Y GP+ + L GFV
Sbjct: 234 AGKNYEEGEEFFVSY-----------GPKGAAGYLEENGFV 263
>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
gorilla]
Length = 440
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 114/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G++S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGRRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 63/151 (41%), Gaps = 17/151 (11%)
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
LE E +++ E W + F+ D+P E +T+ + + K
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
R AL+P+ CK +A D +V DR YK GE + IS++ H
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELF----ISYSSHS-- 220
Query: 338 CGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
N +L+ YGF+ +++ D + ++ A+
Sbjct: 221 -----NDYILLEYGFIPDESLDDDVYIDDAV 246
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 72/330 (21%), Positives = 135/330 (40%), Gaps = 50/330 (15%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
L+ W+++ G +I P+++ + +++ ++A ++ S+P ++
Sbjct: 37 LRRWLNERGFTSQSLI------------PVNFHGNGRGLMSTQTIKAKNSLISLPEECLL 84
Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T VL + +A+ + +S L L +L+ E+ G+ S W PYI L +
Sbjct: 85 TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PL + + + L S K ++ E + ++ T + LF Q PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQKKAT-QQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190
Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 308
F+ + + A+ +V + V H Q L+R + L P L + C +
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAP-YLDLLNHCPNV------Q 243
Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY-----DRLV 363
V+ ++ + E V F GP N +LL+ YGFV NP+ D
Sbjct: 244 VEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLET 303
Query: 364 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
++ L+ +D Q ++K + + N L F
Sbjct: 304 LKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333
>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
Length = 442
Score = 45.1 bits (105), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 80/206 (38%), Gaps = 31/206 (15%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +GK SFW PY+ L + P W E ++AYL + I E
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 273
+K E+ + L F + + Y FT F+ + + + H+
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230
Query: 274 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWY 327
Q V L F+++ PLL ++ + + D DA LV Y G +
Sbjct: 231 PQSVEL-DDFSIL---QPLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQV---- 282
Query: 328 NISFTGHDFKCGPQPNSKLLINYGFV 353
F + K NS+LL+ YGF+
Sbjct: 283 ---FNNYGLKT----NSELLLGYGFI 301
>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
SB210]
Length = 726
Score = 45.1 bits (105), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 31/188 (16%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLM 165
AA++D+ A S+PN ++++ +R +E +E L + K ++ A L ++ M
Sbjct: 67 AATKDIAPLTAFISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFM 126
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
YE+ +GKKS W Y L+ E+ L W+ E+ + + +
Sbjct: 127 YERLKGKKSLWHEYFEILENN--------ETILTWTAEEINRIPDPYIQKQA-------- 170
Query: 226 REYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 283
REY E +D +W L P T E+F A+ V S C + QK + +
Sbjct: 171 REYKEQVDELWDELKELLHSQPNFFQKATATKELFLWAYNIVMSRCFGYTQKGT-----S 225
Query: 284 LVPLGPPL 291
+VP L
Sbjct: 226 IVPFADCL 233
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 45.1 bits (105), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 107/230 (46%), Gaps = 46/230 (20%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
+S LAL+L+ ++ G KS W ++ L DR+ G ++ PL WS+ + LT P
Sbjct: 79 MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132
Query: 213 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 270
+L++ + + +Y++ T F+A +Y D P +A+ + A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179
Query: 271 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 318
++ ++ A + L P L+ +S + C+ +++ + L R Y
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238
Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
A E I + Y GP+ NS LL YGF +NP+D + + AL
Sbjct: 239 ADEEIFLCY-----------GPRSNSVLLCEYGFTVPENPWDDVDISDAL 277
>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
Length = 748
Score = 45.1 bits (105), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP +L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G++SF+ PYI+ + + + W++ EL+ + E +E +
Sbjct: 106 FNEKKKGEQSFFYPYIQAIQTNN--------TLIDWTKEELSQIEDPIVLDEFAIVSEDL 157
Query: 225 KREYNELDTVWFMAGSLFQQY 245
K +W A +F ++
Sbjct: 158 K-------VLWNYAQDIFNEF 171
>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 45.1 bits (105), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 98/234 (41%), Gaps = 38/234 (16%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+VP+ L A A L + +++V +P +AGE I Y G P
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTY-----------GDPP 305
Query: 343 NSKLLINYGFVD----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKL 388
NS LL YG VD NP D + V A L ++ +Y + V +
Sbjct: 306 NSDLLRRYGHVDVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEA 365
Query: 389 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 448
VF + + E +++ + RL ++S+ +++ S P V P VL
Sbjct: 366 EDDVFILRT--DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAI 419
Query: 449 DYFKARLAGYPATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 497
D ARL YP ++ EDE +L+ +L+ K V +L EK++L L+
Sbjct: 420 DVLSARLKEYPTSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 49/212 (23%), Positives = 85/212 (40%), Gaps = 31/212 (14%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
PL W++ EL +L G+ I ER ++ ++ + V L ++ P P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298
Query: 257 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
E + AF S L+ V+ +G P+ C L + D +
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPM---KEPCVECLLPLLDMMN----- 350
Query: 316 PYKAGESIVVWYNIS-----FTGHDFK--------CGPQPNSKLLINYGFVDEDNPYDRL 362
++ G SI W+ FTG + GP+ N +LL+ YGF +N D +
Sbjct: 351 -HQFGASI-TWFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHV 408
Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
++ + DP + K + + +G H
Sbjct: 409 KIQLTVGN-DPDGEAKLAILRWHGLSLTHFLH 439
>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
Length = 479
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 14/126 (11%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +PN L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W PY R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEI 217
+I
Sbjct: 135 AVVGKI 140
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 45.1 bits (105), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
Length = 559
Score = 45.1 bits (105), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 63/267 (23%), Positives = 106/267 (39%), Gaps = 39/267 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A EDL GD +P +++++ E V + L + + L+ M EK
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMMLLWSMKEKHIVDSE 248
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L A + L + + L G+ E+++ E ++++YNEL
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
F A L +P P E +++E F A S + + R LVP
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
L P +L Y + + D+++ + RP +AGE + Y G S L
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSY-----------GNYSGSHL 398
Query: 347 LINYGFVDE-DNPYDRLVVEAALNTED 372
+ YGF+ E DN D + ++ +D
Sbjct: 399 VTFYGFLPEGDNVNDVIPLDIDFGDDD 425
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 110/265 (41%), Gaps = 49/265 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV--LGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
A+ D Q GD +P L++ R L N A + L + +AL++ ++K
Sbjct: 49 ATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIV----ALKQHPSIALFIAWQKIHPT 104
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
W PYI L R L ++ LL L Y +I E A K + ++LD
Sbjct: 105 PE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLD 149
Query: 233 TVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RFAL 284
T + + Y+ IP + IFK A++ V + + + ++++ + +
Sbjct: 150 TDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPI 204
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
+ L P L + A ++A D V+ + PYK G + + Y GP
Sbjct: 205 ITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINY-----------GP 253
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVE 365
N+ LL YGF NP++ +V++
Sbjct: 254 HDNNFLLAEYGFAILKNPFNHVVLD 278
>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
Length = 805
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)
Query: 69 EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
E +K+ EDL + W+ +NG+ KV L+ H + + A + ++ + F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMG----NSLYARQMIKEDELLFRI 595
Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YE-------- 167
P + + V + T+ ++ ++ E L+L LM YE
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655
Query: 168 ---KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
K + SFWLPYI+ L + +P+ W+E E L GS + +
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708
Query: 225 KREYNELDTV 234
+RE+ + V
Sbjct: 709 RREWEMMKIV 718
>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
Length = 440
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 63/292 (21%), Positives = 114/292 (39%), Gaps = 44/292 (15%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A ++ S L L +L+ EK G +S W
Sbjct: 68 LREGQMIISLPGSCLLTTDTVIRSSLGAYIIKWKPPPSPLLALCTFLVSEKHAGDQSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + + P+ E E+ L P KA+ E+ +++ + +
Sbjct: 128 PYLDILPK-------SYTCPVC-LEPEVVNLLPEPLKAKAEEQRMSVQQFFASSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS- 296
LF++ + F++ A+ V + V+L+ RR + L P A +
Sbjct: 180 LQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLSLEPDTCALAPY 231
Query: 297 ----------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
+ KA ++ Y+ + + + Y GP N +L
Sbjct: 232 LDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFICY-----------GPHDNHRL 280
Query: 347 LINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
L+ YGFV NP+ R ++ L + D Q K + + +G L F
Sbjct: 281 LLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLENLTF 332
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++AE L +L L + +M+E+ G+ S W Y L RG+ + P+ W+
Sbjct: 38 SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
+L +L G+ + E AE ++ ++NE + L +P P T E + +A
Sbjct: 93 QLEHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 93/232 (40%), Gaps = 34/232 (14%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+VP+ L A A L + +++V +P KAGE I Y G P
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTY-----------GDPP 312
Query: 343 NSKLLINYGFVDE----------DNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KL 388
NS LL YG VD NP D + V A L + K + +R +
Sbjct: 313 NSDLLRRYGHVDLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEA 372
Query: 389 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 448
VF + + E +++ ++RL + ++ S P P +++ VL
Sbjct: 373 DDDVFVLRT--DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAV 426
Query: 449 DYFKARLAGYPATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 497
D + RL YP TL EDEA+ L KR A + EK++L L+
Sbjct: 427 DVLEKRLKDYPTTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 80/195 (41%), Gaps = 36/195 (18%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S W PYI L G +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 74 QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ LF + + E F + V S + +++ LV + P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172
Query: 292 LAYSSKCKAMLAAVDDAVQL---VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
L + + A A + L VV E+ +W N CG N++L +
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWIN---------CGDLSNAELAL 223
Query: 349 NYGFVDEDNPYDRLV 363
+YGF P +RL+
Sbjct: 224 DYGFT---VPENRLI 235
>gi|402862437|ref|XP_003895567.1| PREDICTED: SET domain-containing protein 4 [Papio anubis]
Length = 456
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 67/305 (21%), Positives = 113/305 (37%), Gaps = 54/305 (17%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTADTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARR--------FALV 285
LF + I F++ A+ V + V+L Q+ L+ L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPACLC 234
Query: 286 PLGP------------PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
P P PL S + KA + ++ ++ E + + Y
Sbjct: 235 PFSPLCSSGLQNLMLHPLSGLSQQVKAAFNEETHSYEIRTTSRWRKHEEVFICY------ 288
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKL 388
GP N +L + YGFV NP+ R ++ L + D Q K + + +G +
Sbjct: 289 -----GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYI 343
Query: 389 SVQVF 393
F
Sbjct: 344 ENLTF 348
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 53/241 (21%), Positives = 102/241 (42%), Gaps = 47/241 (19%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
A++L+ + G++SFW PYI+ L + A+ PLLW E++L +L G+ + + +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ + +KR W A Q+Y +D P++ FT E+ A+ S
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYS----------- 207
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAA---VDDAVQLV--VDRPYKAGESIVVWYNIS---- 330
R F + L P + + + ++ A +DD +++ + + YN+
Sbjct: 208 RYFWSIILEPDVANIKPEFQHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGL 267
Query: 331 --------------FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 376
+ +D + N+ LL ++GF+ DN LV+ + + P +
Sbjct: 268 SVSKNIELKPGDPFYIAYDKETQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHL 327
Query: 377 D 377
D
Sbjct: 328 D 328
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 143/365 (39%), Gaps = 44/365 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V ++ ++ + SVP ++ ++ + + +L L++ L L+++YEK +
Sbjct: 1212 VVTTKKVEENECVVSVPRKFLINVDCARKHPVLNSILFEEATGLNDDTILFLFVIYEK-E 1270
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SFW P+ L + + ++ TEL L G+ + E IK
Sbjct: 1271 NPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQIKEHLES 1319
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ + F L QYP P FT E F A S + L K+ LVP
Sbjct: 1320 IRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCLVP---- 1372
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES----IVVWYNISFTGHDF-KCGPQPNSK 345
M ++ Q + + Y E+ ++ NI T F + G + +
Sbjct: 1373 ----------MADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWE 1422
Query: 346 LLINYGFVDEDNPYDRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
L + YGFV +N YD + + + ED P+ ++++ L+V ++H
Sbjct: 1423 LALYYGFVISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN---IP 1479
Query: 405 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
S +L LR+ +++ E + PI S E VL L L + +T E
Sbjct: 1480 SKLLASLRVALLAE-DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSSTCDE 1535
Query: 465 DEAML 469
D+ +L
Sbjct: 1536 DQQLL 1540
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 72/328 (21%), Positives = 122/328 (37%), Gaps = 47/328 (14%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFEGYGRG------LCANKEFKKDEVIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ PL ++ EL + G+ A + E+ + V + L QQ+P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYF-- 157
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
++F ++FK+ A QS V + F V P +S+ C Q
Sbjct: 158 QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQT 216
Query: 312 VV------DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
+ K GE I Y I + N KLL+ YGFV+E+NP D L++
Sbjct: 217 ETFSFQTNEELVKPGEQIFNNYRI-----------RSNEKLLLGYGFVEENNPCDNLLLR 265
Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVF 393
+D QY + + ++ S F
Sbjct: 266 IYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
Length = 440
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
Length = 447
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 91/461 (19%), Positives = 184/461 (39%), Gaps = 85/461 (18%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT-----LER 138
W NG K I+ + + R + VA D++A + F++P +V++ +
Sbjct: 10 WFTTNGGEFSKDIVAIGENVDGMGRGLVAVA---DIKAQTSLFTIPRDIVLSTRTSSFKE 66
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+G + +L N + L + + +E QG S W Y + L +Q S +
Sbjct: 67 KVGQDVYKQLENDN-IGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLM 118
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
WS+ EL+ L G+ +I E I+ E+ + + ++F DI +T ++
Sbjct: 119 FWSKEELSLLKGTTVVDKI--GLEDIENEFERVRDIVKQNENVFG----DIAN--YTLDL 170
Query: 259 FKQ--AFVAVQSCVVHLQKV---------------------SLARRFALVPLGPPLLAYS 295
FK+ + + +S V K + A+VP+ L + +
Sbjct: 171 FKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRT 230
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD- 354
A ++ ++++ + KAG+ I YN PN+ L+ YG VD
Sbjct: 231 DSVNAHTEYEENCLRMISLQDIKAGDQIFNTYN-----------DPPNADLIRRYGHVDY 279
Query: 355 ----ED-----NPYD------RLVVEAALNTEDPQYQDKRM--VAQRNGKLSVQVFHVHA 397
+D N D +++E AL ++++R+ + G+ S ++ H
Sbjct: 280 SPLSQDPDFMGNKNDVVELPADILLELALPDAKESHKERRVEFLLDECGEDSFELTH--- 336
Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 457
+ + ++L L + +E ++ S P + + + L K R+
Sbjct: 337 ---DDLVPELLKICVLLFTESEAEFKTREKSRK--LPKASGFTKGKAEFLIKAIKQRMEQ 391
Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 497
Y +TL +D + L + + P+ +V + E+++LN ++
Sbjct: 392 YGSTLEDDISKLDNKDSLPENNFKALVVTVGERRILNKAIE 432
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 15/140 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P SLV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N + +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEI 217
+ W++ EL L+GS +I
Sbjct: 126 MFWTDEELRELSGSAVLNKI 145
>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
Length = 440
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
Length = 440
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
Length = 728
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
+WM K G+ V + P H + A+ D++ GD VP + ++T + V G
Sbjct: 51 AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104
Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+ +A L L E L L +M E +G++S + PY+ L R A
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159
Query: 195 ESPLLWSETELAYLTGS 211
SPL WS EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176
>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
troglodytes]
gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
Length = 440
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDXELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R +++PL L A +SKC A L + +++V R + E + YNI
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264
Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
G PNS+LL YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 77/174 (44%), Gaps = 21/174 (12%)
Query: 311 LVVDRPYKAGESIVVWYNISFTGHDF------KCGPQPNSKLLINYGFVDEDNPYDRL-- 362
++V R G V+ S T D GP N +LL YGFV E N +DRL
Sbjct: 336 VLVRRVEPNGGDFVMQIGASLTAQDIGREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPF 395
Query: 363 ---VVEAALNTEDPQYQDKR--MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 417
EAA+ E + +R +VA+ L+ + H GR A ++ LR+ +++
Sbjct: 396 PLDFPEAAVGDE---WDGRRAALVAKYGLHLAGCCWICHDGRPPPA---LVALLRV-HLA 448
Query: 418 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
+ E ++ + GP + E V +AD + L + +L EDE +L +
Sbjct: 449 EAEEFDTMERN-GPFASLGAGTEARVFATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
Length = 440
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L+ + L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELDTCALAPYLDL 234
Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 126/331 (38%), Gaps = 43/331 (12%)
Query: 71 VSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP- 129
+ K E + W ++G+ + L E + +AA++++ G+ S+P
Sbjct: 67 IQKSGEGPLGFQEWALQSGITSPSLRLAEFAG-------LRGMAAADNIAKGEVLVSLPV 119
Query: 130 -NSLVVT-LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
+LVV+ ER T +K +AL L+YE++ G S PY+ L
Sbjct: 120 AAALVVSPKERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALP--- 176
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------DTVW 235
+ +PL WSE +L L E+ + EG+KR + EL D +W
Sbjct: 177 ----VDFSTPLSWSEAQLQALCYPQLIREVATQREGLKRLHAELAVSTPGTPITEQDLIW 232
Query: 236 FMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLL 292
+ + + PY PT + F + + ++ A AL L +L
Sbjct: 233 ALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVL 292
Query: 293 AYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWYNISFTGHD--FKCGPQPNSK 345
+ K AM VD VQ V+ Y A V + G G Q N
Sbjct: 293 SQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSVRCQSYFSKGEQVFISYGKQSNDS 352
Query: 346 LLINYGFVDEDNPYDRLVV----EAALNTED 372
LL YGFV+ P+D + AAL D
Sbjct: 353 LLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383
>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 478
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 126/316 (39%), Gaps = 62/316 (19%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ +G P KV K + + V A D+ G+ F++P V++ + N
Sbjct: 22 SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ +LL+ + +L L L ++YE G +S W Y + L R+ ++ + W
Sbjct: 76 SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILER--AEGIKREYNELDTVWFMAG-SLF------QQYPYDIPT 251
S +EL L GS I++R EG + E+ A SLF Y D T
Sbjct: 129 SPSELQELQGSA----IVDRIGKEGAEESILEMIAPIVRANPSLFPPVDGLASYDGDAGT 184
Query: 252 EAF-------TFEIFKQAF-----------VAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+A I AF +S V + L++ +VPL L A
Sbjct: 185 QALLNLAHVMGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSK--GMVPLADLLNA 242
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
+ + A L + + + +P AG I Y G P + LL YG+V
Sbjct: 243 DADQNNARLFQEETGLVMKAIKPISAGAEIFNDY-----------GEIPRADLLRRYGYV 291
Query: 354 -DEDNPYDRLVVEAAL 368
D +PYD VVE +L
Sbjct: 292 TDNYSPYD--VVELSL 305
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 122/310 (39%), Gaps = 52/310 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTT--NKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + I +L + LS LA+Y+++ +
Sbjct: 34 FKEGERILTIPSSILWTVEHAYADSIIRPVLQSMQGALSVDDTLAIYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQL-----AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R + + S + +++ EL GS + + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF QY FT E +K A V S + + L P +
Sbjct: 143 ------VERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-M 195
Query: 292 LAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
L +SS KC + D + ++ + Y+ G+ + + Y G PN++LL
Sbjct: 196 LNHSSDVKKCHVYDTSSGD-LSILAGKDYEPGDQVFINY-----------GSIPNNRLLR 243
Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
YGFV +NP D + E P ++ K+ + G SV + +++D L
Sbjct: 244 LYGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISL-------SLNDPL 296
Query: 409 P-----YLRL 413
P YLR+
Sbjct: 297 PKSVLQYLRI 306
>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 392
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
K+E+ DL W NG + E +++ + A+ +++A +
Sbjct: 74 KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121
Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
VP L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++PL + E E+ YL + ++ + + R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216
>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
Length = 416
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 114/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + + +T K S L L +L+ EK G +S W
Sbjct: 44 LQEGQMIISLPESCLLTTDTVIRS-YLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 102
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 103 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 154
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 155 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 210
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 211 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 259
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 260 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308
>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
Length = 770
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
A ED+Q +A +PN ++T+ER +E + +++ + L +++M E
Sbjct: 67 AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126
Query: 168 KKQGKKSFWLPYIRELD 184
++G +SFW PY +D
Sbjct: 127 FQKGDQSFWYPYFNAVD 143
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 9/144 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL AG +P S + + N TI+ LL ++ + L L +YE +
Sbjct: 40 VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNE 230
KS W PY++ + +G ++V P WSE L G T + L A ++E Y
Sbjct: 96 EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRG--TTLDTLYDALSPQQEVYEG 152
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF 254
+ +A Q++ +P E F
Sbjct: 153 FEISLHVAKKWNQEFSLPLPEEYF 176
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 82/384 (21%), Positives = 143/384 (37%), Gaps = 80/384 (20%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ +VP+ L A +++ A + + + +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262
Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDK 378
G+ ++ +Y GP P S+LL YG+V E + YD + V L E
Sbjct: 263 GDQVLNYY-----------GPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE------- 304
Query: 379 RMVAQRNGKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
V LS + + V + + E I D Y + S ++ + VSP
Sbjct: 305 --VIVSCLSLSAEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSP 359
Query: 438 CMERAVLDQLADYFKA-----------------------------RLAGYPATLSEDEAM 468
+++QL ++ KA RLA YP ++ DE +
Sbjct: 360 ----ELVEQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKL 415
Query: 469 LTDYNLHPKKRVATQLVRMEKKML 492
L + +L ++R+A + EKK+L
Sbjct: 416 LAEADLPARRRMAVVVRLGEKKLL 439
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 73/329 (22%), Positives = 122/329 (37%), Gaps = 49/329 (14%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFDGYGRG------LCANKEFKKDEIIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
+ PL ++ EL + G+ A + E+ + V + L QQ+P Y P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ ++FK+ A QS V + F V P +S+ C Q
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQ 215
Query: 311 L------VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
+ K GE I Y I + N KLL+ YGFV+E+NP D L++
Sbjct: 216 TETFSFQTNEALVKPGEQIFNNYRI-----------RSNEKLLLGYGFVEENNPCDNLLL 264
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
+D QY + + ++ S F
Sbjct: 265 RIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
Length = 440
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ + + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
Length = 512
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A++D+ AG VP L++++E + + L ++ L+ LA++L+ E +
Sbjct: 37 VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96
Query: 171 GKKSFWLPYIREL 183
G SFW PY+R L
Sbjct: 97 GAASFWQPYLRSL 109
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 72/329 (21%), Positives = 126/329 (38%), Gaps = 49/329 (14%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGIIDGVDVKTFEGYGRG------LCANKEFKQDEIIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L +
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWPYINVLPK------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
+ PL ++ EL + G+ + E+ + V + L QQ+P Y P
Sbjct: 107 -TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-KAMLAAVDDAV 309
+ ++FK+ A QS V + F V P +S+ C +A + + +
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTR 215
Query: 310 QLVV-----DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
+ K GE I Y I + N KLL+ YGFV+E+NP D L++
Sbjct: 216 TETFSFQTNEEVVKPGEQIFNNYRI-----------RSNEKLLLGYGFVEENNPCDNLLL 264
Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
+D QY + + ++ S F
Sbjct: 265 RIYFEVDDNQYNEIEEILKQEEIKSFDFF 293
>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
Length = 490
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 71/185 (38%), Gaps = 18/185 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL AGD VP S ++ R G IA LL + L +A L + +YE+ G
Sbjct: 36 VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNEL 231
+S W + L E P WS E L EI G E Y EL
Sbjct: 92 QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGTEVEEIGGLETGEDEEVYQEL 148
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+F I E +F+ FK+ V + S + + R LVP G L
Sbjct: 149 IVPFFEDNGKL------INLECPSFDEFKKLVVVIASRAFEVDQF---RELCLVP-GACL 198
Query: 292 LAYSS 296
+S
Sbjct: 199 FNHSD 203
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 9/120 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EY + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEYISVKNV 124
>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
boliviensis]
Length = 440
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 112/289 (38%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
Length = 559
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 63/267 (23%), Positives = 105/267 (39%), Gaps = 39/267 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A EDL GD +P +++++ E V + L + L+ M EK
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVDSE 248
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L A + L + + L G+ E+++ E ++++YNEL
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
F A L +P P E +++E F A S + + R LVP
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
L P +L Y + + D+++ + RP +AGE + Y G S L
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSY-----------GNYSGSHL 398
Query: 347 LINYGFVDE-DNPYDRLVVEAALNTED 372
+ YGF+ E DN D + ++ +D
Sbjct: 399 VTFYGFLPEGDNVNDVIPLDIDFGDDD 425
>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 513
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 105/451 (23%), Positives = 164/451 (36%), Gaps = 93/451 (20%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A +D+ G FSVP SL ++ L +++G E + L NK L L +M+E+
Sbjct: 34 ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ S W Y L R A ++P+ W+ EL L G+ +I + E +R+Y
Sbjct: 91 RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141
Query: 230 EL-------DTVWFMAGSLFQQYPYD--------IPTEAFTFEIFKQAFVAVQS------ 268
E+ F G + Y + I + +F E +K+ QS
Sbjct: 142 EILNPAVRTRPDLFDPGHIASFYSLENYHVMGSRILSRSFHVEKWKEQTPGSQSRASSEL 201
Query: 269 -----CV------VHLQKVSLAR--------------RFALVPLGPPLLAYSSKCKAMLA 303
C+ +L V A+VP+ L A A L
Sbjct: 202 HENGDCMDIDDESSNLSAVGAENGGDDDSDDEAENPSDIAMVPMADMLNAQYGSENAKLF 261
Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD--------- 354
+ +V +P + GE I YN G PNS LL YG VD
Sbjct: 262 YEPTHLNMVSTKPIRRGEQI---YNA--------YGDLPNSALLREYGHVDLVPLPGVPW 310
Query: 355 -EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS------DM 407
E NP D + + A L R+ A+ + K + + G + + D+
Sbjct: 311 KEGNPADVVEIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVFVLGTDLELPDV 369
Query: 408 LPYLRLGYVSDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
+ + E + S P P + + + + RLA YP TL DE
Sbjct: 370 MISFLKLLLLSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRLAKYPTTLEHDE 429
Query: 467 AMLTDYNLHPKKRVATQLVRM-EKKMLNACL 496
A+L+ P +VR+ EK +L C+
Sbjct: 430 ALLSGQTSLPYNVRNAIIVRIGEKHILVGCM 460
>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 889
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 15/135 (11%)
Query: 79 GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
G L+ W+ GLPP KV + + V E L +VP L++T +
Sbjct: 32 GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
L + LL + + + LA +L ++Q G K+ W Y+ L Q G
Sbjct: 86 ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138
Query: 197 PLLWSETELAYLTGS 211
L W+ E+ L G+
Sbjct: 139 VLEWASEEVDLLRGT 153
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 33/192 (17%)
Query: 331 FTGHDFKCGPQ--------PNSKLLINYGFVD----------EDNPYD------RLVVEA 366
T H+ K G Q PNS LL YGF+D NP D LVVEA
Sbjct: 301 MTVHEIKAGEQIWNTYGDPPNSDLLRRYGFIDVTKLESPLSGAGNPADIVEIPANLVVEA 360
Query: 367 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 426
A + QD+ V + VF V G + E +M+ RL + +E +
Sbjct: 361 ATKHTTSKTQDR--VDWWLEEAEDDVFVV--GTDCELPPEMVSLARL-LLQPKAEWEKT- 414
Query: 427 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN-LHPKKRVATQLV 485
+ G + P M+ + D ++RL YP ++ EDE +L D + L +++A +
Sbjct: 415 KAKGKVP--KPTMDTTIAAIAMDVLQSRLKEYPTSVEEDERLLADESQLGFNRKMAVTVR 472
Query: 486 RMEKKMLNACLQ 497
EK++L L+
Sbjct: 473 LGEKRILAGTLR 484
>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
Length = 209
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 12/79 (15%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
+++PL L A +SKC A L +++++ +P GE + YNI G P
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQV---YNIY--------GDHP 261
Query: 343 NSKLLINYGFVD-EDNPYD 360
NS+LL YG+V+ E + YD
Sbjct: 262 NSELLRRYGYVEWEGSKYD 280
>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 485
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 69/313 (22%), Positives = 116/313 (37%), Gaps = 56/313 (17%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
+ EL L+GS +L + E N L + + +P +D P
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185
Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
A+ F+I K + ++ +VPL L A + +
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDN 357
A L D + + +P + GE I Y G P + LL YG+V D
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDY-----------GELPRADLLRRYGYVTDNYA 294
Query: 358 PYDRLVVEAALNT 370
YD VE ++ T
Sbjct: 295 QYDE--VEISMRT 305
>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 485
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 69/313 (22%), Positives = 116/313 (37%), Gaps = 56/313 (17%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
+ EL L+GS +L + E N L + + +P +D P
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185
Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
A+ F+I K + ++ +VPL L A + +
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDN 357
A L D + + +P + GE I Y G P + LL YG+V D
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDY-----------GELPRADLLRRYGYVTDNYA 294
Query: 358 PYDRLVVEAALNT 370
YD VE ++ T
Sbjct: 295 QYDE--VEISMRT 305
>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
Length = 504
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 510
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DL+ G+ VP S ++T + L + + + + LS L + L+YE +G
Sbjct: 57 LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116
Query: 172 KKSFWLPYIRELDR 185
+ SFW PY+ L R
Sbjct: 117 QSSFWYPYLMHLPR 130
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 11/82 (13%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
++PL S+K + V++ Q++ ++P K GE I Y G
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNY-----------GLY 219
Query: 342 PNSKLLINYGFVDEDNPYDRLV 363
N LL+ +GF++ DNP+D +
Sbjct: 220 TNEMLLLTHGFIEFDNPHDHFI 241
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 59/238 (24%), Positives = 89/238 (37%), Gaps = 34/238 (14%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
LYL K K W PYI L + PL ++ EL + G+ A + E+
Sbjct: 42 LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+ V + L QQ+P ++F ++FK+ A QS V +
Sbjct: 93 NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV------DRPYKAGESIVVWYNISFTGHD 335
F V P +S+ C Q + K GE I Y I
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRI------ 198
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
+ N KLL+ YGFV+E+NP D L++ +D QY + + ++ S F
Sbjct: 199 -----RSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 70/284 (24%), Positives = 120/284 (42%), Gaps = 50/284 (17%)
Query: 100 KPSHNEKH-RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK----- 153
KP E+ R I +A E++ FS+P ++++ + + + +IA + +K
Sbjct: 35 KPHDGERGVRVISDIAPCEEM------FSIPEKILMSRKSCMAS-SIAHVFRKHKDVLFS 87
Query: 154 -LSELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
ELA L L ++YEK QG SFW P I L G + WSE EL L
Sbjct: 88 SRDELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDE 139
Query: 212 PTKAEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
KAE + +++ Y L + G +F + +T+E F+ A + V+S
Sbjct: 140 SLKAEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRT 192
Query: 271 VHLQKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
RF ++VP L + + + + D +GE +
Sbjct: 193 FG--------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCD---ASGEHVHRR 241
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
+F + GP+ N++LL++YGF + N Y+ + + +NT
Sbjct: 242 GEEAFMSY----GPRSNAELLLHYGFALQSNRYEAVELNFRINT 281
>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
Length = 418
Score = 42.7 bits (99), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 77/183 (42%), Gaps = 33/183 (18%)
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 256
W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F +
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 311
E K + V S + +++ LV + P+L + +S K + + +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311
Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
DR Y + I W N G N++L ++YGF +NPYD E L T+
Sbjct: 312 TADRDYAENDQI--WINY---------GDLSNAELALDYGFTVPENPYD----ETELLTQ 356
Query: 372 DPQ 374
P+
Sbjct: 357 FPE 359
>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
gallopavo]
Length = 241
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 94 KVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE 147
K LK++ + RP + + + LQAG+ S+P +VT VL N + E
Sbjct: 36 KKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVL-NSCLGE 94
Query: 148 LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
+ K +S L L +L+ EK G+KS W PY+ L +
Sbjct: 95 YIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK 134
>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
Length = 491
Score = 42.7 bits (99), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 65/284 (22%), Positives = 111/284 (39%), Gaps = 56/284 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + F +P L++++E E + L +L L + ++YE QG+
Sbjct: 61 LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYNE 230
+S W PY L + ++ + W+E +L L G +I + A E I ++
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVP 171
Query: 231 L--------------------DTVWFMA---GSLFQQYPYDI-PTEAFTFEIFKQAFVAV 266
L D + +A GSL Y +DI TE + + ++
Sbjct: 172 LIQANPHHFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTD 231
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ A+ +VPL A + + A L + + + R +AGE I
Sbjct: 232 -------DEDEPAK--GMVPLADIFNADAQRNNARLFQEEGSFVMKAIRNIQAGEEIFND 282
Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
Y G P + LL YG+V DN VVE +L++
Sbjct: 283 Y-----------GELPRADLLRRYGYV-TDNYAQYDVVEFSLDS 314
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 42.7 bits (99), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 101/256 (39%), Gaps = 35/256 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ D+QAG+ VP L TL+ V + I+ L N +S A LA YL +K
Sbjct: 33 IIATRDIQAGETILFVPFKLFRTLKHV--PKAISRRLPRN-MSLHALLATYLSLDKTD-- 87
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F +P D + P LW EL P ++++ KR+
Sbjct: 88 -TFAIPNKTLPDLSSFEAGM----PFLWP-AELHPFLPKPALDLLMKQQRSFKRD----- 136
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
W + + D A+ + ++F ++++ R A++P+
Sbjct: 137 --WDIVSKAYSNISQDQYLHAWLL-VNTRSFYCTTPI---MERLPHDDRLAILPVADLFN 190
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
C+A A+ + + DR Y+ GE + + Y G LL YGF
Sbjct: 191 HADVGCEARFAS--ENYSFIADRDYRTGEELHISY-----------GSHSTDFLLTEYGF 237
Query: 353 VDEDNPYDRLVVEAAL 368
V +N +D + ++ A+
Sbjct: 238 VPTENCWDVVCLDEAI 253
>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
Length = 486
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 72/173 (41%), Gaps = 29/173 (16%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 254
S W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369
Query: 255 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 307
+ E K + V S + +++ LV + P+L + +S K + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422
Query: 308 AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
+ DR Y + I W N G N++L ++YGF +NPYD
Sbjct: 423 YAVVTADRDYAENDQI--WINY---------GDLSNAELALDYGFTVPENPYD 464
>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
Length = 440
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 112/289 (38%), Gaps = 38/289 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + KA + ++ ++ E + + Y GP N +L +
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283
Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
YGFV NP+ R ++ L + D Q K + + +G + F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++A+ L +L L + +M E+ G +S W Y L RG L P+ W+E
Sbjct: 85 SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L L G+ + E AE ++ +Y+E + L +P E TFE + +A
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193
>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 165
Score = 42.4 bits (98), Expect = 0.56, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
V A E++ A ++PN+L+++ V +E L L + ++ LALYL
Sbjct: 51 VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
M EK +G+ SFW PY+ Q+A ES L W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143
>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
Length = 777
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 74/165 (44%), Gaps = 31/165 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLM 165
VAA + + +A +PN L++ +++ +E E T K S+ L ++
Sbjct: 112 VAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPNEFKNTEK-SDQTSLIFFVA 170
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVES--PLLWSETELAYLTGSPTKAEILERAEG 223
E +G++S+W PY + A +S P W + + L + KAE+
Sbjct: 171 LELLKGEESYWHPYF----------ETAQDSDLPQFWEDQNIDELEDALIKAEL------ 214
Query: 224 IKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
+ +++D + + +A + YP + E FT EI+K+A+ V
Sbjct: 215 ---QMHQVDFIGDYEIAHGIANHYPDLVHAEKFTIEIYKRAYNIV 256
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 73/347 (21%), Positives = 142/347 (40%), Gaps = 49/347 (14%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIREL 183
++P+ L+ T++R + + LL++ + LS LA Y+++ + + K + P
Sbjct: 41 LTIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLATYILFIRAR-KSGYDGP----- 94
Query: 184 DRQRGRGQL--AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Q L + S + +++ EL GS A I+ +Y +L L
Sbjct: 95 --QSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARL 146
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKC 298
F ++ P++ FT + +K A V S + K+ L+ +L +S +C
Sbjct: 147 FGRHRDVFPSDKFTIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQC 205
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
+ + ++ + Y+ G+ + + Y GP PN++L YGFV NP
Sbjct: 206 HVYDPQSGN-LSILAGKSYEPGDQVFINY-----------GPIPNNRLSRLYGFVVPGNP 253
Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 413
D + + + P ++ K + G S + ++D LP YLR+
Sbjct: 254 NDSYDLVLSTHPMAPFFEQKHKLWIAAGLDSTSTVSL-------TLTDPLPRSVLRYLRI 306
Query: 414 GYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGY 458
+++T ++ +V + + +S E VL L + A L G+
Sbjct: 307 QRLNET-DLAAVGTRQSDVAFEKISDSNETEVLTFLVESISALLDGF 352
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 42.4 bits (98), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 143/364 (39%), Gaps = 59/364 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL- 218
L L L+YE+ + S W EL G V P W +LA L G ++L
Sbjct: 201 LVLSLIYERYVAETSHW----NELLLSCPGGYPNV--PSFWDWEDLAELEGLDVLDDVLA 254
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQ--QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
++A+ + + + + F+ +L ++ D E F+ E A S +L V
Sbjct: 255 KKAQLAQFQTETMAVLPFIHEALAGGCRFGKDEFLECFSIEAMMWARATFDSRAFNL-NV 313
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
ALVP+ ++ + ++ ++ V+ G V+ S T D
Sbjct: 314 DGRVVIALVPVAD-MINHHNRSDVLVRKVE-----------PNGGDFVMQIGASLTAQDI 361
Query: 337 ------KCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 383
GP N +LL YGFV E N +DRL E + E + +R +VA
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418
Query: 384 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
L+ + + H GR A+ +L ++++ E + + GP + E V
Sbjct: 419 YGLHLAGRCWICHDGRPPPALVALLRV----HLAEAEEFDT-MERKGPFASLGAGTEARV 473
Query: 444 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 488
+ +AD + L + +L EDE +L D N P KR+A L+RM
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532
Query: 489 KKML 492
K +
Sbjct: 533 MKRI 536
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 108/274 (39%), Gaps = 61/274 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
+ A ++ + F++P +LV++ + N + +LL + L CL L ++YE Q
Sbjct: 50 IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI--LERAEGIKRE- 227
G S W Y + L + ++ + W++ EL L+GS +I + I R+
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGSAVLNKIGKSDAEAAILRDI 158
Query: 228 -------------------YNELD---TVWFMA---GSLFQQYPYDIPTEAFTFEIFKQA 262
Y+ D T+ +A GSL Y +DI E +
Sbjct: 159 FPIVSTNPHLFPPISGLGSYDSPDGRATLLSLAHRMGSLIMAYAFDI-------EKGEDE 211
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
VQ + + L + +VPL L A + + A L D + + +P + GE
Sbjct: 212 EGEVQDGYITDEGEELTK--GMVPLADLLNADADRNNARLFQEDGYLAMKSIKPIRNGEE 269
Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
I Y G P + LL YG+V ++
Sbjct: 270 IFNDY-----------GELPRADLLRRYGYVTDN 292
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 42.0 bits (97), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 87/403 (21%), Positives = 164/403 (40%), Gaps = 48/403 (11%)
Query: 81 LKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +W++K+G + + L + P V A + + ++P + + T++
Sbjct: 1 MDAWLNKSGAVGLGDLDLADFPETGRG------VKAQRPFKEDERILTIPANCLWTVKGA 54
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +L + + LS LALY+++ + +G+ + +RQ L E
Sbjct: 55 YADPLFGPVLQSVQPPLSVEDTLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYT 108
Query: 198 L--LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAF 254
L +++ EL GS + +Y +L T FM LF P + F
Sbjct: 109 LSMYFTDEELRVCAGSSLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKF 161
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
+F+ +K A ++ S + +S L+ +L ++S K A L V
Sbjct: 162 SFQHYKWALSSIWSRGMDF-TISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVL 220
Query: 314 -DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 372
R Y+ G+ + ++Y G NS+LL YGFV DNP D + ++
Sbjct: 221 ACRDYEVGDQVFIYY-----------GNVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMA 269
Query: 373 PQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 430
P Y+ K QR KL+ ++ + + +L YLR+ + D S++ ++ +
Sbjct: 270 PLYEQK----QRLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIA 324
Query: 431 --PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
+S E +L L+ +A L G+ +L + E L +
Sbjct: 325 TESYTKISDENESQILLFLSQSIEALLEGFEISLEKLETQLAE 367
>gi|409079523|gb|EKM79884.1| hypothetical protein AGABI1DRAFT_119942 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 401
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 124/346 (35%), Gaps = 79/346 (22%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE--- 137
K+W H+ G + Y A+ED+ + P S+ +T E
Sbjct: 9 FKAWFHERG------------GYLHDGVGFTYTVATEDIPTNATIITCPFSIAITEELAK 56
Query: 138 RVLGN----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
R L N +++ + +S CL L K + + PY LD QL
Sbjct: 57 RNLSNFFSADSLVDWTEKQWISTYICLHWILSDSKPTELEHY--PY---LDTLPPLNQL- 110
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+PL +++ E+ GS L R +K E+ E +V L Q D +
Sbjct: 111 -RTPLQFTKIEIETFKGSNLYHATLNRERQLKEEWQECQSV------LISQN--DSWGKG 161
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRF--ALVPLGPPLLAYSSKCKAMLAAVD----- 306
FT+E + A V S R F ++ P L+A +L VD
Sbjct: 162 FTWERYLTAATYVSS-----------RAFPSTILSPNPSLIATPETKFVLLPGVDAFNHK 210
Query: 307 --DAVQLVVDRPYKAGESIVVWY---NISFTGHDFKC---------GPQPNSKLLINYGF 352
AV V P K+G S+ Y IS H GP+PN L++ YGF
Sbjct: 211 RAQAVSWSVTYPDKSG-SLASSYKGPTISLVPHTKTSAGEEIFNNYGPKPNGNLILGYGF 269
Query: 353 VDEDNPYDRLVVE------------AALNTEDPQYQDKRMVAQRNG 386
NP D ++++ AA E + RMV + G
Sbjct: 270 SLPANPDDTILLKIGGFEKKWEIGRAATGVEGLWTELLRMVKENQG 315
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 97/245 (39%), Gaps = 42/245 (17%)
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+T+E +LG LS L+ Y+ +EK++G SFW P+I L + LA
Sbjct: 135 LTMEEMLG------------LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA 181
Query: 194 VESPLLWS------ETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYP 246
PL+W EL L + TK + + + + +YN + D + +
Sbjct: 182 ---PLVWKVLKVDHYEELLKLLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNER 238
Query: 247 YDIPTEAFT----FEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCK 299
+ T+A E++ +++ + S +++ Q + A F + P L +S +
Sbjct: 239 SNDLTDAIRHLVPIELYLWSWMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQ 297
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
L Q+ Y E + + Y GP N LL YGF +N +
Sbjct: 298 CGLKIDGTGFQVYTTCSYNPDEQLFLSY-----------GPHSNEFLLCEYGFTLPENKW 346
Query: 360 DRLVV 364
+ L V
Sbjct: 347 NDLDV 351
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 68/325 (20%), Positives = 126/325 (38%), Gaps = 40/325 (12%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP 101
S ++L KN+FS S + S EV + E +W+ G+ K+ +
Sbjct: 728 SLIQLSNLKNQFSKPTIKSVPFIKPSDEVYKRFE-------NWLKAGGVQFPKLQIA--- 777
Query: 102 SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA 161
N V ++ + +A VP ++ ++ + + + L++ L
Sbjct: 778 --NFTDSTGRGVVTTKKVDENEAVVVVPKKYLINVDVAKAHPILGPIFEELHLNDDTILF 835
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L+++YEK SFW P+ L + + +S TEL L G+ + E
Sbjct: 836 LFVIYEKGNAN-SFWRPFYDTLPS-------YFTTSIHYSATELLELEGT----NLFEET 883
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
K++ N F L +QYP P F++E F A + S + L K+ + +
Sbjct: 884 LHTKQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIK 940
Query: 282 FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCG 339
LVP+ + +++ + + +++ A I + Y G
Sbjct: 941 SCLVPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHY-----------G 989
Query: 340 PQPNSKLLINYGFVDEDNPYDRLVV 364
N +L + YGF+ +N YD L +
Sbjct: 990 ALQNWELALYYGFIIPNNIYDSLHI 1014
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----------------LS 155
+ A+ DLQ G+A VP L++T + +A L + L
Sbjct: 32 IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
LA +L+ E +G +SFW PY+++L R
Sbjct: 92 PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121
>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
Length = 496
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 19/121 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
A +DL G F++P +L ++ L + G E L KL + A L L +M+E
Sbjct: 42 ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QGK+S W Y+ L A ++P+ W+E +L+ L G+ ++ + E +R+Y
Sbjct: 98 AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148
Query: 229 N 229
+
Sbjct: 149 D 149
>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 514
Score = 42.0 bits (97), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 59/241 (24%), Positives = 98/241 (40%), Gaps = 31/241 (12%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
N + + LAL LMYE+ + S W ++R L +ES L W++ EL +
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
K ER ++ Y + T Y D+ FT E F A V Q+ +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
H S +L+P+ ++ + + A++ A + + KAGE I + Y +S
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITIDYEMS- 292
Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
+ LL+ YGFV+ D + + ++D Y + + + G LS Q
Sbjct: 293 -----------SHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYPRRLRLLEGRG-LSRQ 340
Query: 392 V 392
V
Sbjct: 341 V 341
>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
CIRAD86]
Length = 449
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 102/261 (39%), Gaps = 47/261 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A+ DL + + F +P + ++T E + I + LT LS L L +++E G
Sbjct: 42 VVATSDLTSDEEIFRIPRTSILTTETTDLPQEILQQLTDPWLS----LILAMIFEYLLGT 97
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY--- 228
S + PY+ L + + + W++ EL YL GS ++I E A+ E
Sbjct: 98 NSRFKPYLDILPE-------SFNTLMFWTDNELQYLQGSAILSKIGKEEADNTFSEQLLP 150
Query: 229 ----------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
+L + GS+ Y +D+ T +
Sbjct: 151 IITKNPEIFKIGTCNNQDLLALCHRMGSIIMSYAFDLDPPPTT--TTSSSEEWESDSDSE 208
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
+K+S AL+PL L A + L D+ + +P AGE ++
Sbjct: 209 NEKISPK---ALIPLADMLNANGDLTNSKLFFSSDSFIMKTLQPVAAGEELL-------- 257
Query: 333 GHDFKCGPQPNSKLLINYGFV 353
+DF GP P + LL YGFV
Sbjct: 258 -NDF--GPLPPADLLRRYGFV 275
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 11/83 (13%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
++PL S+K + V+ Q++ ++P K GE I Y G
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNY-----------GLY 219
Query: 342 PNSKLLINYGFVDEDNPYDRLVV 364
N LL+ +GF++ DNP+D +
Sbjct: 220 TNEMLLLTHGFIEFDNPHDHFIT 242
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 42.0 bits (97), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 127/333 (38%), Gaps = 76/333 (22%)
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
+P ++ A S+ G A+ +P LVVT ++ G + + T + + L +YL Y
Sbjct: 28 KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKD--TYKNANHSSLLKIYLCYS 81
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ Q +SF PY+ L + A++SP +WS + A L G+ + E + E
Sbjct: 82 RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIP------------------TEAFTFEIFKQAFVAVQSC 269
W+ A +L P D+P T+ + F + + +
Sbjct: 135 -------WWNAINLL---PEDVPKPEQHFINLKFYYENKFYTDDDYYSYFNEVDTSNWTS 184
Query: 270 VVHLQKVSL---ARRFALVPLGPP-------------LLAYSSKCKAMLAAVDDAVQLVV 313
+ SL +R F + P LL ++ K K + D
Sbjct: 185 FPNYLWASLVLKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKTKVQWSGTDGGFLFQS 244
Query: 314 DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
D +GE + Y G + N +LL+ YGF E+NP D AAL + P
Sbjct: 245 DDA-SSGEELFNNY-----------GQKGNEELLLAYGFAIENNPADS----AALKIKIP 288
Query: 374 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
D ++ ++ + + H + + +SD
Sbjct: 289 ---DSKLQVVKDLGIKLPSIHDYTNSVIDQVSD 318
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 41.6 bits (96), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 65/288 (22%), Positives = 112/288 (38%), Gaps = 35/288 (12%)
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ +S+ E+ GS + I +Y +L T M ++ P F E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 314
+K A V S + VS L+ +L +SS +C A D + ++
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226
Query: 315 RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 374
+ Y G+ + ++Y GP PN++LL YGFV +NP+D + + P
Sbjct: 227 KDYNVGDQVFIYY-----------GPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPL 275
Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL----- 429
Y+ K + + G + + A +D LP L Y+ +S++ ++
Sbjct: 276 YEQKERLWKLAGLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIA 328
Query: 430 -GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
G +S E +L L D A L G+ L A L +++P
Sbjct: 329 TGADEKISDDSETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 394
GP N +LL YGFV E+N +DRL ++ +R +VA L+ + +
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429
Query: 395 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 454
H GR A ++ LR+ ++++ E ++ + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484
Query: 455 LAGYPATLSEDEAMLTD 471
L + +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 41.6 bits (96), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 39/182 (21%), Positives = 76/182 (41%), Gaps = 35/182 (19%)
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
V P++W EL +L + + +R + + ++++ F + DI
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
E +T+ A++ V + + + K R AL+P+ + C+ +
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202
Query: 307 DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEA 366
+ +V DR YK GE + IS++ H N L+ YGFV ++NP D + ++
Sbjct: 203 EGYHIVADRDYKRGEELY----ISYSSHS-------NDYNLVEYGFVPDENPSDDVYIDD 251
Query: 367 AL 368
+
Sbjct: 252 VI 253
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 11/85 (12%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
R AL+PL + YS M D A+ L V + G+ Y GP
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASY-----------GP 263
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVE 365
N LL YGFV++DNP D V+E
Sbjct: 264 VSNDDLLQYYGFVEQDNPSDTYVLE 288
>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 691
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 88/204 (43%), Gaps = 26/204 (12%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYI L + G L +PL + +L +L G+ + A E+
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168
Query: 222 EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
E + +E + EL F ++Y +++ A T + + V + VV ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
L+P +L + K A + V VV AGE + Y
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNY--------- 275
Query: 337 KCGPQPNSKLLINYGFVDEDNPYD 360
GP+ N +L++NYGF ++NP D
Sbjct: 276 --GPRNNEQLMMNYGFCLQNNPCD 297
>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
Length = 689
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/207 (20%), Positives = 82/207 (39%), Gaps = 25/207 (12%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + + +L +L G+
Sbjct: 104 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDNDLEWLQGTNLYQTHQA 158
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 159 YRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 204
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTG--- 333
R P L + + ML VD + + + E+ + + +F+G
Sbjct: 205 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFSGEEV 264
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYD 360
H+ ++ YGF DNP D
Sbjct: 265 HNNYGSLNNQQSVMTTYGFCIVDNPCD 291
>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
Length = 1243
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 66 ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PYI+ L + + + +LW EL + S E +E E K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171
>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
Length = 497
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIREL 183
K S W PY+ L
Sbjct: 115 KTSRWHPYLMHL 126
>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
+ A +L+ G+ VP + ++T E ++ + ++ LS L++ L+YE +
Sbjct: 54 LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113
Query: 171 GKKSFWLPYIRELDR 185
GK+SFW PY+ L R
Sbjct: 114 GKRSFWYPYLVHLPR 128
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 83/213 (38%), Gaps = 38/213 (17%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L L +++E +G S W PY+ L + E+P+ WS ELA L SP A +
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155
Query: 219 ERAEG-------IKREYNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 261
RAEG I E + ++F AG L + I AF E
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215
Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
A + + R +VP+ +L ++ A + ++A+ + R AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274
Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
E I+ +Y GP PN +L YG+
Sbjct: 275 EEILNYY-----------GPLPNGQLCRRYGYT 296
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 60/276 (21%), Positives = 112/276 (40%), Gaps = 46/276 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 42 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 94
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 95 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 150
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEI----FKQAFVAVQSCVVHLQKVSLARRFALVPL 287
LF Q+P P + FT E +K A V S + LA ++ L
Sbjct: 151 VV------RLFVQHPDLFPLDKFTVEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLL 201
Query: 288 GP--PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
P +L ++S+ K + + + + Y+AG+ + + Y GP PN
Sbjct: 202 APFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINY-----------GPVPN 250
Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
S+LL YGFV NP D + + + + P ++ K+
Sbjct: 251 SRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 286
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 121/318 (38%), Gaps = 69/318 (21%)
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
L++ + I + + L+ + C +K S WLPY+ L +
Sbjct: 142 LKKCFEQDMIVKTMDNVALALMVCC-------QKLSPDSSWLPYLDALPQ-------TFS 187
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS--------------- 240
+PL +S EL L+ SP E L + R++ V+F+A
Sbjct: 188 TPLYFSALELRKLSPSPAYEESLIMYRNVARQF-----VYFLAAVQRSERSRSAKKDKNH 242
Query: 241 -------LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL----QKVSLARRFALVPLGP 289
LF P+ + FTF++++ A V + + + K S + A VP
Sbjct: 243 AAVGMEPLFLNAPFTVSN--FTFDLYRWAVACVTTRINFIPSQYAKDSNGQPVA-VPCLI 299
Query: 290 PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
PLL + + D + + YKAG+ + ++Y I + N
Sbjct: 300 PLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFYGI-----------RTNR 348
Query: 345 KLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGRE 400
+ ++ GFV D +N D ++ D Q + + + A N + V VF V+A
Sbjct: 349 QFFLHNGFVPDGENKNDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVFEVNASER 408
Query: 401 KEAISDMLPYLRLGYVSD 418
+S +L + R+ V +
Sbjct: 409 PVPLS-LLDFARVFLVEN 425
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 131/325 (40%), Gaps = 54/325 (16%)
Query: 158 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
AC L L+ E+ Q FW PYIR L + ++PL +++ E+ L G+
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159
Query: 215 A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 266
A ++L R + E+ F+ G ++Y +D+ A +T F V +
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217
Query: 267 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
S + +S F L+PL +L + K + + L+ G +
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276
Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 384
Y GP+ N +LL+ YGFV +NP D L ++ ++ P+ Q ++ QR
Sbjct: 277 NNY-----------GPKGNEELLMGYGFVIPENPGDSLAMKFTIS---PRGQAAQIWEQR 322
Query: 385 NGKLSV-QVFHVHAGREKEAISDMLPYLRLGY-----------VSDTSEMQSVISSLGPI 432
K + +VFH+ + + +P L + V++ +E+ + +
Sbjct: 323 ALKQTWREVFHLTKSADSGQKTSTVPALESDWPEAFVDLFRILVANENEIDDLENGDINA 382
Query: 433 CPVSPCMERAVLDQLADYFKARLAG 457
P+S E AV L K +LA
Sbjct: 383 TPISIRNELAVALGLKAAIKQKLAA 407
>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
Length = 427
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 89/216 (41%), Gaps = 30/216 (13%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+N+ +E A LA+ L E+++G S + P++ L+ T
Sbjct: 79 SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121
Query: 211 SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S E L AE + R Y+E D V W A F+ +P + FT F++A V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
++ + + R LVPL L+ +S + VDD + VD ++AG+ +
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239
Query: 327 YNISFTGHDF-KCGP------QPNSKLLINYGFVDE 355
+ F + G + N++ +I + F DE
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARDVITFTFPDE 275
>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
Length = 479
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 72/350 (20%), Positives = 126/350 (36%), Gaps = 77/350 (22%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +P+ L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W Y R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+I + A DT+ +Q V
Sbjct: 135 SVVGKIGKAAAD--------DTI-------------------------------LQKVVP 155
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
+Q S R F P PPL + S+ + A ++ G SI++ Y
Sbjct: 156 LIQANS--RHFPPRPNMPPLNSPDSQNALLCLA------------HRMG-SIIMAYAFDI 200
Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
D + + + DED P +V A + D Q + R+ + +
Sbjct: 201 EKTDEADEHTADDGYMTD----DEDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKA 256
Query: 392 VFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 436
+ ++++G E LP R GYV+D V+ SL IC V+
Sbjct: 257 IKNIYSGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDAICKVA 306
>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 514
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 101/428 (23%), Positives = 169/428 (39%), Gaps = 76/428 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
VA + ++ AG+ VP + + + + + S A LA +++ E G
Sbjct: 85 VATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFDAG 144
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYN 229
KS + P++R L R V+S + W+E EL+ L+GS + RA + EY+
Sbjct: 145 SKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVVVFTRAIKAQWRMEYD 195
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
LD +L +++P E +TF+ F A + S + L S +
Sbjct: 196 ALDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIR 248
Query: 287 LGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
+ PLL A K + A +AV++ ++ T F +
Sbjct: 249 VLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFR-----------EHTELRFNYDTK 297
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
P+ L+ YGF+ E NP + VEA + D + R+ K + H +++
Sbjct: 298 PSQYFLLQYGFIPETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLDPKKR 348
Query: 402 E------AIS-DMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-LDQLA 448
+I D+L R+ D +EM S S + + +AV L LA
Sbjct: 349 NFEWKPRSIDYDLLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLLKSLA 407
Query: 449 DYFKARLAGYPATLSEDEAML-------TDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
+ L Y TL+ED + D L K++ L+RM +K + L +AD
Sbjct: 408 SF----LESYTTTLAEDNEYVARVDDESNDEPLPGKRKRFAVLLRMREKQI---LLASAD 460
Query: 502 MIML-LPD 508
+ LPD
Sbjct: 461 ALFKELPD 468
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 96/241 (39%), Gaps = 34/241 (14%)
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P ++ ++ N + + L++ L L+++YEK + +FW P+ L
Sbjct: 849 PRKYLINVDVAKSNPILGPIFEELHLNDETILFLFVIYEK-ENPNTFWRPFYDTLPS--- 904
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK--REYNELDTVWFMAGSLFQQYP 246
+ + +S TEL L G+ AE L + ++ R+Y + L QYP
Sbjct: 905 ----YFTTSIHYSSTELLELEGTNLFAETLAVKQQLQAFRDY--------LFPELSNQYP 952
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
P F++E F A + S + L K+ + LVP+ ++ + + + D
Sbjct: 953 DIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMAD-MINHHTNAQISERHFD 1010
Query: 307 ---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
+ ++V A I + Y G NS L + YGFV +N YD
Sbjct: 1011 QDSNCFRMVSSCNIPANNQIFLHY-----------GALQNSDLALYYGFVIPNNIYDSFH 1059
Query: 364 V 364
V
Sbjct: 1060 V 1060
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
+D + LV + + +G + F GP PNS+LL+ +GF DNP++ + +
Sbjct: 229 NDCLHLVTLQDWASGSEV-----------KFSYGPLPNSRLLLLHGFCLPDNPFESVELW 277
Query: 366 AALNTEDPQYQDKRMVAQRNG 386
A + P + +K + NG
Sbjct: 278 AMMEPGAPGFAEKNKIMLDNG 298
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 13/74 (17%)
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
S CK + +A+ +VQ DR YK GE + V Y GP N LL YGF+
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSY-----------GPHSNDFLLTEYGFIL 240
Query: 355 EDNPYDRLVVEAAL 368
+ N +D + ++ +
Sbjct: 241 DTNRWDEVYLDEVI 254
>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
Length = 485
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 9/105 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ AS D+ + F +P L+++++ + L +L L + ++YE QG+
Sbjct: 51 ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+S W PY L + ++ + W++ +L+ L GS +I
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKI 146
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 85/182 (46%), Gaps = 23/182 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYA 218
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
V Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 219 YFYKV-------IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PL 287
PL
Sbjct: 272 PL 273
>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 661
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 13/89 (14%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
A+VPL L A + A L D +++ R K GE I YN G
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQI---YNT--------YGDP 393
Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNT 370
PNS LL YG VD+ N +D VVE ++ T
Sbjct: 394 PNSDLLRRYGHVDDPNRFD--VVEISIKT 420
>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 693
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 85/205 (41%), Gaps = 22/205 (10%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL++ +++ + K S W PY+ L R + S L + +L +L G+
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+K EY+ ++ G L + Y +D+ A+T I +AF + + V+L +
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220
Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
++ + PL+ S+ K A + L V P + E I Y
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNY-------- 272
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYD 360
GP N +L+ YGF DNP D
Sbjct: 273 ---GPLNNQQLMTTYGFCIVDNPCD 294
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 12/79 (15%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
+++PL L A +SKC A L + +++V R + E + YNI G P
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY--------GEHP 128
Query: 343 NSKLLINYGFVDED-NPYD 360
NS+LL YG+V+ D + YD
Sbjct: 129 NSELLRRYGYVEWDGSKYD 147
>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
Length = 441
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 15/152 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL--- 281
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
L + P E +T+E + A+ V
Sbjct: 282 ----IPLLSNHREVFPPELYTWEHYLWAYFDV 309
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 76/355 (21%), Positives = 140/355 (39%), Gaps = 52/355 (14%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
Q G+ ++P + T+E + + +L + + LS LA+YL++ +
Sbjct: 34 FQQGERILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDTLAVYLLFVR-------- 85
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+RE + R +A S + ++E EL G+ + E I+ +Y
Sbjct: 86 ---LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR-- 140
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ +F Q+P +P + + +K A V S + + L P +
Sbjct: 141 ----VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDFVLPNGKPLRVLAPFAD-M 195
Query: 292 LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
+ +S + K A + L V + Y+ G+ I + Y G PN++LL
Sbjct: 196 INHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYISY-----------GSIPNNRLLRL 244
Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
YGFV +NP D + + + P Y+ K+ + G S + + D LP
Sbjct: 245 YGFVIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPL-------TLIDPLP 297
Query: 410 -----YLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
YLR+ + D S++ ++ + L +S E +L L + A L G+
Sbjct: 298 KSVLRYLRIQRL-DASDLAAIALQKLDTNEKISNSKEVEILQFLVESISALLDGF 351
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 14/88 (15%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
R +P+ L + CK + +A+ +VQ DR YK GE + V Y GP
Sbjct: 178 RLVCMPVAD-LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSY-----------GP 223
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAAL 368
N LL YGF+ + N +D + ++ +
Sbjct: 224 HSNDFLLTEYGFILDTNRWDEVYLDEVI 251
>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
Length = 440
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 76/337 (22%), Positives = 134/337 (39%), Gaps = 47/337 (13%)
Query: 65 AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASED 118
A SR V + + +LK W LK++ + P H+ + +
Sbjct: 20 AESRGVNESFKCEFIELKKW------------LKDRKFEDTNLMPAHFPGTGRGLMSKTS 67
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P + ++T + V+ + + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPENCLLTTDTVIES-YLGAYITKWKPPPSPLLALCTFLVSEKHAGDQSPW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
LF + I F++ A+ V + V+L+ RR + P A +
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRH----RRRECLSAEPDTCALAP 230
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD--FKC-GPQPNSKLLINYGFV 353
+ + + V+ + + E + S+ H+ F C G N +LL+ YGFV
Sbjct: 231 YLDLLNHSPNVQVRAAFNEETRCYE---IRTASSWRKHEEVFICYGHHDNQRLLLEYGFV 287
Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
NP+ + V + + DK+M N K+S+
Sbjct: 288 SIQNPHACVYVSREILVKYLPSTDKQM----NKKISI 320
>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 684
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 86/205 (41%), Gaps = 22/205 (10%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +++ + K S W PY+ L R G+L S L + +L +L +
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165
Query: 220 RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+ +K EY+ ++ G L + Y ++I A+T I +AF + + ++ K
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223
Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
R+ + PL+ S+ K A + L V P A E I H+
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEI----------HN 273
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYD 360
GP N +L+ YGF DNP D
Sbjct: 274 -NYGPLNNQQLMATYGFCIVDNPCD 297
>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
V ASEDL + +P SL+++ ++ + I E+ TN+ +E L Y
Sbjct: 46 VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
+ EKK+G++SF+ PYI+ + Q +A +E PL+ E +L G +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162
Query: 214 KAE-ILERAEGI 224
KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174
>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
Length = 370
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 55/288 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE--------LACLALYLMY 166
A + + G +P+ L++T GN+ L N + + ++++L +
Sbjct: 38 AKKPITKGSILLEIPDPLLIT-----GNKVCKWLERNNWIGHQQISSVQGVLLVSIFLFF 92
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
E +Q SFW PY++ L L + LL +Y+T +A+I++ E ++R
Sbjct: 93 ESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQMVESVRR 141
Query: 227 EYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ FQ Y P+ D F+ F + + AV S + +L
Sbjct: 142 ----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICYLPDDIAG 191
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF-KC 338
ALVPLG AVD V ++ + + N S F
Sbjct: 192 ---ALVPLGDIFNH---------EAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSY 239
Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
G N++L++ YGF DNP+D L E ++ + R+V R G
Sbjct: 240 GALSNTELMMYYGFTLNDNPWDTLSFYPHELDESIKFYE-RVVLDREG 286
>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 705
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
+AA+ED+ A +PN ++++L ++ E + +++ N +E +A+Y
Sbjct: 21 IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+++EK +G+KSF+ PY + R +E L E+ Y
Sbjct: 80 VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123
>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
Length = 163
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 36/73 (49%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ + A E ++ G VP L+++ + +E + L+E L L+L+ E+
Sbjct: 77 YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136
Query: 171 GKKSFWLPYIREL 183
G +SFW PY+ L
Sbjct: 137 GSRSFWAPYLDTL 149
>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
Length = 321
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)
Query: 57 VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
V++ D A + ++ +ED W NG + K+ +K + V
Sbjct: 50 VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
E ++ G+ S P +L + + + + + + L +K + L++++E K GK
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162
Query: 174 SFWLPYIRELDRQ 186
SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 13/88 (14%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
R AL+P+ S C + + + DR Y+A E + Y G
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSY-----------GA 224
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAAL 368
N LL YGF+ +DNP+D+L ++A L
Sbjct: 225 HSNDFLLAEYGFMLQDNPHDQLCLDAVL 252
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 66/307 (21%), Positives = 123/307 (40%), Gaps = 58/307 (18%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 11 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70
Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
V + + + + T + + L L + + + +K G + W
Sbjct: 71 YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
YIR + +V P +S E L G+ + + + +++E++ L +
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
W Q++ +D T FTF+ +K +S VV L + A+VP +
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233
Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
S K K +AV QL + + GE + + Y +P S+++ +
Sbjct: 234 ACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYG----------DEKPASEMVFS 283
Query: 350 YGFVDED 356
YGFV+ +
Sbjct: 284 YGFVENE 290
>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 62/311 (19%), Positives = 121/311 (38%), Gaps = 66/311 (21%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE----DLQAGDAAFSVPNSLVVTL 136
L +W NG+ + ++ + + VA +E D +A D +VP+ L +TL
Sbjct: 11 LSTWAKLNGMSLEGIAFQKLHGEHGTDKGTAIVATAEKKDEDAEA-DTLLTVPSDLALTL 69
Query: 137 ERVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWL 177
E V + + + + T + + L L + + + +K G + W
Sbjct: 70 EYVHNHAKTDRHLREVLDAVGDFGRTARGAILIFLIVQITHASPDFANQRQKIGVSNPWT 129
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----D 232
YIR + ++ P +S E L G+ + + + +++E+ L D
Sbjct: 130 EYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFEHLRQATED 182
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
W Q++ +D T FTF+ K +S VV L + A + P +
Sbjct: 183 IHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA-------IVPCVD 229
Query: 293 AYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
+ C+ ++ A D +QL + + GE + + Y +P S+
Sbjct: 230 MANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYG----------DEKPASE 279
Query: 346 LLINYGFVDED 356
++ +YGFV+ +
Sbjct: 280 MVFSYGFVENE 290
>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 544
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435
>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
Length = 546
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437
>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
NRRL Y-27907]
Length = 436
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 49/221 (22%), Positives = 79/221 (35%), Gaps = 30/221 (13%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 210
LS L +Y+ E ++GK SFW P+ LD + PL+W ++ +L L
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
P + + +YN + + Q D + F A++ + S
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224
Query: 271 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWY 327
+++ S + F + P L +S L Q+ Y E + + Y
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283
Query: 328 NISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
GP N LL YGF DN ++ L V L
Sbjct: 284 -----------GPHSNDFLLCEYGFTISDNKWNDLDVTEYL 313
>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
Length = 305
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 55/261 (21%), Positives = 100/261 (38%), Gaps = 53/261 (20%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D+Q G+ F++P S V++ + + +L L A L + ++YE +G S W
Sbjct: 3 DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----------IKRE 227
PY L ++ + WS+ ELA L S +I + + ++R
Sbjct: 61 PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKIGKDSANEMFTNTIIPLVRRH 113
Query: 228 YN---------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
+ +L + GS Y +DI + + ++ ++ + +
Sbjct: 114 ASVFFPDPNTAQGASDGDLLALAHRMGSTIMAYAFDIEPDPASKQVDEEGYASDD----- 168
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
+ +L + +VPL L A + + A L DA+ + AG+ + Y
Sbjct: 169 -EDEALPK--GMVPLADMLNADADRNNARLHYGPDALTMEAVTNISAGDEVFNDY----- 220
Query: 333 GHDFKCGPQPNSKLLINYGFV 353
G P S LL YG+V
Sbjct: 221 ------GSLPRSDLLRRYGYV 235
>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
Length = 253
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 225
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 88/231 (38%), Gaps = 35/231 (15%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
A+VP+ L A A L +++V +P +GE I Y G P
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTY-----------GDPP 303
Query: 343 NSKLLINYGFVD---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNG 386
NS LL YG VD NP D + V A LN + + Q + + G
Sbjct: 304 NSDLLRRYGHVDLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEG 363
Query: 387 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 446
V VF R+ E S+++ +RL + T ++ P V +L
Sbjct: 364 GDDVFVFT----RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHV 416
Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
+ RL YP ++ +DEA+L L KR A + EK +L L
Sbjct: 417 VTGALHERLQQYPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 6/72 (8%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ + W PYI L + +++ W +TEL+YL SP + E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220
Query: 220 RAEGIKREYNEL 231
R E I E+ ++
Sbjct: 221 RLEMITTEFGQV 232
>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
Length = 440
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 76/358 (21%), Positives = 133/358 (37%), Gaps = 50/358 (13%)
Query: 41 GSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEK 100
G +RR+ F VS R V + + +LK W+ +I
Sbjct: 5 GGRTSRIRRRKLFRCSVS---------RGVNESYKPEFIELKKWLKDRKFEDTNLIPACF 55
Query: 101 PSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELAC 159
P + + L+ G S+P S ++T + V+ + + S L
Sbjct: 56 PGTGRG------LMSKTSLREGQMIISLPESCLITTDTVIRSYLGTYIAKWQPPPSPLLA 109
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L+ EK G +S W PY+ L + A P+ E E+ L P KA+ E
Sbjct: 110 LCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLKAKAEE 161
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKV 276
+ ++ ++ + LF + I F++ A+ V + V H Q+
Sbjct: 162 QRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVKHRQRQ 217
Query: 277 SLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFT 332
+ L P LL +S + + A ++ ++ + E + + Y
Sbjct: 218 CFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICY----- 272
Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+
Sbjct: 273 ------GPHDNQRLLLEYGFVSIHNPHACVYVSEDILVKYLPTTDKQM----NKKISI 320
>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
Length = 854
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
V A E +Q G+ + N V+ L L + + + N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571
Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++ S + P++ +L P+L S+ +L +L GS E+ + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626
Query: 228 Y 228
+
Sbjct: 627 F 627
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 88/217 (40%), Gaps = 33/217 (15%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 210
+ E LA +LM + G +SFW PYI+ L D Q R + L W E T L L
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179
Query: 211 SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
+ + G++ +E+ +T W F+ S I + AF+ E+ K
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
+ V L+ S+ LVPL + + + A + + L+V + GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLV-DISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287
Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
Y GP+ N +L++NYGF N D
Sbjct: 288 PNNY-----------GPRSNERLMMNYGFCIRGNVCD 313
>gi|397642897|gb|EJK75526.1| hypothetical protein THAOC_02751 [Thalassiosira oceanica]
Length = 395
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 70/329 (21%), Positives = 131/329 (39%), Gaps = 54/329 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K W G+ E+ ++K R I Y + +AG A VP L+++ + +
Sbjct: 82 FKYWASTMGIEKNDCFKLEE--QDKKQREI-YAMTTRSTEAGTAVLYVPEHLILSSSKAM 138
Query: 141 ----------GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
E +A + ++L E L L ++ E ++G S W ++ L R
Sbjct: 139 AELRTDGMAEAEEYLASVGAESQLREY-YLMLKVLLEYQKGSDSEWHKWLDALPRYYSNA 197
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ +E L L K + ER Y+ + +V F+A + + +P D+
Sbjct: 198 -------VAMTEFCLTCLPPLMKKLAVEERDAQKLLSYDSIQSVPFLADDIKEGFPRDMV 250
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
T A+ +V+ + V ++P+G ++S ++ D+A
Sbjct: 251 TWAYQ--------------IVYTRSVETEDGDLKIIPMG-DFFDHASDYAEIVPQYDEAG 295
Query: 310 QLVVDRPYK--AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP--YDRLVVE 365
Y AG+ + Y+ P+ S LL YGF+DE P Y +L +
Sbjct: 296 NYYAVTAYDVPAGKKLRYIYS----------NPRNPSHLLARYGFIDEICPATYCKL-LP 344
Query: 366 AALNTE--DPQYQDKRMVAQRNGKLSVQV 392
+N E + Y ++M+ R+G+++ +V
Sbjct: 345 PTVNEEMIELGYSQEKMLFYRSGEVADEV 373
>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 490
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 65/307 (21%), Positives = 123/307 (40%), Gaps = 58/307 (18%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 11 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70
Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
V + + + + T + + L L + + + +K G + W
Sbjct: 71 YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
YIR + +V P +S E L G+ + + + +++E++ L +
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
W Q++ +D T FTF+ +K +S VV L + A+VP +
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233
Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
S K + +AV QL + + GE + + Y +P S+++ +
Sbjct: 234 ACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYG----------DEKPASEMVFS 283
Query: 350 YGFVDED 356
YGFV+ +
Sbjct: 284 YGFVENE 290
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 85/207 (41%), Gaps = 23/207 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L ++L+ E+ +G SFW PY+ L + L W+ E+ L TK +
Sbjct: 1 LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ +N L + L +Q P AFT+++FK A+ +V + V++ S
Sbjct: 53 LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYM---SQP 105
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW----YNISFTGHD 335
+ L P A + + VD V D K+ + + Y+ F +
Sbjct: 106 QNSVLSPDEEDKSALAPFLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINY- 164
Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRL 362
GP N KLL+ YGF NP++ +
Sbjct: 165 ---GPHSNEKLLLEYGFTLPCNPHNNI 188
>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
Length = 440
Score = 39.3 bits (90), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 114/281 (40%), Gaps = 37/281 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
LQ G S+P + ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 68 LQEGQVIISLPETCLLTTDTVIRSYLGAYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P +A+ E+ ++ ++ +
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLRAKAEEQRARVREFFSSSRGFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PL 291
LF + I F++ A+ V + V++ K R F+ P L P L
Sbjct: 180 LQPLFSEAVGSI----FSYRALLWAWCTVNTRAVYV-KPRRRRCFSAEPDTCALAPYLDL 234
Query: 292 LAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
L +S + A ++ ++ + E + + Y GP N +LL+
Sbjct: 235 LNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFICY-----------GPHDNQRLLLE 283
Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
YGFV NP+ + V + + DK+M N K+S+
Sbjct: 284 YGFVSIHNPHACVYVSEDILVKYLPSTDKQM----NKKISI 320
>gi|406978090|gb|EKE00118.1| hypothetical protein ACD_22C00090G0009 [uncultured bacterium]
Length = 478
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 74/191 (38%), Gaps = 28/191 (14%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+HY+ ++ G FS P + T + G E + E L SE +Y MY K
Sbjct: 77 PMHYMQKEKEHVKG---FS-PELAIAT---IAGGEKLTEELAIRPTSETI---MYDMYRK 126
Query: 169 KQGKKSFWLPYIRELD----------RQRGRGQLAVE-SPLLWSETELAYLTGSPTKAEI 217
W R+L R R L + S LW E A+LT + +
Sbjct: 127 -------WTNSWRDLPVLINQWCNVVRWEKRTYLFLRTSEFLWQEGHCAHLTHEESTDTV 179
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
+ K+ YN+L +++ + G + + + +TFE ++Q+C H +
Sbjct: 180 IWAINAYKKTYNDLMSIYGIVGVKSESEKFAGAVKTYTFESLMPNGKSLQTCTSHDLGQN 239
Query: 278 LARRFALVPLG 288
++ F G
Sbjct: 240 FSKSFEWTVQG 250
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 67/327 (20%), Positives = 126/327 (38%), Gaps = 66/327 (20%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
++ W+ ++G + L P+ R + + G+ ++P+ + T+E
Sbjct: 1 MEGWLRESGAELDGLELAHFPAIGRGVRTLRC------FKQGERILTIPSGCLWTVEHAY 54
Query: 141 GNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV---- 194
+ + +L + + LS LA+Y+++ + RE R +A
Sbjct: 55 ADAVLGPVLRSAQPPLSVEDTLAIYILFVRS-----------RESGYDGLRSHVAALPAS 103
Query: 195 -ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT-VWFMAGSLFQQYPYDIPTE 252
S + + + EL GS + + I+ +Y L V+ + LF P
Sbjct: 104 YSSSIFFEDDELEVCAGSSLYTITRQLEQRIEEDYRGLVVRVFGLHLDLF-------PLN 156
Query: 253 AFTFEI--FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---------LAYSSKCKA- 300
FT E +K A V S + ++P G PL + +S + K
Sbjct: 157 KFTIENVGYKWALCTVWSRAMDF----------VLPNGNPLRLLAPFADMVNHSPEVKQC 206
Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
+ A + ++ + Y+A + + ++Y GP PNS+LL YGFV DNP
Sbjct: 207 HVYDASSGNLSILAGKDYEAEDQVFIYY-----------GPMPNSRLLRLYGFVIPDNPN 255
Query: 360 DRLVVEAALNTEDPQYQDKRMVAQRNG 386
D + + + P Y+ K+ + G
Sbjct: 256 DSYDLVLSTHPLAPFYEQKQKLWASAG 282
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 50/215 (23%), Positives = 89/215 (41%), Gaps = 34/215 (15%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 219
A+ + E +SFW PY+ EL AV + W++ ELA + + E++E
Sbjct: 49 AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ + I +Y + + + Q + IP+E E +++A V S + L
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153
Query: 280 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
+P L +S + C D +L+ Y GE + + Y
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISY------ 206
Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
G + +S+LL +YGF+ E+N D + + A+L
Sbjct: 207 -----GEKTSSQLLASYGFMYENNAEDTVDITASL 236
>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
melanoleuca]
Length = 440
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 110/280 (39%), Gaps = 35/280 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 68 LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ ++ +
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V H Q+ + L P LL
Sbjct: 180 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 235
Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
+S + KA ++ + E + + Y GP N +LL+ Y
Sbjct: 236 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY-----------GPHDNQQLLLEY 284
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
GFV NP+ + V + + DK+M N K+S+
Sbjct: 285 GFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISI 320
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 17/109 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEK 168
A +D+ G FS+P L ++L T+ LL ++ E L L +M+E+
Sbjct: 43 ALQDIHEGTTLFSLPRELTLSLR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEE 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+G +S W Y+ L ++P+ WS +L L G+ +I
Sbjct: 99 SRGVESKWDVYLSSLPS-------TFDTPMFWSAEDLEELKGTAVPDKI 140
>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 518
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 92/222 (41%), Gaps = 36/222 (16%)
Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
D+ + +P K G+ I Y G + N LL+ YGF N YD +
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCY-----------GQRTNKFLLMWYGFCFNKNRYDSYSLR 298
Query: 366 AALNTEDPQYQD---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-D 406
+N Q + +++V Q + K+++ + +K I+ D
Sbjct: 299 LWINMRQEQLNNDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINID 358
Query: 407 MLPYLRLGYVSDTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
++ YLRL + +++ V+ SL PVSP E VL L+ + T+ +
Sbjct: 359 LIIYLRLYLMMHYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKD 414
Query: 465 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 506
D+ +L + NL+ K R A +K++L + + ++LL
Sbjct: 415 DKELLQNQNLNYKYRFAIIYRLNQKEILQEQISLMNQALILL 456
>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 483
Score = 38.9 bits (89), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 64/292 (21%), Positives = 118/292 (40%), Gaps = 69/292 (23%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
+G S W PY+ +L T LA T + + E L+ + I ++
Sbjct: 111 KGSNSVWYPYLCQLPSYY---------------TILA--TFNDFEVEALQVDDAIWVAQK 153
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
+ + + W A L ++ + + F+ + AF V S +H ++ L P
Sbjct: 154 AKSAIKSDWEDATPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 208
Query: 287 LG-------------------PPLLAYSSKCKAMLAAVD----------DAVQLVVDRPY 317
+G L Y K M + + +A L + Y
Sbjct: 209 VGDLFNYAAPDDDTLLEDEDTAELTNYQQK-NGMTNSSERLTDGGYEDCNAYCLYARKNY 267
Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
K GE +++ Y G N +LL +YGF+ +NP ++ +E L+
Sbjct: 268 KKGEQVLLAY-----------GTYTNLELLEHYGFLLGENPNEKTFIELDLD 308
>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
Length = 415
Score = 38.9 bits (89), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 110/280 (39%), Gaps = 35/280 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 44 LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 103
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ ++ +
Sbjct: 104 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 155
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V H Q+ + L P LL
Sbjct: 156 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 211
Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
+S + KA ++ + E + + Y GP N +LL+ Y
Sbjct: 212 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY-----------GPHDNQQLLLEY 260
Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
GFV NP+ + V + + DK+M N K+S+
Sbjct: 261 GFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISI 296
>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 241
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ +P+ + +L L GSP L+ I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 54/123 (43%), Gaps = 28/123 (22%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR--L 362
V +++ + RP KAGE + Y G P S L+ YGF+ DNPYD L
Sbjct: 315 VTKSLKFPLSRPCKAGEQCFLSY-----------GKHPGSHLITFYGFLPRDNPYDVIPL 363
Query: 363 VVEAALNTED----------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 412
++ +++ ED + +RM+ R + +Q + ++ + YLR
Sbjct: 364 DLDTSVDEEDSSSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVHCYFVYLR 418
Query: 413 LGY 415
LG+
Sbjct: 419 LGH 421
>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 147
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIREL 183
+G S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124
>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 297
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++AG+ VP LV+T+E N + L + +++ + LA +L+ ++
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170
Query: 172 KKSFWLPYIREL 183
SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182
>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
Length = 789
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 43/194 (22%), Positives = 85/194 (43%), Gaps = 26/194 (13%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T + + + K++E + + W+ NG+ V + P + + +AA +D+
Sbjct: 28 TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLMYEKKQGKKS 174
A VP L+++ E + N IA LL+ + + +E + ++ +E +G+ S
Sbjct: 85 QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-NELDT 233
FW PY + ++ + P+LWS+ E+ + +I + K EY NE
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEYENEWKL 191
Query: 234 VW--FMAGSLFQQY 245
V+ F + +Y
Sbjct: 192 VYEAFSKDETYDEY 205
>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
Length = 216
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ ++ + +VP S+ +T E+V N T S +L+L+ +K GK
Sbjct: 22 VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KSF+ PYI L ++ + + L + E ++ Y+ + + + ER ++ ++++L
Sbjct: 77 KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 38.5 bits (88), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 118/286 (41%), Gaps = 56/286 (19%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L + + LS LA Y+++ +
Sbjct: 121 FKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLATYILFIRS------- 173
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYN 229
RE R +A S + ++E EL G+ T + L+R+ I+ +Y
Sbjct: 174 ----RESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYR 227
Query: 230 ELDTVWFMAGS--LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRF 282
L V +A S LF P + F+ E +K A V S + + L F
Sbjct: 228 TL-VVRVLAQSRDLF-------PLDKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPF 279
Query: 283 ALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
A +L +SS+ + + A + ++ + Y+AG+ ++Y G
Sbjct: 280 A------DMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFIYY-----------GS 322
Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
PNS+LL YGFV NP D + + + P ++ K+ + G
Sbjct: 323 IPNSRLLRLYGFVMPGNPNDSYDLVISTHPSAPFFERKQKLWASAG 368
>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
DL-1]
Length = 499
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 58/231 (25%), Positives = 91/231 (39%), Gaps = 42/231 (18%)
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
GN+ + E L++ L L L YE G++S W Y+ L + S + W
Sbjct: 74 GNQEVLE-----TLNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFW 121
Query: 201 SETELAYLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQ 243
S EL L S I E+AE G K+ EY +D + + S+
Sbjct: 122 SSEELEKLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIM 180
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
Y +D+ E + K + ++VPL L + ++ A L+
Sbjct: 181 SYSFDVDDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLS 239
Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
+DA+ + + K GE I YNI G PNS++L YG+V+
Sbjct: 240 YENDALVMTATKDIKKGEQI---YNIY--------GELPNSEILRKYGYVE 279
>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 498
Score = 38.5 bits (88), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 47/213 (22%), Positives = 90/213 (42%), Gaps = 27/213 (12%)
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W YIR + ++ P ++E EL L G+ + + + +++E+ L
Sbjct: 134 WTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS- 185
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLL 292
Q+Y +D T TF+ +K +S VV L + A+VP +
Sbjct: 186 TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHAS 241
Query: 293 AYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
S K + ++ +DA+ QL R +GE + + Y +P S+++ +YG
Sbjct: 242 EDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYG----------SEKPASEMVFSYG 291
Query: 352 FVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
FV+ E ++ ++ + +DP K+M +
Sbjct: 292 FVENERTDAKQIFLDLEIPDDDPLRMAKQMFCK 324
>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
Length = 489
Score = 38.5 bits (88), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ P + ++T +RV ++ IA ++ ++ LS + L + L+ E
Sbjct: 57 LAAARDLRRGELVLRAPRAALLTSDRVTADDPRIAACVSAHRPRLSSVQILIVCLLAEVG 116
Query: 170 QGKKSFWLPYIREL 183
+G+ S W PY+ +L
Sbjct: 117 KGRNSVWYPYLSQL 130
>gi|428183324|gb|EKX52182.1| hypothetical protein GUITHDRAFT_150712, partial [Guillardia theta
CCMP2712]
Length = 205
Score = 38.1 bits (87), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
++D+++ S+P+ L L R + +KL +AL ++YEK + ++SF
Sbjct: 88 TQDVKSNSVVCSIPSKLF--LSRSTTRAAFGSM--ADKLDVRTAMALQILYEKSKKEESF 143
Query: 176 WLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
W +++ L DR+ + +P LW E + L G+ + E E K Y++
Sbjct: 144 WCEWLKVLPDREN------LGTPCLWPEDDQNLLKGT----SVFEEVEASKSLYSK 189
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,913,178,062
Number of Sequences: 23463169
Number of extensions: 319442466
Number of successful extensions: 833965
Number of sequences better than 100.0: 734
Number of HSP's better than 100.0 without gapping: 84
Number of HSP's successfully gapped in prelim test: 650
Number of HSP's that attempted gapping in prelim test: 832737
Number of HSP's gapped (non-prelim): 998
length of query: 523
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 376
effective length of database: 8,910,109,524
effective search space: 3350201181024
effective search space used: 3350201181024
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)