BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008853
(551 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/501 (79%), Positives = 423/501 (84%), Gaps = 46/501 (9%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASH 290
LDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQ
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQ---------------- 271
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
KVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQL
Sbjct: 272 -----------------------KVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQL 308
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
VVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVA
Sbjct: 309 VVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVA 368
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 470
QRNGKL+VQ FHV G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERA
Sbjct: 369 QRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERA 428
Query: 471 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADM 530
VLDQL DYF+ RLAGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+
Sbjct: 429 VLDQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDL 488
Query: 531 IMLLPDVTVSPCPAPYAPLLN 551
I LPD TVSPCPAPY PLL
Sbjct: 489 INQLPDHTVSPCPAPYTPLLK 509
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 809 bits (2089), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/538 (75%), Positives = 442/538 (82%), Gaps = 49/538 (9%)
Query: 21 HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
H PL + S IS+S R +F +S +RR N S SSS+TLVAGSR E V+K
Sbjct: 24 HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81
Query: 74 KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
KE+D GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82 KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141
Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTE
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261
Query: 253 AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPK 312
AF+FEIFKQAFVAVQSCVVHLQ
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQ-------------------------------------- 283
Query: 313 EKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
KVSLARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAGESI VWCGPQPNS
Sbjct: 284 -KVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELVVDRPYKAGESIAVWCGPQPNS 342
Query: 373 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 432
KLL+NYGFVDEDN YDRLVVEAALNTEDPQYQDKRMVAQRNG+LS+Q F+V+AG+EKEA+
Sbjct: 343 KLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAV 402
Query: 433 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 492
DMLPYLRLGYV+ SEMQSVISS GP+CPVSPCMERA+L+Q+ADYFK RLAGYP TLSE
Sbjct: 403 LDMLPYLRLGYVTHPSEMQSVISSQGPVCPVSPCMERAMLEQVADYFKRRLAGYPTTLSE 462
Query: 493 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
DE +L D NL+PKKRVATQLVR+EKK+L++CL+VT D I LPD TVSPCPAPYAPLL
Sbjct: 463 DEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEVTIDFINQLPDHTVSPCPAPYAPLL 520
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 803 bits (2073), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/507 (78%), Positives = 423/507 (83%), Gaps = 52/507 (10%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
HY+AASEDLQ AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQ
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQ---------- 277
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
KVSLARRFALVPLGPPLLAY S CKAMLAAV
Sbjct: 278 -----------------------------KVSLARRFALVPLGPPLLAYRSNCKAMLAAV 308
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
D +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQ
Sbjct: 309 DGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQ 368
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 464
DKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVS
Sbjct: 369 DKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVS 428
Query: 465 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
PCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL
Sbjct: 429 PCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACL 488
Query: 525 QVTADMIMLLPDVTVSPCPAPYAPLLN 551
+ T D+I LPD TVSPCPAPY PLL
Sbjct: 489 KATVDLINQLPDHTVSPCPAPYTPLLK 515
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 789 bits (2037), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/557 (72%), Positives = 439/557 (78%), Gaps = 61/557 (10%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME +C +K ISP L++ S +SIS P S RR F+
Sbjct: 1 MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45
Query: 61 DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
TLV G R EVVSK+ E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46 -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
WFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQ
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQ-------------------- 264
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
KVSLARRFALVPLGPPLLAYSS CKAML AVD AV+LVVDR
Sbjct: 265 -------------------KVSLARRFALVPLGPPLLAYSSNCKAMLTAVDGAVELVVDR 305
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
PYKAGE IVVWCGPQPNSKLL+NYGFVDEDNPYDR+ VEAALNTEDPQYQDKRMVAQRNG
Sbjct: 306 PYKAGEPIVVWCGPQPNSKLLLNYGFVDEDNPYDRIAVEAALNTEDPQYQDKRMVAQRNG 365
Query: 415 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 474
KLSVQVF V+AG+EKEA+SD+LPYLRLGYVSD SEMQSVISS GP+CPVSPCME+AVLDQ
Sbjct: 366 KLSVQVFQVYAGKEKEAVSDILPYLRLGYVSDPSEMQSVISSQGPVCPVSPCMEQAVLDQ 425
Query: 475 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 534
L YF+ RLAGY ++SEDE ML D NL+PKKRVATQLVR+EKKML ACLQ T D+I L
Sbjct: 426 LTVYFRTRLAGYCTSISEDELMLADPNLNPKKRVATQLVRLEKKMLKACLQATVDLINQL 485
Query: 535 PDVTVSPCPAPYAPLLN 551
PD T+ PCPAPYAPLL
Sbjct: 486 PDHTMPPCPAPYAPLLK 502
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/466 (80%), Positives = 401/466 (86%), Gaps = 39/466 (8%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1 MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61 IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALV 324
A+QSCVVHLQ KVSLARRFALV
Sbjct: 181 AIQSCVVHLQ---------------------------------------KVSLARRFALV 201
Query: 325 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 384
PLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDED
Sbjct: 202 PLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDED 261
Query: 385 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 444
N YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGYV
Sbjct: 262 NSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYV 321
Query: 445 SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 504
SD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+P
Sbjct: 322 SDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLNP 381
Query: 505 KKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
KK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 382 KKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLL 427
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 762 bits (1967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/515 (72%), Positives = 418/515 (81%), Gaps = 41/515 (7%)
Query: 36 RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
R P+F S RR+ R S+SDTLVA + + K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27 RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LK+KPS ++ +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85 VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQ- 263
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
VSLARRFALVPLGPPLLAY S
Sbjct: 264 --------------------------------------NVSLARRFALVPLGPPLLAYCS 285
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KLL NYGFVDEDN DRL+VE A
Sbjct: 286 NCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKLLTNYGFVDEDNSNDRLIVEVA 345
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
L+TEDPQYQDKR+VAQRNGKLS+Q F+V+ G+E+EA+SDM+PY+RLGYVSD SEMQSVIS
Sbjct: 346 LSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSDMIPYMRLGYVSDPSEMQSVIS 405
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
S GP+CPVSPCMERAVLDQLADYF RLA YP TL+EDE+MLTD +L+PK+RVATQLVR+
Sbjct: 406 SQGPVCPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDESMLTDGSLNPKRRVATQLVRL 465
Query: 516 EKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
EKKML+ACLQ D+I LPD +VSPCPAPYAP L
Sbjct: 466 EKKMLHACLQAIMDLISQLPDHSVSPCPAPYAPSL 500
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 759 bits (1961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/560 (69%), Positives = 433/560 (77%), Gaps = 55/560 (9%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ G
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVG-------------- 279
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
LARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 280 -------------------------LARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELV 314
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
VDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQ
Sbjct: 315 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQ 374
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
RNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAV
Sbjct: 375 RNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAV 434
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
LDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT+LV++EKK+L ACL T D++
Sbjct: 435 LDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILVACLTTTVDLL 494
Query: 532 MLLPDVTVSPCPAPYAPLLN 551
LPD +SPCPAPYAP L
Sbjct: 495 NQLPDTAISPCPAPYAPSLK 514
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/560 (69%), Positives = 432/560 (77%), Gaps = 55/560 (9%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ G
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVG-------------- 279
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
LARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 280 -------------------------LARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELV 314
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
VDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTE PQYQDKRMVAQ
Sbjct: 315 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEGPQYQDKRMVAQ 374
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
RNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAV
Sbjct: 375 RNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAV 434
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
LDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT+LV++EKK+L ACL T D++
Sbjct: 435 LDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILVACLTTTVDLL 494
Query: 532 MLLPDVTVSPCPAPYAPLLN 551
LPD +SPCPAPYAP L
Sbjct: 495 NQLPDTAISPCPAPYAPSLK 514
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/560 (68%), Positives = 431/560 (76%), Gaps = 53/560 (9%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V ++ + SSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVAG +V SKKE +D DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56 DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ G
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVG-------------- 281
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
LARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 282 -------------------------LARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELV 316
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
VDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQ
Sbjct: 317 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRIIVEAALNTEDPQYQDKRMVAQ 376
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
RNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+C +SPCMERAV
Sbjct: 377 RNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCTMSPCMERAV 436
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
LDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT+LV++EKK+L ACL T D++
Sbjct: 437 LDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILAACLTTTVDLL 496
Query: 532 MLLPDVTVSPCPAPYAPLLN 551
LPD +SPCPAPYAP L
Sbjct: 497 NQLPDTAISPCPAPYAPSLK 516
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/471 (76%), Positives = 397/471 (84%), Gaps = 39/471 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
KQAF A+QSCVVHLQ KVSLAR
Sbjct: 250 KQAFAAIQSCVVHLQ---------------------------------------KVSLAR 270
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
RFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVVWCGPQPNSKLLINYG
Sbjct: 271 RFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVVWCGPQPNSKLLINYG 330
Query: 380 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 439
FVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+
Sbjct: 331 FVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKEREAVLDMLRYM 390
Query: 440 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 499
RLGYVSD SEM+SVISS GP+CPVSPCMERA LDQLADYFKARLAGYP TL+EDE+MLTD
Sbjct: 391 RLGYVSDPSEMESVISSQGPVCPVSPCMERAALDQLADYFKARLAGYPTTLAEDESMLTD 450
Query: 500 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD T+SPCPAPYAPLL
Sbjct: 451 DNLNPKKRVATQYVRLEKKMLHACLQATTDFINQLPDHTISPCPAPYAPLL 501
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 746 bits (1925), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/472 (76%), Positives = 396/472 (83%), Gaps = 40/472 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
KQAF A+QSCVVHLQ KVSLAR
Sbjct: 250 KQAFAAIQSCVVHLQ---------------------------------------KVSLAR 270
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
RFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVVWCGPQPNSKLLINYG
Sbjct: 271 RFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVVWCGPQPNSKLLINYG 330
Query: 380 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 439
FVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+
Sbjct: 331 FVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKEREAVLDMLRYM 390
Query: 440 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 499
RLGYVSD SEMQSVISS GP+CPVSPCMERA LDQLADYFKARLAGYP L+EDE+MLTD
Sbjct: 391 RLGYVSDPSEMQSVISSQGPVCPVSPCMERAALDQLADYFKARLAGYPTILAEDESMLTD 450
Query: 500 -YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD T+SPCPAPYAPLL
Sbjct: 451 GGNLNPKKRVATQYVRLEKKMLHACLQATIDFINQLPDHTISPCPAPYAPLL 502
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/516 (67%), Positives = 403/516 (78%), Gaps = 46/516 (8%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91 MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL Y TGSP K
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
+LER GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQ- 269
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
V+LARRFALVPLGPPLL+Y S
Sbjct: 270 --------------------------------------NVNLARRFALVPLGPPLLSYKS 291
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++LL+NYGFVDEDNP+DRL+VE +
Sbjct: 292 NCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARLLLNYGFVDEDNPHDRLIVEVS 351
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
L+T+DP YQDKR++AQRNGKLSVQ F+++ GREKEA+ DMLPYLRL YVSD SEMQSV+S
Sbjct: 352 LDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLDMLPYLRLAYVSDPSEMQSVLS 411
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
S GP+CPVSPC ERAVLDQL+ YF+ RLAGYP T SEDE +L D +PK++VATQLV +
Sbjct: 412 SQGPVCPVSPCTERAVLDQLSRYFRERLAGYPTTASEDEIVLADPTTNPKRQVATQLVLI 471
Query: 516 EKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 551
EKKMLN+CL ++I LPD+ V+PCP+PY+P+L
Sbjct: 472 EKKMLNSCLAAVYEIIDQLPDLAVTPCPSPYSPILK 507
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/510 (70%), Positives = 401/510 (78%), Gaps = 37/510 (7%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ +S+L
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVA--------SSNL 285
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
+SCT + V LARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 286 DCYASSCT-------------QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELV 332
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
VDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQ
Sbjct: 333 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQ 392
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
RNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAV
Sbjct: 393 RNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAV 452
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTDYN 501
LDQLA+YF RL+GYP T ED+A+ N
Sbjct: 453 LDQLANYFMRRLSGYPTTPKEDDALEASCN 482
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/473 (67%), Positives = 369/473 (78%), Gaps = 42/473 (8%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
IFKQAFVAVQSCVVHLQ KVSL
Sbjct: 251 IFKQAFVAVQSCVVHLQ---------------------------------------KVSL 271
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
ARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+N
Sbjct: 272 ARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLN 331
Query: 378 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 437
YGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLP
Sbjct: 332 YGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLP 391
Query: 438 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
YLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML
Sbjct: 392 YLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAML 451
Query: 498 TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 452 ADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 504
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/473 (67%), Positives = 369/473 (78%), Gaps = 42/473 (8%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
IFKQAFVAVQSCVVHLQ KVSL
Sbjct: 255 IFKQAFVAVQSCVVHLQ---------------------------------------KVSL 275
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
ARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+N
Sbjct: 276 ARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLN 335
Query: 378 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 437
YGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLP
Sbjct: 336 YGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLP 395
Query: 438 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
YLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML
Sbjct: 396 YLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAML 455
Query: 498 TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 456 ADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 508
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/471 (68%), Positives = 366/471 (77%), Gaps = 42/471 (8%)
Query: 84 WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ +GLPP KV + E+P S K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85 WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFK
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264
Query: 261 QAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARR 320
QAFVAVQSCVVHLQ KVSLARR
Sbjct: 265 QAFVAVQSCVVHLQ---------------------------------------KVSLARR 285
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
FALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF
Sbjct: 286 FALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGF 345
Query: 381 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 440
VDEDNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q F V G+EKE IS+MLPYLR
Sbjct: 346 VDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKETISEMLPYLR 405
Query: 441 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
LGY+SD EMQ ++SS G CPVSPC ERAVLDQL Y K+RLAGYP TL EDEAML D
Sbjct: 406 LGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTTLDEDEAMLADG 465
Query: 501 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 551
NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TVSPCPA YAP L
Sbjct: 466 NLEPKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTVSPCPALYAPNLK 516
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/558 (60%), Positives = 395/558 (70%), Gaps = 62/558 (11%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 14 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64
Query: 63 LVAGSRE------VVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 65 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQ------------------- 285
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
KVSLARRFALVPLGPPLL Y S CKAML AVD +V+L+VD
Sbjct: 286 --------------------KVSLARRFALVPLGPPLLTYKSNCKAMLTAVDGSVRLLVD 325
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
RPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRN
Sbjct: 326 RPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRN 385
Query: 414 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 473
GKL++Q F V G+EK+ IS+MLPYLRLGY+SD EMQ ++SS G CPVSPC ERAVLD
Sbjct: 386 GKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLD 445
Query: 474 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIML 533
QL Y K+RLAGYP L EDEAML D +L PKK VAT+LVR+EKKML+ CL+ + I
Sbjct: 446 QLVVYLKSRLAGYPTNLDEDEAMLADGSLEPKKEVATRLVRLEKKMLHGCLEAANEFISG 505
Query: 534 LPDVTVSPCPAPYAPLLN 551
LPD TVSPCPA YAP L
Sbjct: 506 LPDHTVSPCPALYAPELK 523
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/470 (67%), Positives = 367/470 (78%), Gaps = 42/470 (8%)
Query: 84 WMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ GLPP KV ++E+P N K P+ YVAA DLQAGD AF VP SLVVTLERVL
Sbjct: 81 WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFK
Sbjct: 201 TESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFK 260
Query: 261 QAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARR 320
QAFVAVQSCVVHLQ KVSLARR
Sbjct: 261 QAFVAVQSCVVHLQ---------------------------------------KVSLARR 281
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
FALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGF
Sbjct: 282 FALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGF 341
Query: 381 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 440
VDEDNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLR
Sbjct: 342 VDEDNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLR 401
Query: 441 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
LGY+SD EMQS++SS G CP+SPC ERAVLDQL Y ++RLAGYP TL EDEAML D
Sbjct: 402 LGYISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADG 461
Query: 501 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
+L PKK VAT+LVR+EKKM++ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 462 SLEPKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTVSPCPAPYAPEL 511
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/496 (65%), Positives = 374/496 (75%), Gaps = 42/496 (8%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
SSS+ A V E W+ GLPP KV ++E+P + K +P+ YV+
Sbjct: 55 SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A DLQAGD AF V SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQ-------------------- 274
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
KVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDR
Sbjct: 275 -------------------KVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDR 315
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
PYKAGE I++WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNG
Sbjct: 316 PYKAGEPIIIWCGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNG 375
Query: 415 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 474
KL++Q F+V+ G+EK+ +++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQ
Sbjct: 376 KLAIQNFNVYVGKEKQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQ 435
Query: 475 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 534
L Y ++RLAGYP TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I L
Sbjct: 436 LVGYLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDL 495
Query: 535 PDVTVSPCPAPYAPLL 550
PD TVSPCPAPYAP L
Sbjct: 496 PDHTVSPCPAPYAPEL 511
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 305/433 (70%), Positives = 345/433 (79%), Gaps = 39/433 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
GSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQ------------------------ 298
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 358
KVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKA
Sbjct: 299 ---------------KVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKA 343
Query: 359 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 418
GE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++
Sbjct: 344 GEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAI 403
Query: 419 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 478
Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y
Sbjct: 404 QNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGY 463
Query: 479 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 538
++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD T
Sbjct: 464 LESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHT 523
Query: 539 VSPCPAPYAPLLN 551
VSPCPAP+AP L
Sbjct: 524 VSPCPAPFAPELK 536
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 305/520 (58%), Positives = 361/520 (69%), Gaps = 49/520 (9%)
Query: 38 PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
P FG+ V + R S ++ T V E SKK+E DLK WM + GLP
Sbjct: 45 PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104
Query: 93 CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
CKV L E +PS +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
S K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCV
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284
Query: 271 VHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL 330
VHLQ VSLARRFALVPLGPPL
Sbjct: 285 VHLQ---------------------------------------GVSLARRFALVPLGPPL 305
Query: 331 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
LAY S CKAML AVDD V L VDR YKAG+ I VWCGPQPNSKLL+NYGFVDEDNPYDRL
Sbjct: 306 LAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQPNSKLLLNYGFVDEDNPYDRL 365
Query: 391 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 450
VEA+L+TEDP YQ KR + Q+N +L++Q F ++ G+E EA+ DMLPY+RL +++D EM
Sbjct: 366 AVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQIYKGKEMEAVLDMLPYMRLAHLADPEEM 425
Query: 451 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 510
++V + GP+CPVS C ERAVL+QL YF+ RLAGY ++ + + D + KKRVA
Sbjct: 426 ETVSFAQGPVCPVSACNERAVLEQLEQYFEKRLAGYKSSHATEGG---DAKKNAKKRVAE 482
Query: 511 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
+L+ +EK +L L ++I LPD +SPC PY P L
Sbjct: 483 KLLCIEKSILRNALAAVQELISQLPDSAISPCIGPYLPNL 522
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 293/472 (62%), Positives = 344/472 (72%), Gaps = 46/472 (9%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGN
Sbjct: 2 DWMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGN 58
Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
ETIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 59 ETIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTS 118
Query: 203 TEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQ
Sbjct: 119 EELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQ 178
Query: 262 AFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRF 321
AFVAVQSCVVHLQ VSL RRF
Sbjct: 179 AFVAVQSCVVHLQ---------------------------------------GVSLPRRF 199
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
ALVPLGPPLLAY S CKAML A D V+L VDR YK GE I+VWCGPQPN++LL+NYGFV
Sbjct: 200 ALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQILVWCGPQPNTRLLLNYGFV 259
Query: 382 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
D DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F + GREKEA+ +MLPYLRL
Sbjct: 260 DPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYLRL 319
Query: 442 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD-- 499
G+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+ R+A Y T+ ED A+L D
Sbjct: 320 GHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDRALLEDCS 379
Query: 500 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-PCPAPYAPLL 550
+++PK+RVATQL+ +EK++L+ L V LPD +V+ PC + P L
Sbjct: 380 SDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAPPCCGDFVPKL 431
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 293/472 (62%), Positives = 344/472 (72%), Gaps = 46/472 (9%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGN
Sbjct: 2 DWMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGN 58
Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
ETIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 59 ETIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTS 118
Query: 203 TEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQ
Sbjct: 119 EELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQ 178
Query: 262 AFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRF 321
AFVAVQSCVVHLQ VSL RRF
Sbjct: 179 AFVAVQSCVVHLQ---------------------------------------GVSLPRRF 199
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
ALVPLGPPLLAY S CKAML A D V+L VDR YK GE I+VWCGPQPN++LL+NYGFV
Sbjct: 200 ALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQILVWCGPQPNTRLLLNYGFV 259
Query: 382 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
D DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F + GREKEA+ +MLPYLRL
Sbjct: 260 DPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYLRL 319
Query: 442 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD-- 499
G+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+ R+A Y T+ ED A+L D
Sbjct: 320 GHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDRALLEDGS 379
Query: 500 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-PCPAPYAPLL 550
+++PK+RVATQL+ +EK++L+ L V LPD +V+ PC + P L
Sbjct: 380 SDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAPPCCGDFVPKL 431
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 281/468 (60%), Positives = 330/468 (70%), Gaps = 44/468 (9%)
Query: 85 MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
M + GLP C V L E + + +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1 MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS
Sbjct: 61 TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQA
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180
Query: 263 FVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFA 322
FVAVQSCVVHLQ VSLARRFA
Sbjct: 181 FVAVQSCVVHLQ---------------------------------------GVSLARRFA 201
Query: 323 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
LVPLGPPLLAY S CKAML AV D VQL VD YK G+ I VWCGPQPNSKLL+NYGFVD
Sbjct: 202 LVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDPIAVWCGPQPNSKLLLNYGFVD 261
Query: 383 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 442
EDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F ++ G+E EA+ DMLPY+RLG
Sbjct: 262 EDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTFQIYKGKEMEAVRDMLPYMRLG 321
Query: 443 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 502
+++D E+++V + P+C VS C ERAVL+Q+ +F+ RLAGY S D D
Sbjct: 322 HLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFERRLAGYK---SSDTTKAVDAKK 378
Query: 503 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
K+ VA +L+ +EK +L L ++I LPD +SPC PY P L
Sbjct: 379 DAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISPCIGPYLPNL 426
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 274/465 (58%), Positives = 326/465 (70%), Gaps = 62/465 (13%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 9 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59
Query: 63 LVAGSRE------VVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 60 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQ------------------- 280
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
KVSLARRFALVPLGPPLL Y S CKAML AVD +V+L+VD
Sbjct: 281 --------------------KVSLARRFALVPLGPPLLTYKSNCKAMLTAVDGSVRLLVD 320
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
RPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRN
Sbjct: 321 RPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRN 380
Query: 414 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 458
GKL++Q F V G+EK+ IS+MLPYLRLGY+SD EMQ ++SS G
Sbjct: 381 GKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQCILSSEG 425
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/411 (47%), Positives = 264/411 (64%), Gaps = 48/411 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ DLQAG+ A +P+ LV+TL+RV +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49 AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
W +I+ELDR +GRGQ+ +SPLLW E ++ YL GSP AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
VWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQ------------------- 209
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLV 351
V L++RFALVPLGPPLL+YSS KAML +A VQL
Sbjct: 210 --------------------GVPLSKRFALVPLGPPLLSYSSTAKAMLTYNREAKEVQLA 249
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
VDR Y GE I WCGPQPN +LL+NYG V ++NP+D++ + L DP +Q KR V Q
Sbjct: 250 VDRSYTKGEPIEAWCGPQPNRRLLLNYGIVTDNNPHDKMALTVTLPHADPLFQAKRAVLQ 309
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC--PVSPCMER 469
+N + Q F + R+K +LPYLRL + +D + ++ +++L C P+SP ER
Sbjct: 310 QNNLSTQQTFQLQ--RDKGLPELLLPYLRLAHCTDAASLK--LATLDTCCAAPISPENER 365
Query: 470 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
VL QLA + + RL Y T EDE ++ P+++VA +L+R+EK +L
Sbjct: 366 TVLHQLASHLQDRLDRYKTTCEEDEVIIRSTTAGPRQKVAARLLRIEKAIL 416
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/190 (91%), Positives = 182/190 (95%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1 MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61 VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQ 274
A+QSCVVHLQ
Sbjct: 181 AIQSCVVHLQ 190
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 130/160 (81%), Positives = 148/160 (92%)
Query: 392 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 451
++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248
Query: 452 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 511
SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308
Query: 512 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 551
LVR+EKK+LNACLQ TAD+I LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPILK 348
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 172/239 (71%), Positives = 205/239 (85%)
Query: 312 KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 371
++KVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGPQ N
Sbjct: 13 EQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTN 72
Query: 372 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 431
S+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+
Sbjct: 73 SRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQT 132
Query: 432 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 491
+++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL
Sbjct: 133 VAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLD 192
Query: 492 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 550
EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 193 EDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 251
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 165/237 (69%), Positives = 202/237 (85%)
Query: 312 KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 371
++KVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGPQ N
Sbjct: 13 EQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGPQTN 72
Query: 372 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 431
S+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+EK+
Sbjct: 73 SRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKEKQT 132
Query: 432 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 491
+++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL+
Sbjct: 133 VAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPTTLN 192
Query: 492 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 548
EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 193 EDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 191/443 (43%), Positives = 262/443 (59%), Gaps = 48/443 (10%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ W+ ++G P KV L+ + + A+E LQ GD A +P L+VTL+RVL
Sbjct: 9 MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ T+AEL+TT KLSELACL LYL YEKK+GK+ W +I+ELDR +GRG +SPLLW
Sbjct: 66 EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
E + A L GSP EI R +GI++EY ELDTVW++AGSLF + P+ PTE F+F +F
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
+QAF AVQS VVHLQ V+L +
Sbjct: 186 RQAFTAVQSSVVHLQ---------------------------------------GVALGK 206
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
RFALVP+GPPLL YSS KAML ++ V+L VDR Y+ GE+++ WCGPQPNS+LLIN
Sbjct: 207 RFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAYQPGEAVLAWCGPQPNSRLLIN 266
Query: 378 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 437
YG VDE NPYD+L + + ++DP Y+ KR G + Q F + A A +LP
Sbjct: 267 YGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLSTQQTFQLQAAASLPA--QLLP 324
Query: 438 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
YLRL + + ++++ V PV+P E VL+QL + + R + Y T+ EDEA++
Sbjct: 325 YLRLVHSTREADVEGVKWE-EEAGPVAPENELTVLNQLITHLRLRQSRYRTTIEEDEAII 383
Query: 498 TDYNLHPKKRVATQLVRMEKKML 520
D P+ VA +L+++EK +L
Sbjct: 384 ADPAKGPRPTVAARLLKIEKGIL 406
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 165/266 (62%), Positives = 202/266 (75%), Gaps = 29/266 (10%)
Query: 312 KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 371
++KVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGPQ N
Sbjct: 13 EQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGPQTN 72
Query: 372 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 431
S+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+EK+
Sbjct: 73 SRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKEKQT 132
Query: 432 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 491
+++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL+
Sbjct: 133 VAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPTTLN 192
Query: 492 EDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKMLNA 522
EDEAM L D +L PK+ VAT+LVR+EKKML+A
Sbjct: 193 EDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKMLHA 252
Query: 523 CLQVTADMIMLLPDVTVSPCPAPYAP 548
CLQ T + I LPD TVSPCPA YAP
Sbjct: 253 CLQATNEFITDLPDHTVSPCPAQYAP 278
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 192/447 (42%), Positives = 255/447 (57%), Gaps = 53/447 (11%)
Query: 80 DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+L W+ +NG + +V + PS RP+ V A L AG+ A SVP L +TL+
Sbjct: 43 ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
R+ +E +AELLTT+KLSELACLALYLMYEKK KKSFW PYI+ELD+Q+ RG A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
LLW + EL + L GSP + +R GI++EY LDTVWFMAGSLF +YP+D+PTE F+F
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
E+F+QAF VQ+ +VHLQ V
Sbjct: 220 ELFQQAFAVVQASIVHLQ---------------------------------------GVP 240
Query: 317 LARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKL 374
+A+RFALVPLGPPL+AYSS K M+ +D +V+LVV P +AG + WCGPQPNS+L
Sbjct: 241 IAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVSGPVEAGRPVAAWCGPQPNSRL 300
Query: 375 LINYGFVDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 432
L+NYG VDE NP+D+L L T DP + KR V G + Q F V R
Sbjct: 301 LLNYGVVDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLSEAGLATQQSFDVSVARPLP-- 358
Query: 433 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 492
+LPY+ L + ++ SV S +E A L L Y + R A Y L
Sbjct: 359 PQLLPYMMLALATTPEQVASV--SFSDTAGHDRELEAAALAALMAYVQRRTAAYAHPLWR 416
Query: 493 DEAMLTDYNLHPKKRVATQLVRMEKKM 519
D ++ D + P+++VA +L ++EK +
Sbjct: 417 DLEIINDPSSTPRQKVAARLTKIEKSI 443
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/128 (84%), Positives = 117/128 (91%)
Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 1 ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61 YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120
Query: 267 QSCVVHLQ 274
QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 190/451 (42%), Gaps = 74/451 (16%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L L W+ K G P VIL E+ + AA D + G+ A +P + VT
Sbjct: 82 LHQLSEWLSKQGFPTQDVILT---GFGEEGVGL---AAGRDFKEGEVALKIPENYTVTGV 135
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V+ + +A ++ L L+LMYE+ G+KS W PY++ SP
Sbjct: 136 DVVNHPVVAAPAAGR--GDVIGLTLWLMYERSLGEKSVWYPYLQTFPS-------TTLSP 186
Query: 198 LLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+LW+ E + L GSP E+ +R+ ++ EY +L S F + P P E F+
Sbjct: 187 ILWTAEEQQKLLKGSPALEEVQQRSAALEGEYEDLQ-------SYFTKDPQAFPQEYFSL 239
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
E FK AF + S V+L P+
Sbjct: 240 EAFKSAFSVILSRAVYL---------------------------------PS-------- 258
Query: 317 LARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQPNSK 373
A FALVP L + + +A L + D AV VDR YK GE + G + N+
Sbjct: 259 -ADLFALVPYADA-LNHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNAD 316
Query: 374 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 433
LLI YGFVDE+N D L +E L D K+ + Q+ S Q F ++ R +
Sbjct: 317 LLITYGFVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP---T 373
Query: 434 DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 493
+L Y+RL + D + ++ + + E L L + +L Y + ++
Sbjct: 374 QLLTYMRLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGGVDDE 431
Query: 494 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
+L + + ++RVA QL EKK+L + +
Sbjct: 432 IRLLKNKKISQRERVAAQLRLCEKKILTSTM 462
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/465 (25%), Positives = 196/465 (42%), Gaps = 64/465 (13%)
Query: 81 LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
LK WM +NG+ P + + NE + A+ +++ GD F++P
Sbjct: 92 LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
L++T + E A+++T + LSE +AL ++EK +GK+SFW YI L
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
V LW+E +LA L GSP A ++ EY ++ L ++P +
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEIL 258
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTH 309
P E T+E F+ AF + S + L + +A H
Sbjct: 259 PREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLF----------------NH 302
Query: 310 NPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 369
NP A + ++ + + + D V + DR YK E + + GP+
Sbjct: 303 NP-----FANSYI-----------DARQQGLFFSKTDEVVVYADRSYKKMEQVYISYGPK 346
Query: 370 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 429
NS LL+ YGF + NPY+ + V +L+ D Y+ K+ G + F ++ R
Sbjct: 347 GNSDLLLLYGFSLDRNPYNSVDVTVSLDENDELYERKKAFLSEAGLPPTKAFPLYNDRYP 406
Query: 430 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 489
+ ++L YLRL ++ + L + E VLD L + KA +AGYP T
Sbjct: 407 D---ELLQYLRLIQLNTDQLRGRTLEDLSFEKKQTDVNELMVLDSLVEACKATIAGYPTT 463
Query: 490 LSEDEAMLTD----YNLHPKKRVATQLVRMEKKMLNACL-QVTAD 529
+D ++ D L +R+A + R EK +L + VT D
Sbjct: 464 EEQDSKLMNDPGMFRALSKTQRMAVKHRRQEKVILRRTIAAVTKD 508
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 121/490 (24%), Positives = 209/490 (42%), Gaps = 91/490 (18%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
R S + + +G V ++ W+ +NG+ P ++++ E+ + +E
Sbjct: 58 RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116
Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
+ +S + G+ +P L+ T R L ET + + E + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
YEK +G SF+ PY+ L L +PL LWS +L L GSPT + + + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
REY ++ ++ Q P + ++ F+ F+ AF + R C P
Sbjct: 229 LREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFG------ILFSRAICFPS--- 272
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAML--- 341
++R ALVP LL +S C A +
Sbjct: 273 ---------------------------------SKRIALVPYAD-LLNHSPFCSAFIDEE 298
Query: 342 -----AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
V +AV + VDR Y+ E + V GP+ N +LL+ YGF E NP+D + + L
Sbjct: 299 KIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQELLLLYGFSLERNPFDCVEITIGL 357
Query: 397 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 456
+ DP Y +K + + GK +Q F ++ R +M +LR + +++Q+ +
Sbjct: 358 DKTDPLYLEKCRMLESYGKSPLQSFPLYMDRYP---VEMAEFLRFCCIDTETDLQADFGT 414
Query: 457 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YNLHPKKRVATQL 512
+ VS E + LD+L +Y +L YP +L +DE ++ D L +R+A +
Sbjct: 415 I-----VSASNEESALDKLLNYIVDQLRKYPTSLEDDEKIIRDRAMFQTLEKNQRMAIRQ 469
Query: 513 VRMEKKMLNA 522
EK++L+A
Sbjct: 470 RLGEKRILHA 479
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 122/448 (27%), Positives = 187/448 (41%), Gaps = 74/448 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153
Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFE 257
W + E L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFS 204
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
FK AF V S V+L P+
Sbjct: 205 AFKAAFSVVLSRAVYL---------------------------------PS--------- 222
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLI 376
A FALVP G + SS+ + V+L VD+ YK G+ + + ++ LI
Sbjct: 223 AELFALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSADFLI 282
Query: 377 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 436
YGF+DE + D + +E L + D KR + Q G Q F V+ R + +L
Sbjct: 283 RYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNRFP---TQLL 339
Query: 437 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 496
Y RL + D+ + I V E L L + +L + T+ +D
Sbjct: 340 TYTRLARIQDSGLFAKITFEKDLI--VCQTNEYETLMLLMADCRTKLLSFSDTMEDDMQT 397
Query: 497 LTDYNLHPKKRVATQLVRMEKKMLNACL 524
L NL K+RVA QL EK++L +
Sbjct: 398 LKRKNLSYKQRVAAQLRLKEKRILTDTM 425
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 117/443 (26%), Positives = 182/443 (41%), Gaps = 72/443 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + L
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
E + L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSA 205
Query: 259 FKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLA 318
FK AF V S V+L P+ A
Sbjct: 206 FKTAFSVVLSRAVYL---------------------------------PS---------A 223
Query: 319 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLIN 377
FALVP G + SS+ + V+L VD+ YK G+ + + ++ LI
Sbjct: 224 ELFALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSADFLIR 283
Query: 378 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 437
YGF+DE + D + +E L + D KR + Q G Q F ++ R + +L
Sbjct: 284 YGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR---FPTQLLT 340
Query: 438 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
Y RL + D+ + I VS E L L + +L T+ ++ L
Sbjct: 341 YTRLARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLMADCRTKLLSSSDTMEDEMQTL 398
Query: 498 TDYNLHPKKRVATQLVRMEKKML 520
NL K+RVA QL EK++L
Sbjct: 399 RRKNLSYKQRVAAQLRLKEKRIL 421
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 152/354 (42%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P+ P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L
Sbjct: 233 NDETFKWSFGILFSRLVRLP---------------------------------------- 252
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
S+ RFALVP +L ++ + + L V DRPY+ GE + + G + N
Sbjct: 253 -SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNG 310
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV + NP D + + +L D Y++K +++G + Q F V
Sbjct: 311 ELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDALKKHGLSTPQCFPVR 364
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 151/354 (42%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L
Sbjct: 233 NDETFKWSFGILFSRLVRLP---------------------------------------- 252
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
S+ RFALVP +L ++ + + L V DRPY+ GE + + G + N
Sbjct: 253 -SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFTTDRPYQPGEQVFISYGNKSNG 310
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV + NP D + + +L D Y++K +++G + Q F V
Sbjct: 311 ELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 151/354 (42%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L
Sbjct: 233 NDETFKWSFGILFSRLVRLP---------------------------------------- 252
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
S+ RFALVP +L ++ + + L V DRPY+ GE + + G + N
Sbjct: 253 -SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNG 310
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV + NP D + + +L D Y++K +++G + Q F V
Sbjct: 311 ELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 126/497 (25%), Positives = 196/497 (39%), Gaps = 103/497 (20%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+DL L+ W+ KNGL + E ++ + V A +D + G+ VP L+ T
Sbjct: 66 DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122
Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + N + +LL +K S CLAL+L+ EK SFW PYI+ L + G
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+ L ++ EL L GSPT ++ + +Y + LFQ +
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIH-------DLFQIRKDILHIN 226
Query: 253 AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPK 312
AFT++ F A AV S R PQ G + +
Sbjct: 227 AFTWDEFIWAMSAVGS------RQNQVPQWGHNALS------------------------ 256
Query: 313 EKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
+AL+P + D+ + R YK GE + ++ GP+PNS
Sbjct: 257 -------EYALIPAWDMCNHDHGDLQTFWDVNSDSTESHAMRAYKKGEQVYIFYGPRPNS 309
Query: 373 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEA 431
LL++ GFV E+N +D L + L + +DK R++ N K+ Q + G
Sbjct: 310 DLLLHAGFVYENNRFDALAIRVRLAPDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----L 365
Query: 432 ISDMLPYLRLGYVSDTSEMQSVI----------------SSLGPICPVSPCMERAVLDQL 475
D++ +LR+ + + E+Q V+ +S G + R L+
Sbjct: 366 AVDLMAFLRI-HAMNEQELQQVLGAYDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDR 424
Query: 476 ADYFKARLAG---------YPATLS---------EDEAMLTDYNLHPKKRVATQLVRMEK 517
+ +LA YP TL ED+A L +L P R T L EK
Sbjct: 425 NELAALQLAEAKCLSLLSLYPTTLQVANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEK 484
Query: 518 KMLNACLQVTADMIMLL 534
++LN L D I LL
Sbjct: 485 EILNRTL----DAIRLL 497
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/435 (22%), Positives = 183/435 (42%), Gaps = 73/435 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A D+ GD +P +L +T + + + + + ++++E +A +L+YE+ +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W PY+ ++D V W + +LA+L GSP A ++REY+
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240
Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
L + G L +YP P EAF F+ ++ AF + S + L+ + +A
Sbjct: 241 L-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYA 295
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
+ H +P S + + + + D+ V
Sbjct: 296 DLINH---------SPFSQAY---------------------IDARQNGDWLFKSGDEEV 325
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE--------- 399
L DR Y+ E I + GP+ N++LL+ YGF E NP++ + V ++
Sbjct: 326 ILYADRGYRRMEQIYISYGPKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELD 385
Query: 400 ------DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 453
DP ++K ++ G+ + F +A R +ML YLRL ++
Sbjct: 386 DDTIPVDPLAEEKAAFLEQVGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKP 442
Query: 454 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVA 509
++ +S E AVL + +L+ YP + +D A++ D +L +R+A
Sbjct: 443 LAEFDYSRTISLGNEAAVLTSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMA 502
Query: 510 TQLVRMEKKMLNACL 524
+ R EK++L +
Sbjct: 503 VRHRRNEKRLLKRTI 517
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 149/354 (42%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ V SLV+
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L
Sbjct: 233 NDETFKWSFGILFSRLVRLP---------------------------------------- 252
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
S+ RFALVP +L ++ + + L V DRPY+ GE + + G + N
Sbjct: 253 -SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNG 310
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV + NP D + + +L D Y++K +++G + Q F V
Sbjct: 311 ELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 148/354 (41%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 67 ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W++ EL YL S + +ER + YN+L +F +YP P E F
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVF 227
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L P MG
Sbjct: 228 NIESFKWSFGILFSRLVRL------PSMGGN----------------------------- 252
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
ALVP +L +S + L + DRPY+ GE + + G + N
Sbjct: 253 ------VALVPWA-DMLNHSCDVETFLDYDKTSKGIVFTTDRPYQPGEQVFISYGKKSNG 305
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV ++ NP D + + +L D Y++K + + G + Q F +
Sbjct: 306 ELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLELLKNYGLSASQCFPIQ 359
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 145/351 (41%), Gaps = 66/351 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + +K E+ + A ++++ G+ VP SLV+
Sbjct: 71 ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F +YP+ P E F
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVF 231
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L P M
Sbjct: 232 NMETFKWSFGILFSRLVRL------PSMDG------------------------------ 255
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
R ALVP +L +SS+ + L V DRPY+ GE + + G + N
Sbjct: 256 -----RVALVPWA-DMLNHSSEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGRKSNG 309
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 421
+LL++YGFV + NP D + + +L D Y++K +++G Q F
Sbjct: 310 ELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLSVSQCF 360
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 156/366 (42%), Gaps = 70/366 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ K GL K++L S + A++ L+ G+ VP+ L++T +
Sbjct: 16 LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G ++ L E LA++L+ E + + S W PY L + S L W
Sbjct: 70 GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+E E+ +LT SP + + LE + Y +L ++F ++P P++ +T F
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAF 176
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
K AF + S +V L P +G
Sbjct: 177 KWAFGILFSRLVRL------PSVG------------------------------------ 194
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLI 376
+ ALVP +L +S + + L + +V V DR Y++GE + + G + + +L +
Sbjct: 195 KLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQVFISYGKRSSGELFL 253
Query: 377 NYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 435
YGF+ E N +D + +E ++++DP ++ K A G S Q F V R+ + +
Sbjct: 254 AYGFIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQRFPV---RKDGFPAQL 310
Query: 436 LPYLRL 441
L Y RL
Sbjct: 311 LAYARL 316
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 149/354 (42%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 62 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 222
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L P+ + K
Sbjct: 223 NIESFKWSFGILFSRMVRL---------------------------------PSMDGKN- 248
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNS 372
ALVP ++ +S + + L + +V DRPY+ GE + + G + N
Sbjct: 249 -------ALVPWA-DMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNG 300
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV ++ NP D + + +L D Y++K + ++ G Q F +
Sbjct: 301 ELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIR 354
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 149/354 (42%), Gaps = 66/354 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 65 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L P+ + K
Sbjct: 226 NIESFKWSFGILFSRMVRL---------------------------------PSMDGKN- 251
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNS 372
ALVP ++ +S + + L + +V DRPY+ GE + + G + N
Sbjct: 252 -------ALVPWA-DMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNG 303
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV ++ NP D + + +L D Y++K + ++ G Q F +
Sbjct: 304 ELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIR 357
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 151/355 (42%), Gaps = 68/355 (19%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 65 ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P+ P E F
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L
Sbjct: 226 NMETFKWSFGILFSRLVRLP---------------------------------------- 245
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPN 371
S+ + ALVP +L +S + + L D + Q VV DR Y+ E + + G + N
Sbjct: 246 -SMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRTYQPSEQVFISYGKKSN 302
Query: 372 SKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV + NP D++ + +L D Y++K +++G + Q F +
Sbjct: 303 GELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKHGLSTSQCFPIQ 357
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 147/350 (42%), Gaps = 68/350 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ NGLP K+ + +K E+ + A ++++ G+ VP SLV+T +
Sbjct: 81 LQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 134
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
E+L + + LA+YL+ E K S W YI L RQ S L W
Sbjct: 135 SCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYW 187
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ EL YL S + +ER + YN+L +F +YP P E F E F
Sbjct: 188 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETF 241
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
K +F + S +V L P M
Sbjct: 242 KWSFGILFSRLVRL------PSMDG----------------------------------- 260
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLI 376
+ ALVP +L +S + + L D + Q VV DR Y+ GE + + G + N +LL+
Sbjct: 261 KVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQYEPGEQVFISYGKKSNGELLL 318
Query: 377 NYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+YGFV + NP D + + +L D Y++K +++G + Q F V
Sbjct: 319 SYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKHGFSASQCFPVR 368
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 156/372 (41%), Gaps = 53/372 (14%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ +E +AL L+ E+ +G +SFW YI L G + W ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
SP + ++ E+ A L D E FTFE ++ AF + S
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265
Query: 271 VHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL 330
+ L+ + + + H +P S ++ +V P
Sbjct: 266 IRLKASRAGELLAMVPYVDFINH---------SPFSSSYVDAREV------------PKA 304
Query: 331 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
+ K +D V L DR YK E + + GP+ N+ LL+ YGF + NP++ +
Sbjct: 305 FPWEEK--------EDEVVLFADRAYKKFEQVFISYGPKSNADLLLLYGFALDRNPFNSV 356
Query: 391 VVEAALNTEDPQYQDKRMVAQRNGK-LSVQVFHVHAGREKEAISDMLPYLRLGYVS-DTS 448
+ + +D Y K A+ G+ +S F ++A R + +++ +LR+ + D
Sbjct: 357 DLAVGASKDDALYDAKERFARGAGRDVSSAAFPLYADRFPD---ELVQFLRMACATEDHL 413
Query: 449 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 508
+ + + +S E AVLD + D A +A YPA +D + D L +R+
Sbjct: 414 GARPLDDPDNYVDILSLDNELAVLDTIRDACDAAVAAYPAKSGDD---VPDAFLSRNQRM 470
Query: 509 ATQLVRMEKKML 520
A +LV EK++L
Sbjct: 471 AKRLVNTEKRIL 482
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/425 (25%), Positives = 177/425 (41%), Gaps = 98/425 (23%)
Query: 9 SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
S+ + PP+R P H P S +S+ S R + R IR S++
Sbjct: 4 STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47
Query: 65 AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
RE + + E L+ W+ +GLP ++ + ++ E+ + A
Sbjct: 48 PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101
Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
++++ G+ VP SLV+T + G + E++ N + + +A YL+ E S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTV 234
YI L RQ S L W+ EL AYL SP + ++R + YN+L D +
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+ LF P E + E F +F + S +V L P M
Sbjct: 215 FSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRL------PSM------------ 249
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-- 352
EKV ALVP +L +S + + L D + Q +V
Sbjct: 250 -----------------DEKV------ALVPWA-DMLNHSPEVETFL-DFDKSSQGIVFT 284
Query: 353 -DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 409
DR Y+ GE + + G + + +LL++YGFV ++ NP D + + +L+ D Y++K
Sbjct: 285 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDKCYKEKLQA 344
Query: 410 AQRNG 414
+RNG
Sbjct: 345 LKRNG 349
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 104/458 (22%), Positives = 192/458 (41%), Gaps = 64/458 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W +NG V +K++P + + + A E ++ +P LV+T ++
Sbjct: 50 WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
++ EL L+GS E L+ I R+Y + F L + P+ + FT++++
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLY 210
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQ-TSCTTTRTPLSPTHNPKEKVSLA 318
+ A AV + R P ++ V + T+ T PL N +
Sbjct: 211 RWAVSAV------MTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSD----- 259
Query: 319 RRFALVPLGPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
G L Y S +C AM R + GE + ++ G + N++
Sbjct: 260 --------GKVLTDYDSSASMVRCYAM-------------RDFDKGEEVTIFYGKRTNAE 298
Query: 374 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 433
I+ GFV EDN YD + ++ ++ +DP + K + + + LS+ R++
Sbjct: 299 FFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCE-DHDLSLSGTFALVARDRPVSE 357
Query: 434 DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 493
D+ +LR+ + D S+ ++ S I S R L L + L +P + E
Sbjct: 358 DLSTFLRILVLKDASQPEAF--SAEHILTSSDSNARDALTFLVVRIELLLKAFPKSDEEY 415
Query: 494 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
E ++ D + + ++A +L +E K+L + L+ + +
Sbjct: 416 EDIIKDGASNARVKMAARLRLLESKVLASVLETLGNHV 453
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 113/470 (24%), Positives = 183/470 (38%), Gaps = 80/470 (17%)
Query: 68 REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
R E+ L L WM + G L P I+++ V A D+ G+
Sbjct: 2 RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54
Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
+P + + TLER +G ++L N + LA +L+ EK +G SFW P++
Sbjct: 55 QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL +SE E A + GS LER ++R+ E + +
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEVQRQSFEQE---------Y 151
Query: 243 QQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTT 301
Q +P E F FE + A +++ S + L+ P +
Sbjct: 152 AQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGPSL------------------- 192
Query: 302 RTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 361
PLS N ++ PP + +S+ +++ R AG
Sbjct: 193 -VPLSDMFNHRQ--------------PPDVLWSTSEDGQ------TFRMIAQRAVPAGTE 231
Query: 362 IVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 420
I G + + L++ GFV D + D + + L DP K+ + +
Sbjct: 232 IHTHYGAKSSDVFLLHSGFVPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLASATAKHP 291
Query: 421 FHVHAGREKEAISDMLPYLRLGYVSDTSEM---QSVISSLGPICPVSPCMERAVLDQLAD 477
F V + A + +LR+ + S + ++S I PVS E VL LA
Sbjct: 292 FKVSRQGKYLASWSVFSFLRMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAA 351
Query: 478 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 527
+ RL +P TL EDE +L + L P +R L R EK++L L++T
Sbjct: 352 ACEERLKAFPTTLEEDERLLREGPLSPNERSCVLLRRQEKRLLGDYLELT 401
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 103/461 (22%), Positives = 190/461 (41%), Gaps = 72/461 (15%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
S +E++ W+ N + V ++ H + + + A+ D + G+ ++P
Sbjct: 71 SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+++T + N + L+ N++ + LAL+++ E SFWLPY++ L
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ SPL ++ +L L SPT +E++ + I R+Y + F L + P
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFNL-FQGHELASKLP 232
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLS 306
I + ++ ++ A +V + R P + + S L+ L C T ++
Sbjct: 233 --IQVKNICYDDYRWAVSSV------MTRQNQIPTLDGQRMISALIPLWDMCNHTNGQIT 284
Query: 307 PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 366
+ K S+C ++ V AG + ++
Sbjct: 285 TDFSLKND-------------------RSECFSLEGTV-------------AGAQVFIFY 312
Query: 367 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 426
G + N++LLI+ GFV N DRL + ++ DP + K V R + ++F +H G
Sbjct: 313 GSRSNAELLIHNGFVYPQNHSDRLTIRLGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG 372
Query: 427 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-------VSPCMERAVLDQLADYF 479
SD L +LR+ +++ ++++ ++ I VS ER LA
Sbjct: 373 VNP-VDSDTLAFLRVVVMTE-DDLRTALACRQQISKLRDFDDFVSEDNERKAWAFLATRV 430
Query: 480 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
L YP + ED +L +L R+A QL EK +L
Sbjct: 431 LLLLKAYPTSAQEDATLLQGNDLSTHARLAVQLRHCEKNIL 471
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 145/344 (42%), Gaps = 66/344 (19%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E F +F + S +V L
Sbjct: 232 NIETFLWSFGILFSRLVRLP---------------------------------------- 251
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
S+ R ALVP +L +S + + L + DR Y+ GE + + G + +
Sbjct: 252 -SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVFISYGKKSSG 309
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
+LL++YGFV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 310 ELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNG 353
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 144/344 (41%), Gaps = 66/344 (19%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E F +F + S +V L
Sbjct: 232 NIETFLWSFGILFSRLVRLP---------------------------------------- 251
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
S+ R LVP +L +S + + L + DR Y+ GE + + G + +
Sbjct: 252 -SMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVFISYGKKSSG 309
Query: 373 KLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
+LL++YGFV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 310 ELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNG 353
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 150/355 (42%), Gaps = 68/355 (19%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ ++ N R + A ++++ G+ VP SLV++
Sbjct: 65 ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E E+L N + + +A YL+ E K S W YI L RQ
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P P E F
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E FK +F + S +V L
Sbjct: 226 NIETFKWSFGILFSRLVRLP---------------------------------------- 245
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPN 371
S+ + ALVP +L ++ + + L D A Q VV DR Y+ GE + + G + N
Sbjct: 246 -SMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVFTTDRAYQPGEQVFISYGKKSN 302
Query: 372 SKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+LL++YGFV ++ NP D + + +L D Y++K +++G + Q F +
Sbjct: 303 GELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQ 357
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 113/460 (24%), Positives = 187/460 (40%), Gaps = 75/460 (16%)
Query: 49 RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
R R R+++ L+ AGS E+ + E + +LK+W+ + GLPP K+ P
Sbjct: 26 RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
+ A++ + G++ S+P LV+T L + LL L + LAL+
Sbjct: 84 GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L ++ G W PY+R L + G L WSE E+ +L GS ++ LE
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMG 283
+ + E+ V A + + + AF + AF + S +V L G Q
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEA 244
Query: 284 TESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA 343
+A L H C L +S A
Sbjct: 245 LLPWADLLNH---DCAAA---------------------------SFLDWS--------A 266
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYD--RLVVEAALNTE 399
+ AV L +R Y+AGE +++ G + + +LL++YGF + NP+D RL++E L
Sbjct: 267 TEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGFCPDLGSNPHDGCRLLLE--LAPG 324
Query: 400 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY--LRLGYVSDTSEMQSVISSL 457
D K +++G + Q+F + R A +++ Y V E + + L
Sbjct: 325 DAARNWKAAALRQHGLAASQLFPL---RMAAAPFELVHYTAFSAAVVGSRQEAEQLARRL 381
Query: 458 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
+ P ++ A L+ + KA LA YP + D A L
Sbjct: 382 FEEGDIPPALQTAALEAVVAACKAALAAYPRSFDGDRAEL 421
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 139/345 (40%), Gaps = 54/345 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + ELA + S E + R I+ E+NE+ + FQ+Y + + ++ F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASF 195
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
K A+V C R ++ FA + H +S E+V +
Sbjct: 196 KHAYVTATVC-SRAWRIDGLEKLAMVPFADFMNHDWSSNAMLTYDTDNGSTEVEEVKV-- 252
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
YS L QL D+ Y AGE + + GP N+ L +++G
Sbjct: 253 -------------YSDCLDIALF----CAQLFADKNYAAGEQVTISFGPLCNASLALDFG 295
Query: 380 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
F NP+D++ + ++ D ++K +Q H H
Sbjct: 296 FTVPYNPWDKVQLWLGISRRDSLRKEK-----------LQYLHAH 329
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 147/345 (42%), Gaps = 68/345 (19%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ L EK E+ + A ++++ G+ VP +LV+T
Sbjct: 74 ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N + +++ + + LA YL+ E S W YI L RQ
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TE+ AYL SP + + R + YN+L +F ++P P + +
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRD------RIFSKHPDLFPEKVY 234
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
E F+ +F + S +V L+
Sbjct: 235 NMENFRWSFGILFSRLVRLE---------------------------------------- 254
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPN 371
S+ + ALVP +L +S + A L D + Q +V DR Y+ GE + + G + +
Sbjct: 255 -SMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVFTTDRSYQPGEQVFISYGKKSS 311
Query: 372 SKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
+LL++YGFV ++ NP D + +L D Y++K +++G
Sbjct: 312 GELLLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQALKKHG 356
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 198/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 72 GKREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L+ T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 126 LLXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 182 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 230 NKLPLKDSFTYEDYRWAVSSVXT------RQNQIPTEDGSRVTLALIPLWDXCNHTNGLI 283
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 284 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 311
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 312 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVFALHF 371
Query: 426 GREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ ++ S + + + PVS E + L
Sbjct: 372 -TEPPISAQLLAFLRVFCXTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 430
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L +++L + + A +L EK++L ++ A
Sbjct: 431 DRASLLLKTYKTTIEEDKSVLKNHDLSVRAKXAIKLRLGEKEILEKAVKSAA 482
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 107/493 (21%), Positives = 205/493 (41%), Gaps = 84/493 (17%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 73 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FTF+ ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTFDDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + +K+GE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFKSGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF E+N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV 372
Query: 426 GREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ G++ + + + PVS E + +L
Sbjct: 373 -TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLW 427
Query: 477 DYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIM 532
+ +AR L Y T+ +D +L ++ +A +L R+EK++L L+ +D
Sbjct: 428 TFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRK 487
Query: 533 LLPDVTVSPCPAP 545
L + P P
Sbjct: 488 LYSKNSEEGTPLP 500
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 107/493 (21%), Positives = 205/493 (41%), Gaps = 84/493 (17%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 72 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FTF+ ++ A +V + R P L+ L C T +
Sbjct: 230 NKLPLKDSFTFDDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 283
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + +K+GE I ++
Sbjct: 284 TTGYNLE--------------------------------DDRCECVALQDFKSGEQIYIF 311
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF E+N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 312 YGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV 371
Query: 426 GREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ G++ + + + PVS E + +L
Sbjct: 372 -TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLW 426
Query: 477 DYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIM 532
+ +AR L Y T+ +D +L ++ +A +L R+EK++L L+ +D
Sbjct: 427 TFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRK 486
Query: 533 LLPDVTVSPCPAP 545
L + P P
Sbjct: 487 LYSKNSEEGTPLP 499
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 112/411 (27%), Positives = 165/411 (40%), Gaps = 43/411 (10%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 83 DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + PLLW+ EL L GSP + + + R E EY +
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASI----- 189
Query: 237 MAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
+ P D P +A+ F + F A V + V L C + LV
Sbjct: 190 --ADQIRSNPNDFPPDAYEFLTRDAFVDALATVLARAVWLNAANCYAMV-------PLVD 240
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
L L +P VS A +G P LA ++ AA + + +
Sbjct: 241 L----------LPLVGSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSAN 290
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
+ + V + L + G VDE + D L A+ D Y+ KR + +
Sbjct: 291 DAQQTARVVCVDPLARNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGM 350
Query: 414 G-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 472
G Q F V A R +L Y+R V D E+ SV I VSP E VL
Sbjct: 351 GMSADGQTFPVFADRMP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVL 405
Query: 473 DQLADYFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 522
L + LA Y ++ E E + L + L ++RVA +L EK+++NA
Sbjct: 406 QLLMQDAREMLAEYESSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 111/462 (24%), Positives = 182/462 (39%), Gaps = 69/462 (14%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ + KE + V D +AG A VP S V
Sbjct: 48 EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++EL + EL LAL+L E+ +G S W PY++ L +
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152
Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W++ + A L GSP A+ +ER++ + EY + V + P P EA
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEV-------IKSDPSSYPPEA 205
Query: 254 FTFEI---FKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
+ F F A V + L C LV L + P+
Sbjct: 206 YEFLTEARFVDALATVCAKATWLPTAQCY----------ALVPLLDVISIGGAPVPGVLP 255
Query: 311 PKEKVSLARRFALVPLGPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
P + R GP + +C AA + +QL +
Sbjct: 256 PSASDGVVR------CGPADYDVDTASVVLRCATKAAANSEVIQLDALQ----------- 298
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVH 424
+ N +L +N G+VD+ +P D + ++ + T D + K+ V + G + Q F V+
Sbjct: 299 ---RNNGELFLNTGYVDQKHPGDYIYMKTDIQTSDRLFTAKKQVLEGMGFTAADQYFPVY 355
Query: 425 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 484
R + + YLR V D EM +V I VS E +L L + +A
Sbjct: 356 KDRMP---TQLYSYLRFSRVQDPGEMMAVSFEEDKI--VSVMNEYEILQILMGDCRELMA 410
Query: 485 GYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 525
Y ++ +L + P + + +RM EKK++ A +
Sbjct: 411 EYDTNEEDELNLLKLSDQMPVREIEAAKLRMSEKKLIGATMN 452
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 180/453 (39%), Gaps = 97/453 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
A + GD +P L +T + R LG + + E ++E +A L++EK +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264
Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
G+ SF+ Y+ E+D V W + +LA+L GSP A ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316
Query: 228 YNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
Y++L + G L ++P P E +TFE ++ AF + S + L+
Sbjct: 317 YDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLR----------- 360
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA-- 343
+ + R A+VP L+ +S+ +A + A
Sbjct: 361 ----------------------------NLQVGERLAMVPYAD-LINHSAFSQAFIDARE 391
Query: 344 --------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
++ V L DR Y+ E + + G + N++LL+ YGF E NPY+ + V +
Sbjct: 392 SGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVS 451
Query: 396 LN-------------TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 442
+ EDP +K G+ F +A R +ML YLRL
Sbjct: 452 IAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDFPCYADRYP---VEMLEYLRLM 508
Query: 443 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 502
++ +S +S E +VL + K +L +P T +D A++ D +
Sbjct: 509 MMTPEDTRGKPLSDFDYSRTISSANEASVLRSVVAAVKYQLGLFPQTEEDDAAIIKDKGM 568
Query: 503 HP----KKRVATQLVRMEKKMLNACLQVTADMI 531
+R+A + R EK++L L I
Sbjct: 569 FRLFSYNQRMAVRHRRNEKRLLKRTLAALEKQI 601
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 201/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L +++L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 197/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-- 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPQA 230
Query: 247 YDIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+ +P E+FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 HKLPLKESFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPVSAQLLAFLRVFCMTEGELKEHLLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ L ++ L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKTFLKNHGLSVRATMAVKLRLGEKEILEKAVESAA 483
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 106/472 (22%), Positives = 199/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E E+ + A+ D++A + VP
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 409
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 410 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 463
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 464 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 491
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 492 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 551
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 552 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLE 610
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED + L +++L + +A +L EK++L ++ A
Sbjct: 611 DRASLLLKTYKTTVEEDRSFLRNHDLSVRAAMAVKLRLGEKEILERAVKSAA 662
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 201/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P +AFT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L D++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLKDHDLSVRAAMAIKLRLGEKEILEKAVKSAA 483
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 201/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L +++L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 103/450 (22%), Positives = 185/450 (41%), Gaps = 62/450 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W NG V +K+ P + A E ++ + VP L++T
Sbjct: 80 WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+S EL L GS E L+ I R+Y+ + F L + PY + FT++++
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLY 240
Query: 260 KQAFVAV---QSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
+ A AV Q+ V G + GT++ T PL N +
Sbjct: 241 RWAVSAVMTRQNAVPLTDTAGGDDEDGTDAM------------TAMVPLWDMCNHSD--- 285
Query: 317 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 376
+ F + +L +C AM R ++ G+ + ++ G + N++ I
Sbjct: 286 -GKVFTDYDISANML----RCYAM-------------RDFEKGQEVTIFYGRRTNAEFFI 327
Query: 377 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 436
+ GFV +N +D + ++ ++ +DP Y K + + +F + RE+ D+
Sbjct: 328 HNGFVFPENRHDSVDIKLGISKQDPLYAVKAKLCDDHELTPSGIFAL-VPRERPVCEDLS 386
Query: 437 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 496
+LR+ + D S+ S I + R L+ L + L +P + E E +
Sbjct: 387 TFLRILVLKDASQAASFTDE--HIMVATDDNAREALNFLIVRIQLLLRAFPKSDQEYENI 444
Query: 497 LTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
+ D + + ++A QL +E+K+L A L+
Sbjct: 445 IADEGSNGRLKMAAQLRLLERKILTAVLET 474
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 99/450 (22%), Positives = 194/450 (43%), Gaps = 75/450 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+F + Q +P+ +P ++FT+E ++ A +V + R P
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVT 185
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L+ L C T ++ +N + DD
Sbjct: 186 LALIPLWDMCNHTNGLITTGYNLE--------------------------------DDRC 213
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 214 ECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAE 273
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGP 459
V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 274 VLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNS 332
Query: 460 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
PVS E + L D L Y T+ ED+++L +++L + ++A +L EK++
Sbjct: 333 EFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEI 392
Query: 520 LNACLQVTADMIMLLPDVTVSPCPAPYAPL 549
L ++ A + P + +SP APL
Sbjct: 393 LEKAVKSAA----VNPGI-LSPTDGGKAPL 417
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 136/338 (40%), Gaps = 65/338 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ A+L+ + K L+ ++ +N+L+ FM + P E F E F
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESF 201
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
K AF + S +V L SL +
Sbjct: 202 KWAFGILFSRLVRLP-----------------------------------------SLGQ 220
Query: 320 RFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
+ AL+P G +L + ++ L + ++ +DR Y++ + + + G + N +LL+
Sbjct: 221 KLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNKEVFISYGKRSNGELLVA 279
Query: 378 YGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
YGFV N D + + L+ D Y+ K + +G
Sbjct: 280 YGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/425 (22%), Positives = 173/425 (40%), Gaps = 76/425 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK---------QAFVAVQSCVVHLQRCGCTPQ 281
+ Q +P+D+ + F F + + Q+ V + + +Q+ +P
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPD 253
Query: 282 MGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAML 341
+ +A ++ + +R + NP
Sbjct: 254 ITAPKYA---WEIRGAGLFSRELVFSLRNPT----------------------------- 281
Query: 342 AAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
P KAG+ +++ + + N++L ++YG + + + + + D
Sbjct: 282 -------------PVKAGDQVLIQYDLNKSNAELALDYGLTESRSERNAYTLTLEIPESD 328
Query: 401 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SS 456
Y DK +A+ NG F + E+ ++MLPYLRL + D ++S+ S
Sbjct: 329 SFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRLVALGGEDAFLLESIFRNSI 386
Query: 457 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
G + P+SP E + + D + L+GY T++EDE +L + ++ P+ +A +
Sbjct: 387 WGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKLLAEGDIDPRLEIAITIRLG 446
Query: 516 EKKML 520
EKK+L
Sbjct: 447 EKKVL 451
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 106/469 (22%), Positives = 197/469 (42%), Gaps = 76/469 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A S TR
Sbjct: 238 HKLPLKDSFTYEDYRWA---------------------------------VSSVMTRQNQ 264
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
PT + +V+LA L+PL S DD + V + ++AGE I ++
Sbjct: 265 IPTEDGS-RVTLA----LIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIF 319
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 320 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 379
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 380 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLE 438
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
D L Y T+ ED++ L +++L + +A +L EK++L ++
Sbjct: 439 DRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVK 487
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A S TR
Sbjct: 231 HKLPLKDSFTYEDYRWA---------------------------------VSSVMTRQNQ 257
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
PT + +V+LA L+PL S DD + V + ++AGE I ++
Sbjct: 258 IPTEDGS-RVTLA----LIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVKSAA 483
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A S TR
Sbjct: 238 HKLPLKDSFTYEDYRWA---------------------------------VSSVMTRQNQ 264
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
PT + +V+LA L+PL S DD + V + ++AGE I ++
Sbjct: 265 IPTEDGS-RVTLA----LIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIF 319
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 320 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 379
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 380 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLE 438
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 439 DRASLLLKTYKTTIEEDKSFLKNHDLSARATMAVKLRLGEKEILERAVKSAA 490
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 98/414 (23%), Positives = 171/414 (41%), Gaps = 58/414 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ Q +P+D+ + F F F +C+
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCL---------------------- 230
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ PL+ N ++ A ++A G A L + + L
Sbjct: 231 ---EGQSLVLIPLADLANHSPDIT-APKYAWEIRG-----------AGLFSRELVFSLRN 275
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
P KAG+ +++ + + N++L ++YG + + + + + D Y DK +A+
Sbjct: 276 PTPVKAGDQVLIQYDLNKSNAELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAE 335
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPC 466
NG F + E+ ++MLPYLRL + D ++S+ S G + P+SP
Sbjct: 336 SNGMGESAYFDIVL--EQPLPANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPA 393
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
E + + D + L+GY T++EDE +L + ++ P+ +A + EKK+L
Sbjct: 394 NEELICQVIRDACTSALSGYSTTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 447
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 113/456 (24%), Positives = 173/456 (37%), Gaps = 63/456 (13%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ ++ KE R VA D+ AG +VP V
Sbjct: 47 EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++ L EL LAL+L E+ +G S W PY++ L
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTL-------AANP 151
Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W+E E A L GSP + +ER+ + EY + V + P P EA
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEA 204
Query: 254 FTF---EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
+ F E F A V + L C LV L T +P+
Sbjct: 205 YEFFTEERFVDALATVCAKATWLPTASCY----------ALVPLLDVITIAGSPVPGVSP 254
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP-- 368
P K +AR A + + S+ KA S VV P
Sbjct: 255 PSAKDGIARCAADYDVDSACVVLSAVVKA------------------PANSRVVQLDPLQ 296
Query: 369 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGR 427
+ N +L +N G VD+ +P D L + + D + K+ V + G Q F V+
Sbjct: 297 RNNGELFLNTGRVDQKHPGDYLYMRTEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY--- 353
Query: 428 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 487
E + + YLR V D EM +V I VS E +L L + ++ Y
Sbjct: 354 EDRMPTQLYSYLRFARVQDPGEMMAVSFEEDKI--VSVMNEYEILQLLMGDCRELMSEYD 411
Query: 488 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 523
++ +L + + + +RM +K L C
Sbjct: 412 TNEEDELNLLKLSDTMRVREIEAAKLRMSEKKLIGC 447
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A S TR
Sbjct: 238 HKLPLKDSFTYEDYRWA---------------------------------VSSVMTRQNQ 264
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
PT + +V+LA L+PL S DD + V + ++AGE I ++
Sbjct: 265 IPTEDGS-RVTLA----LIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIF 319
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 320 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 379
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 380 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLE 438
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 439 DRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVKSAA 490
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 106/490 (21%), Positives = 204/490 (41%), Gaps = 84/490 (17%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
E+ +L W +NG L E P + A+ +++A + VP L++
Sbjct: 1 EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54
Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 55 TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DI 249
++PL ++E E+ YL + ++ + + R+Y +F + Q +P +
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNANKL 158
Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPT 308
P ++FTF+ ++ A +V + R P L+ L C T + ++
Sbjct: 159 PLKDSFTFDDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTG 212
Query: 309 HNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 368
+N + DD + V + +K+GE I ++ G
Sbjct: 213 YNLE--------------------------------DDRCECVALQDFKSGEQIYIFYGT 240
Query: 369 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 428
+ N++ +I+ GF E+N +DR+ ++ ++ D Y K V R G + VF +H E
Sbjct: 241 RSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TE 299
Query: 429 KEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 479
+ +L +LR+ G++ + + + PVS E + +L +
Sbjct: 300 PPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFL 355
Query: 480 KAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLP 535
+AR L Y T+ +D +L ++ +A +L R+EK++L L+ +D L
Sbjct: 356 EARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYS 415
Query: 536 DVTVSPCPAP 545
+ P P
Sbjct: 416 KNSEEGTPLP 425
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKTVKSAA 483
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 202/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 202/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 286
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 287 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 314
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 315 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 374
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 375 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 433
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 434 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 202/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 88 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 141
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 142 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 197
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 198 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 245
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 246 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 299
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 300 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 327
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 328 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 387
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 388 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 446
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 447 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 498
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 111/449 (24%), Positives = 174/449 (38%), Gaps = 106/449 (23%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DL W+ NGL K+ L+ + + A+E+++ G+A V S ++T+ER
Sbjct: 66 DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER E + E+ + +P DI A
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALR 219
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
+ AF + S ++ L MG E
Sbjct: 220 W-----AFDILFSRLIRLD------AMGGE------------------------------ 238
Query: 316 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 375
ALVP +L + C A + AV L DR Y AGE + G +P+S+LL
Sbjct: 239 -----LALVPW-ADMLNHKPGCAAFIDLNGSAVNLTTDRAYAAGEQVWASYGQRPSSELL 292
Query: 376 INYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH-AGREKEAI 432
I+YGF E +NP D + ++ DP Q K V +R G V+ F + G ++ +
Sbjct: 293 ISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRRMGLSPVETFPLRLNGYPRQLL 352
Query: 433 -------------SDMLPYLRLGYVSDTSEMQSVISSL------------GPICPVSPCM 467
S++ R + + QS+ S+ G I P
Sbjct: 353 QYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVRGLAQGQARGKQGVILGGVPG- 411
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDEAM 496
E AV + LAD L+ YP +L +D+ +
Sbjct: 412 EIAVREMLADMCAEALSAYPNSLEKDKGI 440
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 202/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 286
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 287 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 314
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 315 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 374
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 375 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 433
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 434 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 485
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 202/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+++L + +L + ++A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+A+L L + ++A +L EK++L +Q A
Sbjct: 432 DRASLLLKTYKTTIEEDKAVLKGPELPTRMKMAVKLRLGEKEILERTVQSAA 483
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 201/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQSLPGE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E S E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFQAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y + ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTNIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 131/308 (42%), Gaps = 62/308 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +++ G+ VP SLV++ + N + +++ + + + LA YL+ E
Sbjct: 13 LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 73 SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
+F ++P P E + E F+ +F + S +V L
Sbjct: 126 ------RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP----------------- 162
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
S+ + ALVP +L ++ + A L D + Q +
Sbjct: 163 ------------------------SMDGKVALVPWA-DMLNHNPEVDAFL-DFDKSSQGI 196
Query: 352 V---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDK 406
V DR Y+ GE + + G + + +LL++YGFV ++ NP D + +LN D Y++K
Sbjct: 197 VFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFSVSLNKSDDCYREK 256
Query: 407 RMVAQRNG 414
+R+G
Sbjct: 257 LQALKRHG 264
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 145/347 (41%), Gaps = 68/347 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLA 318
F+ +F + S +V L S+
Sbjct: 235 FRWSFGILFSRLVRLP-----------------------------------------SMD 253
Query: 319 RRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 376
R ALVP +L +S + + L + DR Y+ GE + + G + + +LL+
Sbjct: 254 GRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVFISYGKKSSGELLL 312
Query: 377 NYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 421
+YGFV ++ NP D + + +LN D Y++K +RNG + F
Sbjct: 313 SYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+A L ++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKAFLKGHDLSIRATMAVKLRLGEKEILERAVKSAA 483
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 145/347 (41%), Gaps = 68/347 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLA 318
F+ +F + S +V L S+
Sbjct: 235 FRWSFGILFSRLVRLP-----------------------------------------SMD 253
Query: 319 RRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 376
R ALVP +L +S + + L + DR Y+ GE + + G + + +LL+
Sbjct: 254 GRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVFISYGKKSSGELLL 312
Query: 377 NYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 421
+YGFV ++ NP D + + +LN D Y++K +RNG + F
Sbjct: 313 SYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 99/456 (21%), Positives = 190/456 (41%), Gaps = 76/456 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 512
D L Y T+ ED+++L + +L + ++A +L
Sbjct: 432 DRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKL 467
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 108/423 (25%), Positives = 174/423 (41%), Gaps = 71/423 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ L +AL L+ E +G
Sbjct: 84 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--NE 230
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +E
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGVK-EYVQSE 191
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA-- 288
D+V S + D+ + TF+ F AF ++S V P++ + A
Sbjct: 192 FDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVF--------PELRGDKLALI 239
Query: 289 --SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
+ LV+ + T+ + + K K R L +S + +
Sbjct: 240 PFADLVNHSPNITSEGS----SWEIKGKGLFGRE----------LMFSLRTPVNV----- 280
Query: 347 AVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 405
K+G+ I + + + N++L ++YGFV+ + D V ++ DP Y D
Sbjct: 281 ----------KSGQQIYIQYDLDKSNAELALDYGFVESNPSRDSFTVTLEISESDPFYGD 330
Query: 406 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLG 458
K +A+ NG F V + MLPYLRL + T SV L
Sbjct: 331 KLDIAEANGLGETAYFDVIL--NEPLPPQMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE 388
Query: 459 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 518
P+SP E ++ + D K+ LA Y T+ EDE + NL P+ +A + EKK
Sbjct: 389 --LPLSPDNEESICQAMRDACKSALADYHTTIEEDEELSGRENLQPRLAIAIGVRAGEKK 446
Query: 519 MLN 521
+L
Sbjct: 447 VLQ 449
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P +AFT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
+ + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 134/338 (39%), Gaps = 65/338 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ +L+ + K L+ ++ +N+L+ FM + P E F + F
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSF 201
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
K AF + S +V L SL +
Sbjct: 202 KWAFGILFSRLVRLP-----------------------------------------SLGQ 220
Query: 320 RFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
+ AL+P G +L + ++ L + ++ +DR Y++ + + G + N +LL+
Sbjct: 221 KLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNREVFISYGKRSNGELLVA 279
Query: 378 YGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
YGFV N D + + L+ D Y+ K + +G
Sbjct: 280 YGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P +AFT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
+ + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 106/425 (24%), Positives = 173/425 (40%), Gaps = 76/425 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 348
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
S F+ +I +E TF F AF ++S V R ++
Sbjct: 349 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELRG---DKLALI 398
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
FA LV+ T+ + ++ K K + D
Sbjct: 399 PFAD-LVNHSDDITSKES----------------------------SWEIKGKGLFGR-D 428
Query: 346 DAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
L K+GE I + + + N++L ++YGF + ++ D + ++ DP Y
Sbjct: 429 VVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYD 488
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVSDTSEM-------QSVISS 456
DK +A+ NG F + G E++ MLPYLRL + T +V
Sbjct: 489 DKLDIAELNGMGETAYFDIVLG---ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGH 545
Query: 457 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 516
L PVS E A+ + + K+ L Y T+ EDE +L NL P+ ++A ++ E
Sbjct: 546 L--ELPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGE 603
Query: 517 KKMLN 521
KK+L
Sbjct: 604 KKVLQ 608
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/433 (22%), Positives = 167/433 (38%), Gaps = 71/433 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEKK-Q 170
A + GD +P L +T + + LG + + + ++E +A L++EK
Sbjct: 50 ARRSINDGDELLKIPMDLCITRKSARKALGKDAL-----QDGINEYLAIACQLIHEKYVL 104
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +S W Y+ L V W + +LA+L GSP A ++REY+
Sbjct: 105 GDESEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDA 159
Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
L + G L ++P P E FT+E + AF + S + L+ ++ +A
Sbjct: 160 L-----LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYA 214
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
+ H S + + ++ V
Sbjct: 215 DLINHSAFSGA------------------------------FIDARESGDWLFKNGEEEV 244
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE--------- 399
L DR Y+ E + + G + N++LL+ YGF E NPY+ + V ++
Sbjct: 245 ILYADRGYRQMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTAALAAANE 304
Query: 400 ----DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
DP Q+K G+ F +A R +ML +LRL ++ ++
Sbjct: 305 GIEVDPLAQEKVEFLASVGRDQTVDFPCYADRYP---VEMLEFLRLMMMTPEDTRGKPLA 361
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQ 511
+SP E AVL + + K +L YP + +D ++ D L +R+A +
Sbjct: 362 DFDYSRTISPANEAAVLSSVVEAVKYQLNLYPQSEEDDANIIKDKALFRLLSYNQRMAVR 421
Query: 512 LVRMEKKMLNACL 524
R EK++L L
Sbjct: 422 HRRNEKRLLKRTL 434
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 106/425 (24%), Positives = 173/425 (40%), Gaps = 76/425 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 90 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 194
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
S F+ +I +E TF F AF ++S V R ++
Sbjct: 195 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELRG---DKLALI 244
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
FA LV+ T+ + ++ K K + D
Sbjct: 245 PFAD-LVNHSDDITSKES----------------------------SWEIKGKGLFGR-D 274
Query: 346 DAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
L K+GE I + + + N++L ++YGF + ++ D + ++ DP Y
Sbjct: 275 VVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYD 334
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVSDTSEM-------QSVISS 456
DK +A+ NG F + G E++ MLPYLRL + T +V
Sbjct: 335 DKLDIAELNGMGETAYFDIVLG---ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGH 391
Query: 457 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 516
L PVS E A+ + + K+ L Y T+ EDE +L NL P+ ++A ++ E
Sbjct: 392 LE--LPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGE 449
Query: 517 KKMLN 521
KK+L
Sbjct: 450 KKVLQ 454
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P +AFT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
+ + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 103/472 (21%), Positives = 196/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 87 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 244
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 245 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLI 298
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 299 TTGYNLE--------------------------------DDRCECVALQDFRAGEQIYIF 326
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 327 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 386
Query: 426 GREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ ++ + + + + PVS E + L
Sbjct: 387 -TEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLE 445
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED A L + +L + +A +L EK++L ++ A
Sbjct: 446 DRALLLLKTYKTTVEEDRAFLKNSDLSVRATMAIKLRLGEKEILEKAVKSAA 497
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/475 (21%), Positives = 196/475 (41%), Gaps = 82/475 (17%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRT 303
+ +P ++FT+E ++ A +V + R P L+ L C T
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNG 282
Query: 304 PLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 363
++ +N + DD + V R + AGE I
Sbjct: 283 LITTGYNLE--------------------------------DDRCECVALRDFHAGEQIY 310
Query: 364 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 423
++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +
Sbjct: 311 IFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 424 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI----------CPVSPCMERAVLD 473
H E + +L +LR+ + E++ + G I PVS E +
Sbjct: 371 HF-TEPPISAQLLAFLRV-FCMTEEELREHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWT 428
Query: 474 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
L D L Y T+ ED+A+L L + +A +L EK++L ++ A
Sbjct: 429 FLEDRASLLLKTYKTTIEEDKAVLRSPALSARAAMAVKLRLGEKEILEKAVRSAA 483
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 200/472 (42%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P +AFT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
+ + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 373 -TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED++ L +++L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 122/451 (27%), Positives = 183/451 (40%), Gaps = 62/451 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
A D+QAG+ F VP L T + R + EL + LA L LY
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179
Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKA 215
+ +G SFW PY+ L +G ES PL LW+E E+ +L GSPT
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHL- 273
++REY E L++++P+ E AF E F AF + S V L
Sbjct: 240 SARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLP 293
Query: 274 QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSP----THNPKEKVSLARRFALVPLGPP 329
G + A+H S RT P + +++ +RF
Sbjct: 294 AENGMLALVPYADLANHSAFC-VSFIDARTAAFPYAFRASSKQKRGQWWQRF-------- 344
Query: 330 LLAYSSKCKAMLAAVD------DAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGF 380
LA +S +A D DA + VV DR Y E + V G + N++LL+ YGF
Sbjct: 345 -LAPNSDDAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGF 403
Query: 381 VDEDNPYDRLVVEAALNTEDPQ----YQDKRMVAQRNGK--LSVQVFHVHAGREKEAISD 434
V + NPY+ + V +L+ + KR G+ + F ++A R +
Sbjct: 404 VSDRNPYNSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELMQ 463
Query: 435 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 494
+L + L + S + + PV+ E A L K L YP + ED+
Sbjct: 464 LLRFASL--TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSADEDD 521
Query: 495 AMLTDYN----LHPKKRVATQLVRMEKKMLN 521
A L D + L K+R++ +L R EK++L
Sbjct: 522 AALKDKSMAQLLSRKQRLSVRLRRSEKRILE 552
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/429 (22%), Positives = 185/429 (43%), Gaps = 70/429 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+F + Q +P+ +P ++FT+E ++ A +V + R P
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVT 185
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L+ L C T ++ +N + DD
Sbjct: 186 LALIPLWDMCNHTNGLITTGYNLE--------------------------------DDRC 213
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 214 ECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAE 273
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-P 459
V R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 274 VLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNS 332
Query: 460 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
PVS E + L D L Y T+ ED+ +L + +L + ++A +L EK++
Sbjct: 333 EFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNQDLSVRAKMAIKLRLGEKEI 392
Query: 520 LNACLQVTA 528
L ++ A
Sbjct: 393 LEKAVKSAA 401
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 112/477 (23%), Positives = 194/477 (40%), Gaps = 85/477 (17%)
Query: 70 VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
S + L +L W+ + G P +++ +E V A + AG+
Sbjct: 10 AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61
Query: 128 VPNSLVVTLERVLGNET-----IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
VP + ++TLE L E+ IAE L + +E LA +L+ EK + + SFW PYI
Sbjct: 62 VPRTHMLTLE--LARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDS 116
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL + E A L G + +A+ ++ +Y SL
Sbjct: 117 LPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYL----------SLC 160
Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTR 302
Q P E FT F A ++V S + L++ G Q T
Sbjct: 161 QNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-------------------TL 198
Query: 303 TPLSPTHNPKEKVSLARRFALVPLGPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 360
P++ N + PP L + ++ + ++AV AG+
Sbjct: 199 VPMADMLNHRR--------------PPDVLWETTEDGESFVMKANNAV--------AAGD 236
Query: 361 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 420
+ G + N +L+++GFV +DN +D + + DP K+M+ + +
Sbjct: 237 EVHDSYGAKSNDLMLLHFGFVTDDNEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARP 296
Query: 421 FHVHAGREKEAISDMLPYLRLGYV--SDTSEMQS-VISSLGPICPVSPCMERAVLDQLAD 477
F + +LR+ +D ++ S V+S + P+S E VL+ LA
Sbjct: 297 FKISRPYVHTTTRMAFSFLRIAAAVPNDIEDISSRVMSGERALGPLSVENEENVLELLAA 356
Query: 478 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 534
+ARL+ +P +L++DE +L +L P R + R EK+++ L++T + LL
Sbjct: 357 TCQARLSIFPTSLAQDEELLRGESLSPNARNCVLVRRAEKQLIEDYLEMTRVCLKLL 413
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 102/472 (21%), Positives = 196/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG V E + E+ + A+ D++A + VP
Sbjct: 62 GKREDYFPNLMKWASENG---ASVEGFEMFNFKEEGFGLR---ATRDIKAEELFLWVPRK 115
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 116 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 171
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 172 -----YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 219
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 220 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 273
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V R ++AGE I ++
Sbjct: 274 TTGYNLE--------------------------------DDRCECVALRDFRAGEQIYIF 301
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 302 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 361
Query: 426 GREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ ++ + + + + PVS E + L
Sbjct: 362 -MEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLE 420
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T ED++ L +++L + R+A +L EK++L + A
Sbjct: 421 DRASLLLKTYKTTSEEDKSFLKNHDLSVRARMAIKLRLGEKEILEKAVTSAA 472
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 102/474 (21%), Positives = 190/474 (40%), Gaps = 82/474 (17%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T+ GSRE + DL SW +NG + + R + D++A
Sbjct: 69 TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
+ VP +++T+E N + + + +++ + LAL+L+ E+ SFW P
Sbjct: 117 EELFLWVPRKMLMTVESA-QNSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L ++ ++PL + + ++ L G+ ++L + + R+Y +F
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY-----AYFY- 221
Query: 239 GSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQ 295
L Q +P +P + F+F+ ++ A +V + R P + L+ L
Sbjct: 222 -KLVQTHPAASKLPLKDGFSFDDYRWAVSSV------MTRQNQIPTVDGSRVTLALIPLW 274
Query: 296 TSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 355
C T ++ +N + DD + V +
Sbjct: 275 DMCNHTNGLITTGYNLE--------------------------------DDRCECVALQD 302
Query: 356 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 415
YK E I ++ G + N++ +I+ GF +DN +DR+ ++ ++ + Y K V R G
Sbjct: 303 YKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHDRVKIKLGVSKSERLYAMKAEVLARAGI 362
Query: 416 LSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPC 466
+ VF +H + + +L +LR+ Y+ + + + PVS
Sbjct: 363 PASCVFALHC-NDPPISAQLLAFLRVFCMTEEELKDYLLGERAINKIFTLGNSDFPVSWE 421
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
E + L L Y T ED ++L +L R+A QL EK++L
Sbjct: 422 NEIKLWTFLETRAALLLKTYKTTSEEDRSILEKPDLSLHTRLAVQLRLAEKQIL 475
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 107/472 (22%), Positives = 199/472 (42%), Gaps = 86/472 (18%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P +AFT+E ++ V++ L R + G
Sbjct: 231 NKLPLKDAFTYEDYRLGLVSLA-----LGRWALGLECG---------------------- 263
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
V +AR G P + + DD + V + ++AGE I ++
Sbjct: 264 ---------VGIAR------CGKPQITTGYNLE------DDRCECVALQDFRAGEQIYIF 302
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 303 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 362
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + + I +LG PVS E + L
Sbjct: 363 -TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLE 421
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+A L ++NL + +A +L EK++L ++ A
Sbjct: 422 DRASLLLKTYKTTIEEDKAFLKNHNLSVRATMAIKLRLGEKEILEKAVKSAA 473
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 102/472 (21%), Positives = 198/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P E+FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKESFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AG+ I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFQAGDQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 105/463 (22%), Positives = 187/463 (40%), Gaps = 63/463 (13%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW +NG C+ + + + A+ D++A + +P ++
Sbjct: 75 REDHFPELMSWAKENG-ASCECFTVANFG-----KEGYGLRATRDIKAEELFLWIPRKML 128
Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+T+E N + L + +++ + LAL+L+ E+ SFWLPYIR L ++
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ PL + + ++ L G+ ++L + + R+Y Y Y +
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQY---------------AYFYKLV 224
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
+ + A TP MG + F V S TR PT +
Sbjct: 225 QDKGMLGSVELRLFA-----------SLTPVMGGKLFDQWAV----SSVMTRQNQIPTED 269
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 370
+V+LA L+PL + DD + V + YK E I ++ G +
Sbjct: 270 GS-RVTLA----LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFYGTRS 324
Query: 371 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 430
N++ +I+ GF +D+ +DR+ ++ ++ + Y K V R G + VF +H E
Sbjct: 325 NAEFVIHNGFFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPP 383
Query: 431 AISDMLPYLRLG---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 481
+ +L +LR+ Y+ + + + PVS E + L
Sbjct: 384 ISAQLLAFLRVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAAL 443
Query: 482 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
L Y T ED +ML +L R+A QL EK++L L
Sbjct: 444 LLKTYKTTSEEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/431 (21%), Positives = 187/431 (43%), Gaps = 70/431 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES 286
+F + Q +P +P ++FT++ ++ A +V + R P
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSV------MTRQNQIPTEDGSR 265
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
L+ L C T ++ +N + DD
Sbjct: 266 VTLALIPLWDMCNHTNGLITTGYNLE--------------------------------DD 293
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
+ V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 294 RCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMK 353
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--SLGPICPVS 464
V R G + VF +H+ E + +L +LR+ +S+ + +I ++G I +
Sbjct: 354 AEVLARAGIPTSSVFALHS-TEPAISAQLLAFLRVFCMSEEELKEHLIGEHAIGKIFTLG 412
Query: 465 PCMERAVLD---QLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 517
D +L + +AR L Y T+ D++ L ++L P +A +L EK
Sbjct: 413 NSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVDKSFLETHDLTPHAIMAIKLRLGEK 472
Query: 518 KMLNACLQVTA 528
++L ++ A
Sbjct: 473 EILEKAVKSAA 483
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 102/472 (21%), Positives = 198/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 76 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 129
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 130 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 185
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 186 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 233
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P E+FT+E ++ A +V + R P L+ L C T +
Sbjct: 234 NKLPLKESFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 287
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AG+ I ++
Sbjct: 288 TTGYNLE--------------------------------DDRCECVALQDFQAGDQIYIF 315
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+
Sbjct: 316 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS 375
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 376 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLE 434
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 435 DRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 486
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 105/413 (25%), Positives = 174/413 (42%), Gaps = 72/413 (17%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + ++ V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDVDTVRRSE-IGECCAG--LRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + I E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-------SFASHLVHLQTSCT 299
+T E F AF ++S P +G F +H L
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTF-------PPLIGDNLALVPLADFVNHGFGL----- 187
Query: 300 TTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 359
T P + K AR+ L L ++ C +
Sbjct: 188 ---TNEDPYWHVKSAGVFARQETLT------LQAAANCA------------------EKQ 220
Query: 360 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 417
E ++ + + N++L +YGFVD D N D + ++ + DK +AQ G S
Sbjct: 221 EVLMQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDS 280
Query: 418 VQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVL 472
F+++ R + DM+ YLRL + SD+ ++++ + P+S E A+
Sbjct: 281 TAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAIC 338
Query: 473 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
+ + + +A L Y +T+ ED +L L +K++A + EK++L LQ
Sbjct: 339 EAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/439 (22%), Positives = 191/439 (43%), Gaps = 89/439 (20%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
+++ +R +H A + + +P S ++TLE + ET +A+ + KL+ L+
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227
Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L+ +L+ E+K K+S W PY+ L + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCG 277
E+ IKR+Y+++ +V F +Y TFE F A + S V LQ
Sbjct: 281 REKKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ--- 324
Query: 278 CTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKC 337
+ T++F PL+ N + + Y +
Sbjct: 325 -INEQKTDAFV---------------PLADMLNHRRPKQTS------------WQYDDQR 356
Query: 338 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
+ + +Q + D P GE + G + NS+ +NYGF++ DN + + + +
Sbjct: 357 EGFV------IQALEDIP--RGEQVYDSYGRKCNSRFFLNYGFINLDNDANEVALRLTFD 408
Query: 398 TEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 456
EDP + K+ + G + +V+ + +++ +S+ + YLR + D S++ ++SS
Sbjct: 409 AEDPTIERKKEMM--GGDVPEFKVYRILENYQEQNVSEFMSYLRFILIRDNSKL-LMLSS 465
Query: 457 L----------------GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
L P+S E + ++++ + ++ Y TL ED+ +L
Sbjct: 466 LHEQQTENSENLSGYKPQKTPPISIQNETDMWVRISNMCQTSISLYNTTLKEDKELLAKD 525
Query: 501 NLHPKKRVATQLVRMEKKM 519
NL +R L EK++
Sbjct: 526 NLTQNQRNCVLLRSGEKEV 544
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 98/426 (23%), Positives = 177/426 (41%), Gaps = 72/426 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L +++ + LAL+L+ E+
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERA-N 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + E+ L G+ ++L + R+Y
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY--- 217
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+F L Q +P +P ++FTF+ ++ A
Sbjct: 218 --AYFY--KLIQTHPASSKLPLKDSFTFDDYRWA-------------------------- 247
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
S TR PT + ++ V+LA L+PL + DD
Sbjct: 248 -------VSSVMTRQNQIPTEDGRQ-VTLA----LIPLWDMCNHRNGLITTGYNLEDDRC 295
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ V + YK E I ++ G + N++ +I+ GF ++N +D++ ++ ++ + Y K
Sbjct: 296 ECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAE 355
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV---------ISSLGP 459
V R G +F ++ E+ + +L +LR+ + E++ I +LG
Sbjct: 356 VLARAGIPVSSIFALYCN-EQPISAQLLAFLRV-FCMKEEELRDYLLGGHAINKIVTLGS 413
Query: 460 I-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 518
+ PVS E + L L Y T ED + L L P R+A QL EK
Sbjct: 414 MEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEEDSSTLEKSELSPHSRMAIQLRLAEKW 473
Query: 519 MLNACL 524
+L L
Sbjct: 474 ILEKAL 479
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/413 (25%), Positives = 174/413 (42%), Gaps = 72/413 (17%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + + V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + + E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-------SFASHLVHLQTSCT 299
+T E F AF ++S P +G F +H L
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTF-------PPLIGDNLALVPLADFVNHGFGL----- 187
Query: 300 TTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 359
+P KV A FA + + AA + A +
Sbjct: 188 -------TNEDPGWKVKSAGVFA-----------RQETLTLQAAANCA---------EKQ 220
Query: 360 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 417
E ++ + + N++L +YGFVD D N D + ++ + DK +AQ G S
Sbjct: 221 EVLIQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDS 280
Query: 418 VQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVL 472
F+++ R + DM+ YLRL + SD+ ++++ + P+S E A+
Sbjct: 281 TAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAIC 338
Query: 473 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
+ + + +A L Y +T+ ED +L L +K++A + EK++L LQ
Sbjct: 339 EAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 102/472 (21%), Positives = 197/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AGE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFQAGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 104/476 (21%), Positives = 190/476 (39%), Gaps = 74/476 (15%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
VAG S +E W++ NG+ V + K + + A++D++
Sbjct: 66 VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
++P +++T + + + TI +L+ ++L + LA++++ EK + SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L + PL ++ E+ GS E L++ + I R+Y
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQY------------ 218
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTT 300
A+ F++ ++ +H++ E F S
Sbjct: 219 ------------AYLFKLLNLP----ENSKLHIR----------EYFTYDFYRWAVSTVM 252
Query: 301 TRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 360
TR P + K +L+PL + + K D+ + R + GE
Sbjct: 253 TRQNQIPAKDGK-----GMSLSLIPLWDMCNHANGEMKTDFIEERDSCVNMALRDFSVGE 307
Query: 361 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-- 418
I + G + ++ LL+ GFV N YD + ++ L++ D Y K + KL V
Sbjct: 308 QIFICYGRRSSADLLLYSGFVYPGNVYDGMAIQLGLSSSDRLYAMKAQLCSVM-KLGVPS 366
Query: 419 QVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMER 469
Q +H+ AG+E + ++L +LR+ + D + Q++ S + +S E
Sbjct: 367 QNYHISAGKEPVTL-ELLTFLRIFCMQDLELRDRLLGDNRAQALFSLVDRSQIISKLNEL 425
Query: 470 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
LA Y ++ EDE L D NL ++R A QL+ +EK L L+
Sbjct: 426 RTCVYLATRVTLLQRQYKTSIQEDEEKLKDGNLSAQERSALQLLLIEKCTLENVLE 481
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 102/472 (21%), Positives = 198/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P E+FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKESFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AG+ I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFQAGDQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+
Sbjct: 313 YGTRSNAESVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 96/425 (22%), Positives = 175/425 (41%), Gaps = 70/425 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L T +++ + LAL+L+ E+
Sbjct: 28 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + ++ L G+ ++L + R+Y
Sbjct: 86 PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+F L Q +P +P ++FTF+ ++ A
Sbjct: 136 --AYFY--KLVQTHPASSKLPLKDSFTFDDYRWA-------------------------- 165
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
S TR PT + ++ V+LA L+PL + DD
Sbjct: 166 -------VSSVMTRQNQIPTEDGRQ-VTLA----LIPLWDMCNHRNGLITTGYNLEDDRC 213
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ V + YK E I ++ G + N++ +I+ GF ++N +D++ ++ ++ + Y K
Sbjct: 214 ECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAE 273
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGP 459
V R G VF ++ E + +L +LR+ Y+ + +++
Sbjct: 274 VLGRAGIPVSSVFALYCN-EPPISAQLLAFLRVFCMMEEELKDYLFGAQAINRLVTLGSM 332
Query: 460 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
PVS E + L L Y T ED + L +L P R+A QL EK +
Sbjct: 333 EFPVSWENEIKLWTFLETRAALLLKAYKTTAEEDSSTLDKTDLSPHSRMAVQLRLAEKAI 392
Query: 520 LNACL 524
L L
Sbjct: 393 LEKAL 397
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/416 (23%), Positives = 171/416 (41%), Gaps = 61/416 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L S + WSE EL + G+ L E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ S Q +P+ + T + F AF ++S R +LV
Sbjct: 190 EEVILPHS--QLFPFPV-----TLDDFLWAFGILRSRAFSRLR------------GQNLV 230
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCK-AMLAAVDDAVQLV 351
+ + +P T Y+ + K A L + D L
Sbjct: 231 LIPLADLINHSPSITTEE----------------------YAWEIKGAGLFSRDQLFSLR 268
Query: 352 VDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
KAGE +++ + + N++L ++YGF++ + + ++ DP + DK +A
Sbjct: 269 TPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIA 328
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSP 465
+ NG + F + G+ A MLPYLRL + D ++S+ + G + PVS
Sbjct: 329 ESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSR 386
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 521
E + + D K+ L+GY T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 387 ANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVLQ 442
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 126/290 (43%), Gaps = 62/290 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ +
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+ P+ + E FK A+ V C+ G E+ S
Sbjct: 117 LLLC-------PHIFGSRVSALE-FKHAYATV-----------CSRAWGIETIKS----- 152
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVV 352
ALVP + + C+AML+ +D ++V
Sbjct: 153 --------------------------LALVPF-VDFFNHDANCRAMLSYDEDRHCAEVVS 185
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDP 401
DR Y G+ +V+ G N+ L +++GF NP+D++ + +L+ +DP
Sbjct: 186 DRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDP 235
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 91/421 (21%), Positives = 171/421 (40%), Gaps = 66/421 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+F +P +AFTF+ ++ A +V + R P
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSV------MTRQNQIPTADGSRVT 267
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L+ L C T ++ +N + DD
Sbjct: 268 LALIPLWDMCNHTNGLITTGYNLE--------------------------------DDRC 295
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ V + YK GE I ++ G + N++ +I+ GF EDN +DR+ ++ ++ + Y K
Sbjct: 296 ECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKIKLGVSKGERLYAMKAE 355
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGP 459
V R G + +F +H E + +L +LR+ Y+ + + +
Sbjct: 356 VLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRDYLVGDHAINKIFTLGNT 414
Query: 460 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
PVS E + L L Y ED +ML +L R+A +L EK++
Sbjct: 415 EFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDLSLHSRIAIKLRLAEKEI 474
Query: 520 L 520
L
Sbjct: 475 L 475
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 97/416 (23%), Positives = 171/416 (41%), Gaps = 61/416 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L S + WSE EL + G+ L E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ S Q +P+ + T + F AF ++S R +LV
Sbjct: 190 EEVILPHS--QLFPFPV-----TLDDFLWAFGILRSRAFSRLR------------GQNLV 230
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCK-AMLAAVDDAVQLV 351
+ + +P T Y+ + K A L + D L
Sbjct: 231 LIPLADLINHSPSITTEE----------------------YAWEIKGAGLFSRDQLFSLR 268
Query: 352 VDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
KAGE +++ + + N++L ++YGF++ + + ++ DP + DK +A
Sbjct: 269 TPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIA 328
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSP 465
+ NG + F + G+ A MLPYLRL + D ++S+ + G + PVS
Sbjct: 329 ESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSR 386
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 521
E + + D K+ L+GY T+ EDE + + NLHP+ +A + EKK+L
Sbjct: 387 ANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVLQ 442
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 101/472 (21%), Positives = 197/472 (41%), Gaps = 76/472 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AG+ I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFQAGDQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ S + + + PVS E + L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLE 431
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
D L Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 432 DRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 90/421 (21%), Positives = 170/421 (40%), Gaps = 66/421 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+F +P +AFTF+ ++ A +V + R P
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSV------MTRQNQIPTADGSRVT 267
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L+ L C T ++ +N + DD
Sbjct: 268 LALIPLWDMCNHTNGLITTGYNLE--------------------------------DDRC 295
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ V + YK GE I ++ G + N++ +I+ GF EDN +DR+ ++ ++ + Y K
Sbjct: 296 ECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKIKLGVSKSERLYAMKAE 355
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGP 459
V R G + +F +H E + +L +LR+ Y+ + + +
Sbjct: 356 VLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRDYLVGDHAINKIFTLGNT 414
Query: 460 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
PVS E + L L Y ED +ML +L R+ +L EK++
Sbjct: 415 EFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDLSLHSRITIKLRLAEKEI 474
Query: 520 L 520
L
Sbjct: 475 L 475
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 96/435 (22%), Positives = 187/435 (42%), Gaps = 78/435 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + +PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES 286
+F + Q +P +P ++FT++ ++ A +V + R P
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSV------MTRQNQIPTEDGSR 265
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
L+ L C T ++ +N + DD
Sbjct: 266 VTLALIPLWDMCNHTNGLITTGYNLE--------------------------------DD 293
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
+ V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 294 RCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMK 353
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG 458
V R G + VF +H+ E + +L +LR+ +S+ + +I +LG
Sbjct: 354 AEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMSEEELKEHLIGEHAIDKIFTLG 412
Query: 459 -PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLV 513
PVS E +L + +AR L Y T+ +D++ L ++L +A +L
Sbjct: 413 NSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVEDDKSFLETHDLTSHAIMAIKLR 468
Query: 514 RMEKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 469 LGEKEILEKAVKSAA 483
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 108/416 (25%), Positives = 178/416 (42%), Gaps = 39/416 (9%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S++++ GD S+P VT + +A L+ + EL LAL+L EK + K S
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTV 234
W PY++ L+ S L ++E E L G+ E +R + K EY L
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+ P P + + F + + AF+ V C + + + + +V L
Sbjct: 233 -------IAEDPGKYPLDVYAF-LTESAFIDALDIV-----CARAQWLNSANCYA-MVPL 278
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVVD 353
+ P P +P E S AR + + + L A +C VD A V L +
Sbjct: 279 MDAIPICGAP--PPVSP-EDPSFARFYEIRDIKTGLTAV--RCGYADYDVDSASVVLCAN 333
Query: 354 RPYKAGESIV-VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 412
AG I+ + + NS+L +++G VD+ +P D L+ DP Y K+ V +
Sbjct: 334 TRASAGSKILQIDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEA 393
Query: 413 NGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
G Q F V+ R + L YLR V+++ E+ +V + + VSP E
Sbjct: 394 QGFADKGQTFPVYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYET 448
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 526
L L + R++ Y T EDE +L ++ K R A++L R EK+++ +
Sbjct: 449 LQLLMADCRDRMSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 100/424 (23%), Positives = 178/424 (41%), Gaps = 70/424 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A +D++A + ++P L++T E ++ L+ +++ ++ LAL+++ EK
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTETA-RESSLGPLIKKDRILQVMANVSLALHVLCEK- 178
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFW PYI +PL + E E+ +L GS +++L + + I R+Y
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQY- 230
Query: 230 ELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES 286
+F LFQ P +P E FTF+ ++ A V + R P
Sbjct: 231 ----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWAVSTV------MTRQNQVPTSDGRH 278
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
+ L+ + C + +S N L + S++C AM
Sbjct: 279 LITALIPMWDMCNHSNGEVSTEFN-------------------LGSDSAECLAM------ 313
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
R + + ++ G + N++ LI+ GFV +N +DR+ V+ ++ D + K
Sbjct: 314 -------REFPTDSQVYIFYGMRSNAEFLIHNGFVYPENVHDRVNVKLGVSKNDSLFAMK 366
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL-----GYVSD--TSEMQSV-ISSLG 458
V R G + F VH G++ ++L +LR+ G + D TSE QS +S LG
Sbjct: 367 AEVLSRAGIHASTSFQVHCGKDP-IPPELLVFLRVFTMVEGDLRDLLTSEHQSAYLSCLG 425
Query: 459 -PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 517
C V+ E L + Y ++ + E L ++ R A QL E
Sbjct: 426 RSDCMVTQEQETKAWAFLETRLSLLIRSYRTSIKDVETELQAPDMTYHSRAALQLKLAEM 485
Query: 518 KMLN 521
++L+
Sbjct: 486 QILS 489
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/435 (21%), Positives = 188/435 (43%), Gaps = 78/435 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES 286
+F + Q +P +P ++FT++ ++ A +V + R P
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSV------MTRQNQIPTEDGSR 265
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
L+ L C T ++ +N + DD
Sbjct: 266 VTLALIPLWDMCNHTNGLITTGYNLE--------------------------------DD 293
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
+ V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 294 RCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMK 353
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG 458
V R G + VF +H+ E + +L +LR+ +++ + +I +LG
Sbjct: 354 AEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLG 412
Query: 459 -PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLV 513
P+S E +L + +AR L Y T+ +D++ L ++L +A +L
Sbjct: 413 NSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDKSFLETHDLTSHATMAIKLR 468
Query: 514 RMEKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 469 LGEKEILEKAVKSAA 483
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 106/477 (22%), Positives = 199/477 (41%), Gaps = 88/477 (18%)
Query: 73 KKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPN 130
K+E+ +L W +NG +V E+ K A+ D++A + VP
Sbjct: 74 KREDYFSELIKWAGENGAFTDGFEVANFEEEGFGLK--------ATRDIKAEELFLWVPR 125
Query: 131 SLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQR 187
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 KLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER-YNPNSFWLPYIQTLPNE- 182
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ L + E E+ YL + +I + + R+Y +F + Q +P
Sbjct: 183 ------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY-----AYFY--KVVQTHPN 229
Query: 248 --DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTP 304
+P ++FT++ ++ A +V + R P L+ L C T
Sbjct: 230 ASKLPLKDSFTYDDYRWAVSSV------MARQNQIPAEDGSRVTLALIPLWDMCNHTNGL 283
Query: 305 LSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 364
++ +N K DD + V + +KAGE I +
Sbjct: 284 ITTGYNLK--------------------------------DDRCECVALQDFKAGEQIYI 311
Query: 365 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
+ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 312 FYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALH 371
Query: 425 AGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQL 475
+ E + +L +LR+ +++ + +I +LG PVS E +L
Sbjct: 372 S-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KL 426
Query: 476 ADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
+ +AR L Y T+ +D+ +L +L +A +L EK++L ++ A
Sbjct: 427 WTFLEARASLLLKTYKTTIHDDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/419 (21%), Positives = 172/419 (41%), Gaps = 63/419 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ +L G+ VP L + + V ++ + L ++L ++ E +G
Sbjct: 86 LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA---- 288
+ G +P + TF+ F AF ++S V P++ + A
Sbjct: 199 AK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVF--------PELRGDKLALIPF 243
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
+ L++ T+ ++ + + K L D
Sbjct: 244 ADLINHSADITSKQS----------------------------CWEIQGKGFLGR-DVVF 274
Query: 349 QLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
L K+GE + V + + N++L ++YGF + ++ D + ++ DP Y DK
Sbjct: 275 SLRTPMEVKSGEQVYVQYDLDKSNAELALDYGFTETNSTRDSYTLTLEISESDPFYGDKL 334
Query: 408 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CP 462
+A+ NG F V G + M+ YLRL + T ++++ + G + P
Sbjct: 335 DIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELP 392
Query: 463 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 521
VS E ++ + K+ L Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 393 VSRDNEESICQVIQTACKSALTAYHTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVLQ 451
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 135/348 (38%), Gaps = 79/348 (22%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ LP K+ L+ + + A+E+++ G+A VP + ++T+ER
Sbjct: 83 ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER + + +E+ + +P T
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSY----------FP------EIT 231
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
+ AF + S ++ L MG E
Sbjct: 232 VGALRWAFDILFSRLIRLD------AMGGE------------------------------ 255
Query: 316 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 375
ALVP +L + C A + DAV L DR Y GE + G +P+S+LL
Sbjct: 256 -----LALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDRSYVKGEQVWASYGQRPSSELL 309
Query: 376 INYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 421
I+YGF E +NP D + ++ DP K V + G V+ F
Sbjct: 310 ISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETF 357
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 103/467 (22%), Positives = 180/467 (38%), Gaps = 81/467 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A +D+ A F+VP S +V +E L +T E+ + + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
L+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKI----- 638
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QRCGCTPQ 281
K E+ F A L P +IF+ + ++ L R G T
Sbjct: 639 -GKTNAEEM----FHAKIL--------PVIRGNPDIFQTSQAKSDEELIQLAHRMGST-- 683
Query: 282 MGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAML 341
S+ Q +E S +VP+ +L ++ A +
Sbjct: 684 -----IMSYAFDFQNEDEEEEDDSEEWVEDREAKST---MGMVPMA-DILNADAEYNAHV 734
Query: 342 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PY----DR 389
DDA+ + R KAGE I+ + GP PNS+LL YG+V + P+ D
Sbjct: 735 NYGDDALTVATLRTIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDA 794
Query: 390 LVVEAALNTED----------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 439
L L++E ++++ ++ + + + + + + E D+ L
Sbjct: 795 LAANLGLSSEQLDSAREHLDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQL 854
Query: 440 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLT 498
+ M I + P C V V L A + YP T+ EDE +L+
Sbjct: 855 K--------SMLKAIRKVDPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILS 906
Query: 499 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 545
NL +++ A + EK++L + ++ + D + PAP
Sbjct: 907 GSNLSERRKAAVTVRLGEKRLLQEARVLLSE---IASDAILDDAPAP 950
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 126/290 (43%), Gaps = 64/290 (22%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ + V
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L QQ + + T ++ V C+ G E+ S
Sbjct: 117 CM----LMQQVKLFVCSRILT------DYITV-----------CSRAWGIETIKS----- 150
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVV 352
ALVP + + C+AML+ +D ++V
Sbjct: 151 --------------------------LALVPF-VDFFNHDANCRAMLSYDEDRHCAEVVS 183
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDP 401
DR Y G+ +V+ G N+ L +++GF NP+D++ + +L+ +DP
Sbjct: 184 DRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDP 233
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 99/472 (20%), Positives = 193/472 (40%), Gaps = 83/472 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ +++DL+ + +P +++++ +I+ +LT + A+ L+YE G+
Sbjct: 82 IISNKDLKVNNIVAKIPKDIILSIH----TSSISNILTKYTMERNIATAIALIYEASIGE 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W YI L L V+ P+LW + L G+ + I + I Y ++
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADI- 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS---CVVHLQRCGCTP--QMGTESF 287
+ L + +P E F+FE FK A V S C+ P +
Sbjct: 190 ----VESLLIKNHPEYFSKEIFSFENFKIANSIVSSRAFCIDSYHGDSLVPLADIFNHKT 245
Query: 288 ASHLVHLQTS------CTTTR-------TPL-SPTHNPKEKVSLARRFALV-PLGPPLLA 332
VH++++ C + + TPL + + +K++ ++ L+ +
Sbjct: 246 GRENVHIESNGDVCNKCGSIKTCKHRKVTPLITKSAKSYKKLTNKKKMELIEKQKQQQIN 305
Query: 333 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
C + D+ + + V + KA + + G N+ LL YGF++ DNP D L V
Sbjct: 306 DEENCGDIAEEDDEHLYIKVVKAVKANQEVYNTYGDHSNATLLSKYGFIEMDNPCDNLPV 365
Query: 393 EAAL-----------NTEDPQYQDKRM--------VAQRNGKLSVQVFHVHAGREKEA-- 431
E +L N D KR+ + RN S+++ +GR +A
Sbjct: 366 EKSLVDTNLISLCKENGFDSNELSKRISFYASLFDIDSRNTH-SIEI----SGRLDDALV 420
Query: 432 ---------ISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCMERAVLDQLADYFK 480
+S+ +L++ +++ L I + +++A++ L +
Sbjct: 421 CSVGIALAPLSEFEGWLKMS----EHKLEKYFEKLEAEDIVKQNAQVKKAIVQILNN--- 473
Query: 481 ARLAGYPATLSEDEAMLTDY--NLHPKKRVATQLVRMEKKMLNACLQVTADM 530
+L+ YP TL +D+ L + N +K ++T L EKK++ ++ D+
Sbjct: 474 -KLSNYPTTLEQDQNKLKELKENEENRKIISTSLNICEKKLIYKSIKYYEDL 524
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 109/431 (25%), Positives = 177/431 (41%), Gaps = 82/431 (19%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S+ + G+ F+VP + +T + ++ + L L +AL+L++E+ G S
Sbjct: 4 SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L G SP+ W E +LA L GS ++L +G +
Sbjct: 61 WAPYIALLPADSG-------SPVQWEEADLAELQGS----QVLGTVQGYR---------- 99
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQ 295
+ FQQ + E F QAF + V + L
Sbjct: 100 ----AYFQQRFDQLQAEVFGPN--SQAFDPI---VFNFD-----------------AFLW 133
Query: 296 TSCTTTRTPLSPTHNPKEKVSLARRFALVPLG----------PPLLAYSSKCKAMLAAVD 345
+CT + H P + ++A LVPL P + K L
Sbjct: 134 AACTVR----ARAHPPLDGGNIA----LVPLADMVRSQPSWPPDSAGWQLKQTGGLFGAG 185
Query: 346 DAVQLVVDRP--YKAGESIVVWCGPQ-PNSKLLINYGFVDE--DNPYDRLVVEAALNTED 400
LV++ AG++I + GPQ + +LL+++G +D + P L +E L+ ED
Sbjct: 186 STQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPLVNQPSYALTLE--LSKED 243
Query: 401 PQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL-- 457
Y DK + + N +L+ H+ A R +A + L +D ++S+ +
Sbjct: 244 RNYDDKADILELN-ELAESTEHILRADRAPDAGLLPVLRLLNLSGTDAFLLESIFRNEVW 302
Query: 458 -GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRM 515
PVS ER QL D A LA YP ++ ED A++ +L P R + + VR+
Sbjct: 303 EHMQLPVSEDNERGCYQQLIDGCTAALAAYPTSIDEDLALMASGSLQPGSRRQSAVRVRL 362
Query: 516 -EKKMLNACLQ 525
EK+ L+A L+
Sbjct: 363 GEKEALDATLR 373
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 104/476 (21%), Positives = 198/476 (41%), Gaps = 84/476 (17%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG L N K + A+ +++A + VP
Sbjct: 80 GKREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 190 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 238 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 291
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + + GE I ++
Sbjct: 292 TTGYNLE--------------------------------DDRCECVALQDFNVGEQIYIF 319
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 320 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 379
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + +I +LG PVS E +L
Sbjct: 380 -TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLW 434
Query: 477 DYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
+ +AR L Y T+ ED++ L ++L +A +L EK++L ++ A
Sbjct: 435 TFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 490
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 94/435 (21%), Positives = 187/435 (42%), Gaps = 78/435 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E + + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES 286
+F + Q +P +P ++FT++ ++ A +V + R P
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSV------MTRQNQIPTEDGSR 265
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
L+ L C T ++ +N + DD
Sbjct: 266 VTLALIPLWDMCNHTNGLITTGYNLE--------------------------------DD 293
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
+ V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 294 RCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMK 353
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG 458
V R G + VF +H+ E + +L +LR+ +++ + +I +LG
Sbjct: 354 AEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLG 412
Query: 459 -PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLV 513
P+S E +L + +AR L Y T+ +D+ L ++L +A +L
Sbjct: 413 NSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDKLFLETHDLTSHATMAIKLR 468
Query: 514 RMEKKMLNACLQVTA 528
EKK+L ++ A
Sbjct: 469 LGEKKILEKTVKSAA 483
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 94/332 (28%), Positives = 139/332 (41%), Gaps = 55/332 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA--ELLTTNKLSELACLALYLMYEKKQ 170
AAS DL AG A ++P+S ++T L + T T L E + L+L+YEK
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254
Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
G +S W P + L + G L + +P W +E A L G+P + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
E R++ L F A L + +P PTE +T F+ A A + + +Q
Sbjct: 314 ARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQ---AE 364
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
G H T+C L P ++L A P + YS
Sbjct: 365 TVGGASGGGEHHPPAPTTC------LPP-------IALLCNHATWPHA---VRYSRL--- 405
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
DDA+ L + R +AGE I V G + N++LL+ YGF DNPYD + L+ E
Sbjct: 406 ----RDDALHLPIARGVRAGEEIFVSYGAKSNAELLLFYGFGVRDNPYD----DVPLSLE 457
Query: 400 DPQYQDKRMVAQR-----NGKLSVQVFHVHAG 426
PQ + + + A R KLS+ V G
Sbjct: 458 LPQGEVRDVSALRERVLHRAKLSLSPHSVRCG 489
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 104/476 (21%), Positives = 197/476 (41%), Gaps = 84/476 (17%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG L N K + A+ +++A + VP
Sbjct: 73 GKREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 284
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + + GE I ++
Sbjct: 285 TTGYNLE--------------------------------DDRCECVALQDFNVGEQIYIF 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 313 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 372
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLA 476
E + +L +LR+ +++ + +I +LG PVS E +L
Sbjct: 373 -TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLW 427
Query: 477 DYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
+ +AR L Y T+ ED++ L ++L +A +L EK++L ++ A
Sbjct: 428 TFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 483
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 104/468 (22%), Positives = 190/468 (40%), Gaps = 82/468 (17%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW+ +NG V +K NE + + A++DL+ + ++P +++
Sbjct: 81 REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134
Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+TL+ V + L N + +A LAL+L+ E + + SFW YI L
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD- 248
+ L + + + SPT + + I R+Y +F +LFQ +
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQY-----AYF--NNLFQNSNDEA 238
Query: 249 --IPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT----ESFASHLVHLQTSCTTTR 302
I FT+++++ A V + R P T E+ + L+ L C T
Sbjct: 239 SLILRNVFTYQLYRWAVSTV------MTRQNFIPSSSTSNDVENGINGLIPLWDMCNHTN 292
Query: 303 TPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 362
LS + LA CK P+K GE +
Sbjct: 293 GYLSTQYKVDRSECLA------------------CK----------------PFKKGEQV 318
Query: 363 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 422
+++ G + NS L++ GFV ++NP+D + ++ D + + + + G F+
Sbjct: 319 LIFYGERSNSDFLVHNGFVYDENPHDSFRLRLGISKSDKLHGLRCELLKDLGIPDSGDFY 378
Query: 423 VHAGREKEAISDMLPYLRLGY--VSDTSEMQSVISSLGPI----CPVSPCMERAVLDQLA 476
+++G E ++L +LR+ V + + +S S L + C + +E V + L
Sbjct: 379 LYSGSEP-VRENLLAFLRIFNMDVENLNHWKSHSSRLSDLMWKDCALDTKIESKVWNFLY 437
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
D L Y E E + D N +R+ ++ EKK L++ L
Sbjct: 438 DRINLLLKTYKG--DEVEVRVEDSNSTECRRLVRAQLKCEKKFLSSIL 483
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 109/465 (23%), Positives = 186/465 (40%), Gaps = 88/465 (18%)
Query: 93 CKVILKEKPSHNEKHRPIHYVAASEDL--------QAGDAAFSVPNSLVVTLERVLGNET 144
+ I SH + + SE L AGD +VP S+ + L V N +
Sbjct: 43 VQTIWSWAQSHGIQGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ-NSS 101
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ + L +AL+L++E S W PY+ L + +++SPL WS+ E
Sbjct: 102 LGK--ACEGLKPWVAVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEE 151
Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
LA L G+ + E ++ EYN L + V +F Y TF+ FK AF
Sbjct: 152 LAELVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAF 204
Query: 264 VAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFAL 323
++S T +PL+ E ++L L
Sbjct: 205 GILRS-------------------------------RTFSPLT-----GEDIALVPIADL 228
Query: 324 VPLGPPLLAYS--------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP-QPNSKL 374
V G L S S+ + D + + + AGE +++ G + N+ L
Sbjct: 229 VNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSAGEQVLMQYGATKSNADL 288
Query: 375 LINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 426
++YGFV+ D D L + ++ +D DK + + NG F + G
Sbjct: 289 ALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADILEINGFQCSMQFDLSRG 348
Query: 427 REKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKA 481
+ +M+ +LRL +S D+ ++++ + G + PVS E A+ + + KA
Sbjct: 349 QGPS--DEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVSRDNEEALCTSMLEGLKA 406
Query: 482 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
L GY T+ +D +L +L + +A +VR+ +K + LQ
Sbjct: 407 ALDGYSTTVEQDMELLARGDLSTRMEIAV-VVRLGEKRVMQELQT 450
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 97/423 (22%), Positives = 179/423 (42%), Gaps = 59/423 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ D+ + T + F AF ++S R +LV
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR------------GQNLV 228
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ + P T + ++ A F+ + +L ++ V +
Sbjct: 229 LIPLADLINHNPAITTEDYAYEIKGAGLFS---------------RDLLFSLKSPVYV-- 271
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAGE + + + + N++L ++YGFV+ + + + + DP + DK +A+
Sbjct: 272 ----KAGEQVYIQYDLNKSNAELALDYGFVESNPNRNSYTLTIEIPESDPFFGDKLDIAE 327
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPI-CPVSPC 466
N F V G+ A ML YLRL + SD ++S+ ++ G + PVS
Sbjct: 328 TNKMGETGYFDVVDGQTLPA--GMLQYLRLVALGGSDAFLLESIFNNTIWGHLELPVSRS 385
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
E + + D K+ L+G+ T+ EDE +L + L P+ +A ++ EK++L Q+
Sbjct: 386 NEELICRVVRDACKSALSGFSTTIEEDEKLLEEGKLDPRLEMALKIRIGEKRVLQQIDQI 445
Query: 527 TAD 529
D
Sbjct: 446 FKD 448
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/378 (21%), Positives = 156/378 (41%), Gaps = 71/378 (18%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y V Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV-------IQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRT 303
+ +P ++FT+E ++ A +V + R P L+ L C T
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNG 282
Query: 304 PLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 363
++ +N + DD + V R + AGE I
Sbjct: 283 LITTGYNLE--------------------------------DDRCECVALRDFHAGEQIY 310
Query: 364 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 423
++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +
Sbjct: 311 IFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 424 HAGREKEAISDMLPYLRL 441
H E + +L +LR+
Sbjct: 371 HF-TEPPISAQLLAFLRV 387
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 130/313 (41%), Gaps = 67/313 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++ G+ V L++T ++ + ELL T ++E A LAL+++ E+ G+ S
Sbjct: 5 AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L A+ S + W + EL + + E ++R I E+ + V
Sbjct: 62 QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
Q+ P+ I E FKQA+ +S L
Sbjct: 117 -------LQKCPH-IFGERVLHSKFKQAYATGKS-------------------------L 143
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVV 352
+ S T R VP + S C+A+L+ ++ +++
Sbjct: 144 RRSSNT-------------------RILTVPF-VDFFNHDSNCRALLSYDEERACAEVIA 183
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 412
D+ Y GE +V+ G PN+ L +++GF NPYD++ V AL+ DP + K +
Sbjct: 184 DKNYARGEQVVISYGRLPNTTLALDFGFTISCNPYDQVEVWMALSHRDPLRKMKLALLHA 243
Query: 413 NGKLSVQVFHVHA 425
+G +V VHA
Sbjct: 244 HGMPTV----VHA 252
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/416 (21%), Positives = 167/416 (40%), Gaps = 62/416 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ G+ VP + + V +E I + + L +AL+L+ EK +
Sbjct: 87 LVAKRDIAKGETVLQVPKRFWINPDAVAESE-IGNVCS--GLKPWISVALFLLREKWR-D 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L + + +S + WSE EL+ + G+ + + + ++ E+ +++
Sbjct: 143 DSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ Q +P+ I T + F AF ++S R
Sbjct: 196 EEVILRNK--QLFPFPI-----TLDDFFWAFGILRSRAFSRLR----------------- 231
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ P + N +V+ A+ + A L + D L
Sbjct: 232 ----NQNLILVPFADLTNHNARVTTEDH-----------AHEVRGPAGLFSWDLLFSLRS 276
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAG+ + + + + N+ + ++YGF++ + D + ++ D Y DK +A+
Sbjct: 277 PLKLKAGDQLFIQYDLNKSNADMALDYGFIEPSSARDAFTLTLEISESDEFYGDKLDIAE 336
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVS 464
NG F + G+ M+PYLRL + T SV LG PVS
Sbjct: 337 TNGIGETAYFDIKIGQSLPPT--MIPYLRLVALGGTDAFLLESIFRNSVWGHLG--LPVS 392
Query: 465 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
E + + D K+ L+GY T+ EDE ++ + NL + ++A + EK++L
Sbjct: 393 RANEELICKVVRDACKSALSGYHTTIEEDEKLMEEGNLSTRLQIAVGIRLGEKRVL 448
>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
Length = 106
Score = 70.1 bits (170), Expect = 3e-09, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPI 110
+LKE+PS + KH+PI
Sbjct: 91 MLKERPSPDGKHKPI 105
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 104/454 (22%), Positives = 185/454 (40%), Gaps = 97/454 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + A + +P S ++TLE + ET +A+ + +L L+ L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
K + SFW PY+ L Q P+ ++ +L +L GSP +I ++ +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
YN++ V F QY +F F A + S + + G + T++F
Sbjct: 276 YNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGINIKG----VKTDAF 318
Query: 288 ASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLA--YSSKCKAMLAAVD 345
PL+ N K P L + YS + + + D
Sbjct: 319 V---------------PLADMLNHKR--------------PKLTSWCYSEEKQGFIIETD 349
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 405
+ + DR G+ I G + NS+ L+NYGFV +DN + + V A DP Q
Sbjct: 350 EKI----DR----GQMIFDSYGRKCNSRFLLNYGFVVDDNDANEVNVTVAAEFNDPLIQL 401
Query: 406 K--------------RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 451
K R++ +G + F + + + + Y+R + D +++Q
Sbjct: 402 KEDATEEQLKQPKTFRLIMDTDGINEITHFL------EATVMEFMSYIRFLVIRDQTQLQ 455
Query: 452 SVISS-----LGP--ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 504
+++ + P I P+ E + D + L+ YP TL +D+ +L +L
Sbjct: 456 FLLNERESKYIKPTKIQPLGIHNELDMWDLIRRICYVSLSRYPTTLEQDKEILQICDLTT 515
Query: 505 KKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 538
+R L EK++L Q + M LL +
Sbjct: 516 NQRNCLILRMGEKEILKFYYQFSEKMKQLLSNFN 549
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 91/199 (45%), Gaps = 20/199 (10%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHL 273
E F +F + S +V L
Sbjct: 232 NIETFLWSFGILFSRLVRL 250
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 82/382 (21%), Positives = 159/382 (41%), Gaps = 66/382 (17%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQR 275
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + R
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSV------MTR 126
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
P L+ L C T ++ +N +
Sbjct: 127 QNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE----------------------- 163
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 164 ---------DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 214
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD--------- 446
++ D Y K V R G + VF +H+ E + +L +LR+ +++
Sbjct: 215 VSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLG 273
Query: 447 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 506
S + + + PVS E + L D L Y T+ ED+ +L + +L +
Sbjct: 274 DSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRA 333
Query: 507 RVATQLVRMEKKMLNACLQVTA 528
+A +L EK++L ++ A
Sbjct: 334 TMAIKLRLGEKEILEKAVKSAA 355
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 95/423 (22%), Positives = 177/423 (41%), Gaps = 59/423 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ D+ + T + F AF ++S R +LV
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR------------GQNLV 228
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ + P T + ++ A F+ + +L ++ V +
Sbjct: 229 LIPLADLINHNPAIKTEDYAYEIKGAGLFS---------------RDLLFSLKSPVYV-- 271
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAGE + + + + N++L ++YGFV+ + + + + DP + DK +A+
Sbjct: 272 ----KAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAE 327
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPC 466
N F + G+ A ML YLRL + D ++S+ ++ G + PVS
Sbjct: 328 SNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRT 385
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
E + + D K+ L+G+ T+ EDE +L L P+ +A ++ EK++L Q+
Sbjct: 386 NEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQI 445
Query: 527 TAD 529
D
Sbjct: 446 FKD 448
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 121/298 (40%), Gaps = 61/298 (20%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ G+ V L++T R+ + E + ++E + LAL+ + K GK S W P
Sbjct: 70 IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L RG +++ + W + EL L S + R I +++ + V
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFDLVQAV---- 177
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
+YP ++ E T E FK A+ CV + G E+ S
Sbjct: 178 ---VNKYP-ELFGETVTLESFKHAY-----CVA------SSRSWGVEALGS--------- 213
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 356
+VP + + S +A+LA ++ ++V D+ Y
Sbjct: 214 ----------------------ITMVPF-VDMFNHDSSARALLAYYEEEGYAEVVADKDY 250
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 414
G +V+ G PNS L +++GF DNP+D + + + DP +K + + +G
Sbjct: 251 NQGSQVVITYGTLPNSSLALDFGFTLPDNPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 64/294 (21%), Positives = 123/294 (41%), Gaps = 58/294 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +T + +L L ++ +A LA ++ EKK G+ S
Sbjct: 65 ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L +Q G+L + + W+E+EL + S E +++ I++++ + +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+ + F Y A T +F V + +G+ ++ S
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVEL--------------PVGSRAWGS----- 217
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-------A 347
AL+P L + +A++ + DD +
Sbjct: 218 -----------------------TNGLALIPFA-DFLNHDGVSEAIVMSDDDKQCSEVQS 253
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 401
+Q++ DR Y GE +++ G N+ L++++GF N YD++ ++ + DP
Sbjct: 254 LQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQVQIQFDIPKHDP 307
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 105/494 (21%), Positives = 186/494 (37%), Gaps = 91/494 (18%)
Query: 67 SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
+++ K+E D + + W+ +G K +K + E + ++ D++ G+
Sbjct: 56 GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111
Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
+P L ++T + G LL N + + LALYL+ E S P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
Y++ L + + W + L GSP + G R+Y +F
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY- 217
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
LF P + T FT+E F A VQS
Sbjct: 218 -QLFDNNPGILQTSNFTYEAFIWAVATVQS------------------------------ 246
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPY 356
NP V + AL+P ++ K + V + + Y
Sbjct: 247 ---------RQNP---VGGGQEMALIPFWDFCNHSSHGGKITTFIDPVKHVLTCSAAKSY 294
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ--YQDK-RMVAQRN 413
K GE + ++ GP+PNS+ + GF + N D + L+ ED + DK ++ +R
Sbjct: 295 KKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERC 354
Query: 414 GKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVS--DTSEM-----------QSVISSLGP 459
G Q + E + ++++P+ R+ +S +T ++ + P
Sbjct: 355 GLRVGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKP 414
Query: 460 --ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 517
+S E+ L D KARL+GYP TL++DE + + N ++R ++ EK
Sbjct: 415 EAFNIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQEMKN-NPTTQRRYVLYILINEK 473
Query: 518 KMLNACLQVTADMI 531
K+L ++ +I
Sbjct: 474 KILERNIKYVEQLI 487
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 94/427 (22%), Positives = 172/427 (40%), Gaps = 75/427 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ----SCVVHLQRCGCTPQMGTESFA 288
+ D+ + T + F AF + + + ++ P + TE +A
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILNRESLTSMFEFEQINHNPAIKTEDYA 240
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
+ + L R L L P+
Sbjct: 241 YEI---------------------KGAGLFSRDLLFSLKSPVYV---------------- 263
Query: 349 QLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
KAGE + + + + N++L ++YGFV+ + + + + DP + DK
Sbjct: 264 --------KAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKL 315
Query: 408 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CP 462
+A+ N F + G+ A ML YLRL + D ++S+ ++ G + P
Sbjct: 316 DIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELP 373
Query: 463 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 522
VS E + + D K+ L+G+ T+ EDE +L L P+ +A ++ EK++L
Sbjct: 374 VSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQ 433
Query: 523 CLQVTAD 529
Q+ D
Sbjct: 434 IDQIFKD 440
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 114/281 (40%), Gaps = 60/281 (21%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L E +G +S W YI L + +PL ++E + A+L + + E
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
R K EY E A SL + PTE FTF+++ + S V RC
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLY------IWSATVFSSRC--- 173
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPL--SPTHNPKEKVSLARRFALVPLGPPLLAYSSKC 337
F+S+L++ + T PL S H PK+ + F
Sbjct: 174 -------FSSNLIYKDSESTPILLPLIDSLNHKPKQPILWNSDFQ--------------- 211
Query: 338 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
+ +VQL+ G + GP+ N +LL+ YGF DNP+D + ++ A++
Sbjct: 212 ------DEKSVQLISQELVAKGNQLFNNYGPKGNEELLMGYGFCLPDNPFDTVTLKVAIH 265
Query: 398 TEDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAISDML 436
+ P K + + + +LS VF + +KE +L
Sbjct: 266 PDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDKEIFQKIL 306
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 141/332 (42%), Gaps = 65/332 (19%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+ D +L WM +NG K L + + E R + A++ Q GD S+P L++
Sbjct: 28 DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81
Query: 135 TLERVLGNETIAELL---TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T + VL + + + + T KLS + + YL+ E+ + K SFW YI+ L +
Sbjct: 82 TTQNVLSS-YLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------ 134
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ +P+ ++ E+ +L K ++ + E I Y EL ++ + S F +
Sbjct: 135 -SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFK----- 187
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNP 311
F + F+ A+ V + V++ LQ NP
Sbjct: 188 GIFEYSAFRWAWCTVNTRSVYM--------------------LQ------------EQNP 215
Query: 312 KEKVSLAR-RFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 368
+S+ R +AL P LL +++ + KA V ++ K + + ++ GP
Sbjct: 216 --HLSIERDHYALAPF-LDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMFIYYGP 272
Query: 369 QPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
N KL I YGFV N ++ VVE L+ ED
Sbjct: 273 HDNVKLFIEYGFVLPQNQHN--VVE--LDFED 300
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 108/466 (23%), Positives = 181/466 (38%), Gaps = 70/466 (15%)
Query: 84 WMHKNGLPPC-KVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL-G 141
W+ K G+ K+ LK+ S V A+ D + + F +P + V+ + V G
Sbjct: 16 WLSKIGVRINPKMTLKDLKSEGRGR----GVVAAADFEEDEVVFCIPRTAVLNVNNVFAG 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
++ A ++ L +M E +Q S W PY+ L ++ ++S + WS
Sbjct: 72 QDSGASKEALLQMPNWLALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWS 123
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
E ELA L S A+ I R E +F ++ + F E+ Q
Sbjct: 124 EEELAELQASSV-------AKKIGRSSAE---------EMFTKHISPLGLGEFNVELCHQ 167
Query: 262 AFVAVQSCVVHL--QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
+ + + + G E LV EK L+
Sbjct: 168 VASVIMAYAFDIPEEEPAKQENGGAEGETDDLVS--------------DDGEDEKTILS- 212
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
++PL L A + + A + ++ +++ +P AGE I G P S LL YG
Sbjct: 213 ---MIPLADMLNADAERNNARIYYENEDLEMRTIKPIMAGEEIFNDYGQLPRSDLLRRYG 269
Query: 380 FVDED-NPYDRLVVEAA----LNTEDPQ---------------YQDKRMVAQRNGKLSVQ 419
+V E+ YD + + +A L TE PQ +++ +A R G L
Sbjct: 270 YVTENYAQYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVALADREGILEDS 329
Query: 420 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 479
A E+ AI D L L ++ D ++++++S + S V L
Sbjct: 330 YDVNIANAEERAIPDELLALLYLFLLDNENLEAIVTSQSALPSRSKLATELVGKVLVKVL 389
Query: 480 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
+ R A Y TL EDE +L NL + +A Q+ EK++L ++
Sbjct: 390 RHREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRLAVE 435
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 105/446 (23%), Positives = 183/446 (41%), Gaps = 88/446 (19%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
+R +H A + ++ G+ +P ++TLE L E I +L+ + + L+ L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEIL 218
+Y++ EKK KSFW P++ L VE P+L+++ EL +L GSP ++
Sbjct: 428 IYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVK 477
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQRC 276
ER E I ++ +Q IP A T + F A + S + L
Sbjct: 478 ERRECITQD--------------YQAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL--- 520
Query: 277 GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSK 336
+ T++F PL+ N RR A G K
Sbjct: 521 -FINKKRTDAFV---------------PLADMFN-------HRRPAYTNWG----FCEDK 553
Query: 337 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
ML A +D + G+ I CG + NS+ L+NYGFV ++N + + +
Sbjct: 554 GGFMLKASEDI---------RRGDQIYYSCGRKCNSRFLLNYGFVVKNNEANEIQLRVDF 604
Query: 397 NTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ---- 451
+ +D K +M+ +R K +F +H E++++ + +LR + D ++
Sbjct: 605 DKKDETLPIKLQMIGKR--KPESLIFRIHINYEEKSVLEFFGFLRFVLIRDYIVLEKFHE 662
Query: 452 -SVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 508
S P+ P S E+ + ++ + YP TL ED+ +L L ++
Sbjct: 663 MSEGKEFDPLRTPPFSIENEKQMWTEIHKICAEIMIQYPTTLDEDKKILETSKLTINQKN 722
Query: 509 ATQLVRMEKKMLNACLQVTADMIMLL 534
L EK++L + + M LL
Sbjct: 723 CVILRMGEKEILMYYITMADRMKKLL 748
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 112/477 (23%), Positives = 187/477 (39%), Gaps = 70/477 (14%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
VA V S+ + D L +W+ G+ KV ++ P R VAA ED+
Sbjct: 44 VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
GDA ++P SL +T++ L + A + +AL L++E+ G+KS W Y+
Sbjct: 99 GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L Q +++PL WS E+A LTG+ ++L+ A G + T + S
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY--DSYARGTWARLKES 202
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-------SFASHLVH 293
F P P++AF F AF ++S RC G + A+H
Sbjct: 203 AFDANPDVFPSDAFDEPSFLWAFGILRS------RCQAPVDQGADIALVPGLDMANH-SG 255
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
L + T K G LL K +LA
Sbjct: 256 LSSQTWTLNNGGVAAVFGGGKSG----------GSMLLRTEKGAKGLLA----------- 294
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQR 412
K E + + + +++L ++YGF D V+ A+ DP DK V +
Sbjct: 295 ---KGAEVFMNYGQRKIDNQLALDYGFTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEV 351
Query: 413 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVI--SSLGPIC-PVSPCM 467
G F + A + E ++ ++RL + D ++++ + G I PVS
Sbjct: 352 AGLREAPSFVLRAFEDPE--PELRVFMRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLN 409
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
E+ + + + L GY + +D + D + + R+A ++ EK+ L L
Sbjct: 410 EQEACGTMINGCEEALRGYATRVEDDRRVAEDPGVGHRLRLAARVRMGEKQALADAL 466
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/376 (21%), Positives = 156/376 (41%), Gaps = 67/376 (17%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 12 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 66 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y V Q +P+
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 169
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
+P ++FT+E ++ A +V + R P L+ L C T +
Sbjct: 170 NKLPLKDSFTYEDYRWAVSSV------MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLI 223
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
+ +N + DD + V + ++AG+ I ++
Sbjct: 224 TTGYNLE--------------------------------DDRCECVALQDFQAGDQIYIF 251
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H
Sbjct: 252 YGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 311
Query: 426 GREKEAISDMLPYLRL 441
E + +L +LR+
Sbjct: 312 -TEPPISAQLLAFLRV 326
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 104/471 (22%), Positives = 200/471 (42%), Gaps = 83/471 (17%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
KSW+ ++G+ + ++E S E V A +D++ ++P ++T E
Sbjct: 83 FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136
Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +A L+ N++ + CLALYL E+ S + PYI + ++ +
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---F 254
L + E+ YL G+ + + + + I R++ L V+ GS ++ +P +A F
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAF 246
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
TF+ ++ AV TTR PTH
Sbjct: 247 TFDTYRWCASAV---------------------------------TTRQNKIPTHVGDVL 273
Query: 315 VSLARR--FALVPL--------GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 364
L AL+P+ GP AY+ A+ ++ + + +K GE + +
Sbjct: 274 GDLDENSTLALIPMWDMFNHAIGPLSTAYN--------ALTRGIECLAMQDFKTGEQVKI 325
Query: 365 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHV 423
G + NS LLI+ GFV +++P+D++ + ++ +DP Y K +++ + N ++S Q
Sbjct: 326 CYGARTNSDLLIHNGFVMKESPFDKVRIHLGVSQKDPLYSLKAKLLEKLNVEVSGQFAVC 385
Query: 424 HAGREKEAISDMLPYLRLGYVSDT------SEMQSVISSLGPI---CPVSPCMERAVLDQ 474
+L +LR+ ++++ + ++ +SSL I V + V +
Sbjct: 386 SMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQKNELSSLREIYISGEVKFKSDVKVWEF 445
Query: 475 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
L + K L G+ E M+ D +L + ++A Q E ++L+AC+
Sbjct: 446 LENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSKLALQFRIEEHRILSACVN 496
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/382 (21%), Positives = 158/382 (41%), Gaps = 66/382 (17%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQR 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + R
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSV------MTR 126
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
P L+ L C T ++ +N +
Sbjct: 127 QNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE----------------------- 163
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 164 ---------DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 214
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD--------- 446
++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 215 VSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 273
Query: 447 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 506
S + + + PVS E + L D L Y T+ ED+ +L + +L +
Sbjct: 274 DSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRA 333
Query: 507 RVATQLVRMEKKMLNACLQVTA 528
+A +L EK++L ++ A
Sbjct: 334 TMAIKLRLGEKEILEKAVKSAA 355
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/382 (21%), Positives = 158/382 (41%), Gaps = 66/382 (17%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 10 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQR 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + R
Sbjct: 62 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSV------MTR 108
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
P L+ L C T ++ +N +
Sbjct: 109 QNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE----------------------- 145
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 146 ---------DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 196
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD--------- 446
++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 197 VSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 255
Query: 447 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 506
S + + + PVS E + L D L Y T+ ED+ +L + +L +
Sbjct: 256 DSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRA 315
Query: 507 RVATQLVRMEKKMLNACLQVTA 528
+A +L EK++L ++ A
Sbjct: 316 TMAIKLRLGEKEILEKAVKSAA 337
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 90/192 (46%), Gaps = 23/192 (11%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-RMVAQ 411
R + AGE +++ GP+ N LL YGFV++DNP D + ++ D +D R++ +
Sbjct: 44 RGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVRVLRE 103
Query: 412 RNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQSVISS 456
GKL V V GR EKE ++P RL V D E ++ S
Sbjct: 104 SGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKAAGIS 162
Query: 457 LGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN--LHPKKRVATQLV 513
L +SP E A D L G+ TL+EDEA L+ L +KRVA
Sbjct: 163 LKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVAFSF- 221
Query: 514 RMEKK-MLNACL 524
RMEKK +L+A +
Sbjct: 222 RMEKKRVLDAAI 233
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 101/472 (21%), Positives = 181/472 (38%), Gaps = 79/472 (16%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLS 306
+ I + F + C S TR +
Sbjct: 230 NAVSA------ILRDVFTYEKYCWA------------------------VSTVMTRQNII 259
Query: 307 PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 366
P+ + + AL+P+ SK A + + R +K E I +
Sbjct: 260 PSKDGSLMIH-----ALIPMWDMCNHEDSKITTDFNATLNCCECYALRDFKKAEQIFISY 314
Query: 367 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 426
GP+ NS ++ GFV DN D + ++ DP ++++ + + +V F + G
Sbjct: 315 GPRTNSDFFVHSGFVYMDNEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG 374
Query: 427 REKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL 483
E ISD +L +LR+ + E+ I S + + + +E V + + + RL
Sbjct: 375 --TEPISDTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRL 431
Query: 484 ----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
A YP TL ED +L + L K++A QL EK++L L+ I
Sbjct: 432 QLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 118/288 (40%), Gaps = 74/288 (25%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W + ELA+L SPT E +E + IK E++ L+
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
F + D+ E + F A+ T Q+ F +H +
Sbjct: 120 NVF-------PWCRDVLGEV-SLTDFMHAY--------------STDQIPFADFFNHDHN 157
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
QT L+Y + +A D
Sbjct: 158 CQTR---------------------------------LSYDKEKDCAVAVAD-------- 176
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 401
+ YKAG+ I + G PNS L ++YGF NP++++ V ++ DP
Sbjct: 177 QDYKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQVEVPMGVSLTDP 224
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 99/227 (43%), Gaps = 28/227 (12%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 373
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 503
Query: 374 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV---FHVHAGREKE 430
L ++ GFV E+N YD + ++ DP Q + + GKLS+ F + G
Sbjct: 504 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKG-ASP 559
Query: 431 AISDMLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLA 484
+L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 560 IDGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLS 618
Query: 485 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 619 TYKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 665
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 99/227 (43%), Gaps = 28/227 (12%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 373
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 14 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 66
Query: 374 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV---FHVHAGREKE 430
L ++ GFV E+N YD + ++ DP Q + + GKLS+ F + G
Sbjct: 67 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKG-ASP 122
Query: 431 AISDMLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLA 484
+L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 123 IDGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLS 181
Query: 485 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 182 TYKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 111/463 (23%), Positives = 175/463 (37%), Gaps = 72/463 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE-RVLGNETIAELLTTNKLSE-LACLALYLMYEKKQGK 172
A D+ G A FS+P L ++L L E + L E L L +M+E+ G
Sbjct: 34 ALRDIPEGHALFSIPRGLTLSLRTSSLPTRFGIEAWKQHGLHEGWVGLILCMMWEESLGG 93
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ + +P+ WSE E+ L G+ +I R + + Y +L
Sbjct: 94 SSKWSEYMSSMPD-------TFTTPMFWSEEEIQELKGTAVVDKI-GRDDAERDYYEKLI 145
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASH-- 290
LF + IP + + E + + S H++ ++ +H
Sbjct: 146 PAIKSRPDLFLEGS--IP-KFYALERYHMMGSRILSRSFHVEPWRGDDGHESDESETHGD 202
Query: 291 -----------LVHLQTSCTTTRTPLSPTH---------------NPKEKVSLARRFALV 324
+ S TT + P H + E A+V
Sbjct: 203 APHSDDPNGMDVDADNPSGDTTADSIEPPHEEEVSQLDSAFDGNTDDDEDAETPADVAMV 262
Query: 325 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-- 382
P+ L A A L + +++V +P AGE I G PNS LL YG VD
Sbjct: 263 PMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHVDVV 322
Query: 383 --------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK---- 429
E NP D + + A L R + G L V+V F + +
Sbjct: 323 PLGEPLSGEGNPADVVEIRADLVV-----SAVRKARKAAGDLQVRVDFWLEEADDDTFVL 377
Query: 430 ----EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 485
E ++L ++RL +S T + + + + G + +E +L + D K RL
Sbjct: 378 MTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKERLKE 433
Query: 486 YPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 525
YP T+ EDE++L + NL KR A + EK++L LQ
Sbjct: 434 YPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 112/457 (24%), Positives = 176/457 (38%), Gaps = 77/457 (16%)
Query: 83 SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+W K G K IL +KP + AS D+Q G++ VP++ V++
Sbjct: 46 AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V T+ +L ++ L LAL L+ E+ KS Y L G +P
Sbjct: 96 NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLWSE E L G+ + + + +L LF P P FT
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLP 201
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
F A AV+S P G + + LV L + T LS
Sbjct: 202 NFVWAVAAVRSR-------SHPPLEGDKIALAPLVDLVSHRRAANTKLS----------- 243
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP-QPNSKLLI 376
R L G +A V+ R + GE++ + P + + +L+
Sbjct: 244 VRSSGLFGRG------------QVAVVE------ATRAIRKGEALGMDYAPGKLDGPVLL 285
Query: 377 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 436
+YG +D +P + L+ D DK + + G + + +++ +M+
Sbjct: 286 DYGVMDTASPKPGYSLTLTLDESDKFVDDKADIVEGAGLRPSMTYSITP--DQQPGEEMM 343
Query: 437 PYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 494
+LRL + D ++S+ + VS E AV LA+ +A LAGYP TL +D
Sbjct: 344 AFLRLMNIKAMDAFLLESIFRN-----EVSEGNEEAVCAMLAEGARAALAGYPTTLDQDL 398
Query: 495 AMLTDYNLHPKKRV-ATQLVRM-EKKMLNACLQVTAD 529
A L + R A LVR+ EK+ L+A + D
Sbjct: 399 AALRSNSTPLGSRAEAALLVRLGEKESLDAVARFFED 435
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 96/424 (22%), Positives = 169/424 (39%), Gaps = 67/424 (15%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+ + A +L G+ VP L + + V + + L ++L ++ E
Sbjct: 39 PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+
Sbjct: 99 ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+ ++ AG + D+ TF+ F AF ++S V P++ + A
Sbjct: 152 DNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVF--------PELRGDKLA 196
Query: 289 ----SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+ L++ T+ + + K K L
Sbjct: 197 LIPFADLINHNGDITSKES----------------------------CWEIKGKGFLGR- 227
Query: 345 DDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 403
D L K+GE I V + + N++L ++YGF + ++ D + ++ DP Y
Sbjct: 228 DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLTLEISESDPFY 287
Query: 404 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISS 456
+DK +A+ NG F V G + M+ YLRL + T V
Sbjct: 288 EDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEH 345
Query: 457 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 516
L PVS E ++ + + K+ LA Y T+ EDE +L +L + ++A ++ E
Sbjct: 346 LE--LPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGE 403
Query: 517 KKML 520
KK+L
Sbjct: 404 KKVL 407
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 116/288 (40%), Gaps = 71/288 (24%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPK--TRLSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W ELA+L SPT E +E + IK E++ L
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLT- 118
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
+A Q +P+ C +G S L
Sbjct: 119 ---LANK--QVFPW------------------------------CRDALGEVS----LTD 139
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
+ +T + P + N L+Y + +A D
Sbjct: 140 FMHAYSTDQIPFADFFNHDHNCQTR------------LSYDKEKDCAVAVAD-------- 179
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 401
+ YKAG+ I + G PNS L ++YGF NP++++ V ++ DP
Sbjct: 180 QDYKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQVEVPMGVSLTDP 227
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 80/416 (19%), Positives = 173/416 (41%), Gaps = 69/416 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
+ A + +Q G+ +P +L+++++R + NE + L+E L ++L+ + G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KS W Y L R+ + W+ ++ +L GS T L E IK ++ L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
+ F L +YP I F ++ A + S + LQ + F +H
Sbjct: 212 EKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNLKKVSLVPYADFMNH- 264
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
P S ++ +K+S ++ +V +
Sbjct: 265 -----------NPFSTSYINSKKISFSKNHEIV-------------------------MY 288
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-- 409
D+ Y + I G + N +LL+ YGF+ E NP+D + + +L+ +D ++ K+
Sbjct: 289 ADKDYNKFDQIFTTYGQKTNLELLLLYGFILERNPFDSIELRISLSDKDSFFEKKKQFMI 348
Query: 410 -AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV-ISSLGPICPVSPCM 467
++ +++ +F+ +E + +LR +S+ E+ S +S + +
Sbjct: 349 ECEKTSEITFPIFYYKYPKE------LYEFLRFC-ISNQEELGSTDLSDFNFNDENNYEI 401
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDE---AMLTDYNLHPKKRVATQLVRMEKKML 520
E+ + + + L Y +SE++ ++ +++ + +++A + + EKK++
Sbjct: 402 EKIIRKLVLFSCEKLLKNYSKKVSEEKILNSLNSNFLISKNQKMALKQSKCEKKII 457
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 65.5 bits (158), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/312 (21%), Positives = 131/312 (41%), Gaps = 64/312 (20%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
F+A + Q P I TE E F A+ V G+ ++ +
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALV----------------GSRAWEN----- 217
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVD 353
++R +L+P + +L D+ + ++ D
Sbjct: 218 -----------------------SKRISLIPFADFMNHDGLSASIVLRDEDNQLSEVTAD 254
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
R Y G+ + + G N+ L++++GF N +D + ++ + +DP K + Q +
Sbjct: 255 RNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTH 314
Query: 414 GKLSVQ---VFH 422
+V+ +FH
Sbjct: 315 HTRTVKDINIFH 326
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 114/253 (45%), Gaps = 15/253 (5%)
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+SF S TR PT + +V+LA L+PL +
Sbjct: 23 DSFTYEDYRWAVSSVMTRQNQIPTEDGS-RVTLA----LIPLWDMCNHTNGLITTGYNLE 77
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 78 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 137
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVIS 455
K V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 138 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 196
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
PVS E + L D L Y T+ ED+++L +++L + ++A +L
Sbjct: 197 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLG 256
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 257 EKEILEKAVKSAA 269
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/460 (21%), Positives = 176/460 (38%), Gaps = 97/460 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ F++P ++ +E GN+ E + L L L
Sbjct: 44 IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++YE QG S W Y L + ++ + W +L YL GS ++I +
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQM 282
L + AG F Q P+E+ E+ + A R G
Sbjct: 157 DEMFRSRILPVISANAGIFFPQ-GVSPPSES---ELLQLA-----------HRMGSI--- 198
Query: 283 GTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLA 342
++ L+ + + + K L +VP+ +L ++ A +
Sbjct: 199 ----IMAYAFDLENEEEPEQEDEEWVEDREGKTMLG----MVPMAD-ILNADAEFNAHVN 249
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEA 394
+D + + RP KAGE I+ + GP PNS+LL YG+V + P+D +V++
Sbjct: 250 HGEDDLSVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQS 307
Query: 395 ALNTE---------------DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 439
L + DP+ + V +R H+ + + +S L
Sbjct: 308 TLTEQLRLTDEVWKQVAEHVDPEDLEDVFVLERESGEPDSEGHLQTPAKVQEVSAEL--- 364
Query: 440 RLGYVSDTSEMQSVISSL----GPICP-------VSPCMERAVLDQLADYFKARLAGYPA 488
+++ V+ ++ G + P V C+ + L +L LA YP
Sbjct: 365 -------EEQLKDVLKAIKKVRGDLIPDKRKRDEVYQCVVVSTLQKL-------LAQYPT 410
Query: 489 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
T EDEAML N+ ++++A ++ EK+++ LQV
Sbjct: 411 TAEEDEAMLASGNVTSRQKLAVEVRLGEKRLIKEALQVAG 450
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 112/253 (44%), Gaps = 15/253 (5%)
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
ESF S TR PT + +V+LA L+PL +
Sbjct: 12 ESFTYEDYRWAVSSVMTRQNQIPTEDGS-RVTLA----LIPLWDMCNHTNGLITTGYNLE 66
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AGE I ++ G + N++ +++ GF ++N +DR+ ++ ++ D Y
Sbjct: 67 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYA 126
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 456
K V R G + VF +H E + +L +LR+ +++ + + I +
Sbjct: 127 MKAEVLARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFT 185
Query: 457 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
LG PVS E + L D L Y T+ ED+A L L P+ +A +L
Sbjct: 186 LGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLG 245
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 246 EKEILEKAVRSAA 258
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 104/469 (22%), Positives = 185/469 (39%), Gaps = 74/469 (15%)
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LS 155
N ++ A ED+ A F++P ++ +E + I + +K L
Sbjct: 38 NAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLD 97
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
+ L L ++YE QG+ S W PY L + ++P+ WS+ EL L S +
Sbjct: 98 SWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRH 150
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
+I +A+ A ++FQ+ +P EIF + R
Sbjct: 151 KI-GKAD---------------AENMFQKTL--LPIIRSNAEIFNAGNKTDAELIEIAHR 192
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
G T ++ L+ + K + +VP+ +L +
Sbjct: 193 MGST-------IMAYAFDLENDEEEEEEADGWVEDRDGKSMMG----MVPMAD-ILNADA 240
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PY 387
+ A + ++++ + RP KAGE I+ + GP PNS+LL YG+V E + P+
Sbjct: 241 EFNAHVNHEEESLTVTSLRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPW 300
Query: 388 DRLVVEAAL--NTEDPQYQDKRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPY 438
D +VE+AL N P +++ Q G L V G + P
Sbjct: 301 D--IVESALTSNFGIP----GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPA 354
Query: 439 LRLGYVSDTSE-MQSVISSLGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSED 493
D E +++ + + P + +R + Q LA +A +A YP ++SED
Sbjct: 355 RFESMPEDLQEQLKTFLKGIKKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISED 414
Query: 494 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 542
E +L +L+ + R+A + EKK+L + ++ + + D P
Sbjct: 415 ENLLKQ-DLNQRTRMAIAVRLGEKKLLQEAITASSGDVEMTMDDESGPA 462
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/304 (23%), Positives = 129/304 (42%), Gaps = 42/304 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++D++ GD FS+P S++++ + ++EL ++LS + L L +MYE ++
Sbjct: 41 VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW PY L R+ +P+ W++ +L L G+ ++I + E + +NEL+
Sbjct: 97 DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQRCGCTPQMGTESFASH 290
+ ++YP + T E+F + + S LQ+ E
Sbjct: 149 PI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEE 201
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
+ L ++VP+ L + A L D++Q+
Sbjct: 202 EEEEEEEEEEEEEGL---------------ISMVPMADMLNHKTGFNNARLFHEPDSLQM 246
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQ 404
+ K GE I G N+ LL YGFVDE N +D L+VE +D +
Sbjct: 247 RAIKDIKEGEQIYNTYGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALK 306
Query: 405 DKRM 408
++++
Sbjct: 307 ERKI 310
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 134/339 (39%), Gaps = 43/339 (12%)
Query: 52 RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
RF R S+ VS + L++W+ K C + L+ P
Sbjct: 49 RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
A + G VP ++T E + + L+ + L+ + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNE 230
+ SFW PYI L ++ +L P+LWS+ +L GSP ++ ER I RE E
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASH 290
+AG+ + P E + AVQ L + SF+
Sbjct: 224 ---AMLLAGA-GRLTPRGNEGEPISI-----TETAVQWAATMLL---------SRSFS-- 263
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---A 347
++LQT L P ++ + ALVP L SS + D
Sbjct: 264 -LNLQTQ------KLRPGSFAEDTI------ALVPWADMLNHSSSAGRESCLVYDQKSGV 310
Query: 348 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDEDN 385
L R Y GE + GP + S+LL++YGFVDE+N
Sbjct: 311 ATLQAHRTYSEGEQVFDSYGPSCSPSRLLLDYGFVDEEN 349
>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
Length = 206
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)
Query: 369 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 428
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G
Sbjct: 10 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67
Query: 429 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 482
E++ MLPYLRL + T ++++ + G + PVS E A+ + + K+
Sbjct: 68 -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126
Query: 483 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 108/466 (23%), Positives = 184/466 (39%), Gaps = 92/466 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
+ A +D+ A F++P ++ +E + + ++ + +L + L L
Sbjct: 610 IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
LMYE QG+KS W PY L + ++P+ WSE+EL L S + +I +A+
Sbjct: 670 VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKI-GKAD 721
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQM 282
A S+F++ +P +F + V R G T
Sbjct: 722 ---------------AESMFRKTL--LPIIRKNSSVFGGENRSDDDLVEIAHRMGST--- 761
Query: 283 GTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLA 342
++ L+ +E S+ +VP+ +L ++ A +
Sbjct: 762 ----IMAYAFDLENDEDEEEEETDGWVEDREGKSM---MGMVPMA-DILNADAEFNAHVN 813
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDP 401
++++ + RP KAGE I + GP PNS+LL YG+V E + YD VVE + +
Sbjct: 814 HEEESLTVTSLRPIKAGEEIFNYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES 871
Query: 402 QYQDKRMVAQRNGKLSVQVFHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 459
V + N +S QV H E+E D R +T E+ S + GP
Sbjct: 872 -------VMRLNFGISGQVLEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGP 919
Query: 460 -----------------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSE 492
+ V A+ D+ LA +A + YP + SE
Sbjct: 920 ARFESMPEDLQEQLKTFLKGVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSE 979
Query: 493 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA--DMIMLLPD 536
D +L +L + R+A ++ EKK+L + T+ D+ M + D
Sbjct: 980 DGILLQRQDLSQRTRMAIEVRLGEKKLLQEAIASTSSVDVEMTVDD 1025
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 98/473 (20%), Positives = 184/473 (38%), Gaps = 79/473 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
SK+ + +G +W+ +NG + E P ++ + A + + +P
Sbjct: 75 SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128
Query: 132 LVVTLERVLGNETIAELLTTNK------LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
L+ ++ + EL+T + ++A LA+ L+ E+ + + S W PY+ L
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
+ L + ++ L GSPT L++ I R+Y+ + V+ +
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPL 305
D+ FT+E + A S TR L
Sbjct: 235 LRDV----FTYERYCWA---------------------------------VSTVMTRQNL 257
Query: 306 SPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 365
P+ + + AL+P+ + + A + + R +K GE I +
Sbjct: 258 IPSEDGSRMIH-----ALIPMWDMCNHENGRITTDFNATSNYCECYALRDFKKGEQIFIS 312
Query: 366 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 425
GP+ NS ++ GFV +N D + ++ D +++ + + +V F +
Sbjct: 313 YGPRTNSDFFVHSGFVYMENKQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKL 372
Query: 426 GREKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKAR 482
G E ISD +L +LR+ + +E+ I S + + + +E V + + + R
Sbjct: 373 G--TEPISDLLLAFLRV-FSMRKAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTR 429
Query: 483 L----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
L A YP TL ED +L + L K++ QL EK++L L+ I
Sbjct: 430 LQLLIANYPTTLKEDLQLL-ETTLPQIKKLTIQLRVTEKRILQGALEYVEQWI 481
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 92/439 (20%), Positives = 175/439 (39%), Gaps = 76/439 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A + + A + +P S ++TLE + T+A+ + +L L+ L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ SFW PYI L P+ ++ ++L +L GSP +I ++ ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
N++ V F QY F F A + S + + G + T++F
Sbjct: 277 NDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGINING----VKTDAFV 319
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLA--YSSKCKAMLAAVDD 346
PL+ N K P L + YS + + + D+
Sbjct: 320 ---------------PLADMLNHKR--------------PKLTSWCYSDEKQGFIIETDE 350
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
++ G+ I G + NS+ +NYGFV E N + + + + DP Q K
Sbjct: 351 KIE--------RGQMIFDSYGRKCNSRFFLNYGFVVEGNDANEVNLAVEADQNDPLLQLK 402
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG-------P 459
+ + + + F + ++ A+ D + ++R + D ++++ +++
Sbjct: 403 EQAIKESLQWP-KNFKLLMDTDETAVIDFMSHIRFLVIRDEAQLKLLLNQKNSQNFKSTK 461
Query: 460 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
P+ E + + K L YP T +D+ +L L +R L EK++
Sbjct: 462 TQPLGIYNELEMWKMIGRICKKTLKQYPTTFEQDQEILQICELTTNQRNCLILRMGEKEI 521
Query: 520 LNACLQVTADMIMLLPDVT 538
L Q + M LL +
Sbjct: 522 LKFYFQFSERMKELLSNFN 540
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/337 (21%), Positives = 133/337 (39%), Gaps = 63/337 (18%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ +P+ + +L L GSP L+ I R+Y ++ ++ P
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSND-----PA 240
Query: 248 D-IPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLS 306
I + FT+E ++ A + S R P
Sbjct: 241 SVILKDTFTYEYYRWAVSTLMS------RQNTVPS------------------------- 269
Query: 307 PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 366
+ NP E VS AL+PL S + + D Y A E + ++
Sbjct: 270 -SDNPSENVS-----ALIPLWDMFNHRSGRLSTDFVKSSNVCVCYADGDYAADEQVYIFY 323
Query: 367 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 403
G + N+ L++ GFV DN +D + + ++ DP Y
Sbjct: 324 GVRTNADFLVHNGFVYPDNEHDAVKIRLGVSRSDPLY 360
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/288 (20%), Positives = 122/288 (42%), Gaps = 61/288 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP ++ +T + + + ++ T+++ + LA L+ EKK+G+KS
Sbjct: 75 ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E E + + S E +++ I++E++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
F+A + Q YP I E E F A+ V G+ ++ +
Sbjct: 182 -FVAQAFKQHYPMVI--ERPYLEDFMYAYALV----------------GSRAWET----- 217
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVD 353
++ +L+P + +L+ D+ + ++ D
Sbjct: 218 -----------------------SKGISLIPFADFMNHDGLSASIVLSDEDNQLSEVTAD 254
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 401
R Y G+ + + G N+ L++++GF N +D + ++ + +DP
Sbjct: 255 RNYSPGDEVFIKYGEFSNATLMLDFGFTVPYNIHDEVQIQMDVPNDDP 302
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 95/440 (21%), Positives = 179/440 (40%), Gaps = 76/440 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + + + VP S ++TLE + +T +A+ + +L L+ L+ +L+ E
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE--MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQE 247
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
KK + SFW PY+ L + P+ +++++L +L GSP ++ ++ +K++
Sbjct: 248 KK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKD 300
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
Y ++ V A Q +F+ F A + S + + G + T++F
Sbjct: 301 YCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGINIKG----VKTDAF 343
Query: 288 ASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLA--YSSKCKAMLAAVD 345
PL+ N K P L + YS + + + D
Sbjct: 344 V---------------PLADMLNHKR--------------PKLTSWCYSDERQGFIIETD 374
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 405
+ ++ G+ I G + NS+ L+NYGFV +DN + + V + Q
Sbjct: 375 ENIE--------KGQMIFDSYGSKCNSRFLLNYGFVVDDNNANEVNVMVEPDGTISLIQL 426
Query: 406 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP--- 462
K +++ + + F + + SD + ++R + + E +++ I P
Sbjct: 427 KEGLSRETLQFP-KSFRLVIDPNDVSFSDFMSFIRFILIQEEKEFANLLGKNSYIKPTKI 485
Query: 463 --VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
+S E A + + + L YP TL +D +L L +R L EKK+L
Sbjct: 486 HFISIQNELATWNLIENICIRALNQYPTTLEQDLEILKICELTTNQRNCLILRMGEKKIL 545
Query: 521 NACLQVTADMIMLLPDVTVS 540
N Q + M L + S
Sbjct: 546 NFYKQFSEKMRQLFSNFDFS 565
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 94/423 (22%), Positives = 171/423 (40%), Gaps = 59/423 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ D+ T + F AF ++S R +LV
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR------------GQNLV 249
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ P + N V+ A+ K A L + D L
Sbjct: 250 LI---------PFADLVNHSANVTTEEH-----------AWEVKGPAGLFSWDVLFSLRS 289
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAG+ + + + + N+ L ++YGF+++ + + + + D + DK +A+
Sbjct: 290 PLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAE 349
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPC 466
NG F + E+ MLP+LRL + T ++S+ S G + PVS
Sbjct: 350 TNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRA 407
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
E + + + +A L+GY T+ EDE L + NL + R+A + EK++L +Q+
Sbjct: 408 NEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQI 466
Query: 527 TAD 529
D
Sbjct: 467 FKD 469
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 94/423 (22%), Positives = 171/423 (40%), Gaps = 59/423 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ D+ T + F AF ++S R +LV
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR------------GQNLV 249
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ P + N V+ A+ K A L + D L
Sbjct: 250 LI---------PFADLVNHSANVTTEEH-----------AWEVKGPAGLFSWDVLCSLRS 289
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAG+ + + + + N+ L ++YGF+++ + + + + D + DK +A+
Sbjct: 290 PLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAE 349
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPC 466
NG F + E+ MLP+LRL + T ++S+ S G + PVS
Sbjct: 350 TNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRA 407
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
E + + + +A L+GY T+ EDE L + NL + R+A + EK++L +Q+
Sbjct: 408 NEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQI 466
Query: 527 TAD 529
D
Sbjct: 467 FKD 469
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 96/445 (21%), Positives = 178/445 (40%), Gaps = 65/445 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ + E + EL + L + L L +M
Sbjct: 694 IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
YE G +S W PYI L L ++P+ WSE EL+YL S T +I +A+
Sbjct: 754 YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKI-GKAD--- 802
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
A +F+ IP +F + ++ G +MG+
Sbjct: 803 ------------AEEMFRTRL--IPAIRGNPSVFASSGDCSDEDLI-----GLAHRMGS- 842
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
+ ++ L+ +E S+ A+ + L ++ A + D
Sbjct: 843 TIMAYAFDLENEEAENDDESDGWVEDREGKSMMGMVAMADI----LNADAEFNAHVNHGD 898
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALN 397
+ + + R KAGE I+ + GP PNS+LL YG++ E + P+D V+ +L
Sbjct: 899 EELTVTSIRDIKAGEEILNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLM 956
Query: 398 TEDPQYQDKRMVAQRNGKL------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-M 450
+E QD ++ + K+ + V +G + P + G D E +
Sbjct: 957 SELGVPQD--IMTETMDKMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQL 1014
Query: 451 QSVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 506
++ I L G + + + + + + + Y T++EDE +L +L ++
Sbjct: 1015 KATIKLLQKVDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQ 1074
Query: 507 RVATQLVRMEKKMLNACLQVTADMI 531
R+A ++ EKK+L ++M+
Sbjct: 1075 RMAVRVRLGEKKLLQEAFDHFSEMV 1099
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 9 DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 456
K V R G + VF +HA E + +L +LR+ +++ + +I +
Sbjct: 69 MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127
Query: 457 LG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 511
LG PVS E +L + +AR L Y T+SED+A L +L +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183
Query: 512 LVRMEKKMLNACLQVTA 528
L EK++L ++ A
Sbjct: 184 LRLGEKEILEKAIKSAA 200
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 134/346 (38%), Gaps = 82/346 (23%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEK----HRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+ SW+ + G + + S + + HRP + AG+ +LV+T
Sbjct: 41 EFMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRP---------MCAGECMIKFSQNLVLT 91
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E+ L E IA L N+ + ++ L +M EK++G+ S W PYI L G+ +
Sbjct: 92 PEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYIECLP---SFGE--IH 142
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + W ELA L SP ER ++ EY E+ V S Y D+ +
Sbjct: 143 STIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----S 194
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLS--PTHNPKE 313
E FK + V S + G P T PL H+P+
Sbjct: 195 LEQFKHEYATVSS-----RAWGQGPH----------------SDMTMIPLVDFANHDPRS 233
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES-----IVVWCGP 368
+ + A D+ +V R Y+ G+ + + G
Sbjct: 234 RTLFSH-----------------------ADDNCTVVVASRDYQTGDENFHLKVHICYGD 270
Query: 369 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 413
N+ L ++YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 271 HSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 100/472 (21%), Positives = 180/472 (38%), Gaps = 79/472 (16%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLS 306
+ I + F + C S TR +
Sbjct: 230 NAVSA------ILRDVFTYEKYCWA------------------------VSTVMTRQNII 259
Query: 307 PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 366
P+ + + AL+P+ +SK A + + R +K E I +
Sbjct: 260 PSKDGSLMIH-----ALIPMWDMCNHENSKITTDFNATLNCCECYALRDFKKAEQIFISY 314
Query: 367 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 426
G + NS ++ GFV DN D + ++ DP +++ + + +V F + G
Sbjct: 315 GARTNSDFFVHSGFVYMDNEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG 374
Query: 427 REKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL 483
E ISD +L +LR+ + E+ I S + + + +E V + + + RL
Sbjct: 375 --TEPISDTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRL 431
Query: 484 ----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
A YP TL ED +L + L K++A QL EK++L L+ I
Sbjct: 432 QLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/301 (25%), Positives = 118/301 (39%), Gaps = 69/301 (22%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
AG+ LV+T E+ L E IA L N+ + ++ L +M EK++G+ S W PYI
Sbjct: 77 AGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYI 132
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L G+ + S + W ELA L SP ER ++ EY E+ V S
Sbjct: 133 ECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKV---VES 184
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTT 300
Y D+ + E FK + V S + G P
Sbjct: 185 CPHLYDPDV-----SLEQFKHEYATVSS-----RAWGQGPH----------------SDM 218
Query: 301 TRTPLS--PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 358
T PL H+P+ + + A D+ +V R Y+
Sbjct: 219 TMIPLVDFANHDPRSRTLFSH-----------------------ADDNCTVVVASRDYQT 255
Query: 359 GES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQR 412
G+ + + G N+ L ++YGFV DNP+D + + +EDP + K + +AQ
Sbjct: 256 GDENFHLKVHICYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQN 315
Query: 413 N 413
N
Sbjct: 316 N 316
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 96/437 (21%), Positives = 171/437 (39%), Gaps = 68/437 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
+ A +D+ A F+VP S +++ E + ET E+ + + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYE +G +S W PYI L + E+P+ WS+ EL L S T++++ + +
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKVGKAS--- 646
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QRCGCTPQMG 283
A +FQ +P +F + ++ L R G T
Sbjct: 647 -------------AEEMFQDKV--LPVIRANQHLFPTSQTYSDDDLIQLAHRMGST---- 687
Query: 284 TESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA 343
S+ Q +E S +VP+ +L ++ A +
Sbjct: 688 ---IMSYSFDFQNEDEEDEDETEEWVEEREAKST---MGMVPMA-DILNADAEYNAHVNY 740
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRL---------VVE 393
DDA+ + R KAGE I + GP PNS+LL YG+V + + YD + V
Sbjct: 741 GDDALTVTALRTIKAGEEIFNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVA 800
Query: 394 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQ 451
A+L Q DK + +L E + R + + +++
Sbjct: 801 ASLGLSSEQL-DKARECLDSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLK 859
Query: 452 SVISSLGPICPVSPCMERA-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 504
S++ ++ P S +R +L + D +R YP ++SEDE +L ++
Sbjct: 860 SLLKAIRKAVPSSVVDKRKRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISE 916
Query: 505 KKRVATQLVRMEKKMLN 521
++R A + EK+++
Sbjct: 917 RRRAAVTVRLGEKRLIQ 933
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 70/315 (22%), Positives = 129/315 (40%), Gaps = 48/315 (15%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE W+H+NG C+ + + R + A++ L+ + +P L++
Sbjct: 18 EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71
Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
T++ ++ + + + +L+ LA++LM EK + +KSFW PYI L +
Sbjct: 72 TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
P ++E + L S + + + +EY EL + M LF +A
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLFPD-----QEDA 178
Query: 254 FTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE 313
F F+ FK A+ A+++ + + +G ES HL + + T T PL + N
Sbjct: 179 FNFKDFKWAWSAIKTRALDV-------PIGRES-CRHLRDAEDTPTPTMFPLVDSINHAA 230
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
+ + R Y+ K + + + + Y+ ++ G N
Sbjct: 231 QAKIRHR------------YNEKSRCLESRTETV--------YRRHAEVMNSYGRADNDN 270
Query: 374 LLINYGFVDEDNPYD 388
LL+ +GFV NP D
Sbjct: 271 LLLEFGFVVPGNPED 285
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 103/445 (23%), Positives = 171/445 (38%), Gaps = 90/445 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE----LACLALYLMYEK 168
V A+ D+ + FS+P + V L + +A L + +L E L LM E
Sbjct: 42 VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
Q + S W PY+ L + ++S + WS++ELA L S +I ++
Sbjct: 96 -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147
Query: 222 ----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCG 277
+G+K E+ S+ Y +DIP +
Sbjct: 148 YIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPS------------------------ 180
Query: 278 CTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKC 337
P G + + + +T LS ++PL L A + +
Sbjct: 181 DAPTSGGKGGEAGDDLVSDDGEDEKTILS----------------MIPLADMLNADADRN 224
Query: 338 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYD------RL 390
A L ++ +++ +P GE I G P S LL YG+V D + YD L
Sbjct: 225 NARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSDLLRRYGYVTDGYSAYDVAEISAEL 284
Query: 391 VVEAALN----------TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 440
+V N T+D + + +A+R G VH+ ++ +I D L
Sbjct: 285 IVSLFRNGKVHPSLHKLTQDG-LKTRLELAEREGVYEDSFDLVHSSPDEPSIPDELLAFL 343
Query: 441 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
+ D S ++++++S I S LA +AR Y TL EDE +L +
Sbjct: 344 YLLLVDESHLKAILNSESSIPSRSKLTTELAGQVLATLLQAREKEYSTTLEEDEDLLKNA 403
Query: 501 NLHPKKRVATQLVRMEKKMLNACLQ 525
+L + +A Q+ EKK+L A +Q
Sbjct: 404 DLPVRHAMAIQVRSGEKKVLRAAMQ 428
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 98/459 (21%), Positives = 174/459 (37%), Gaps = 60/459 (13%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
D D+ W NG KV+L++ + + PI + A ED++AG+ S+P +L+
Sbjct: 25 DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
RV + + ++ + + + LYL+ E+ S W P+++ L +
Sbjct: 78 TRVSDHSPVVHMIENTTIGRITAICLYLISERADS-SSHWKPWLQSLPPRFFHA------ 130
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIP 250
L +SE ++ + S K + + +++EY + F P ++
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVT 189
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
E FT+E F+ A+ S TTR
Sbjct: 190 REDFTYEAFEWAY---------------------------------SVVTTRGIFPGLLG 216
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 370
+E+ L PL + +S K A + I + G
Sbjct: 217 EEEREGEVPLLVLGPLADSFIHGASGVKISYDAQEHRCVFSALHKVAKNSPISIGVGMSS 276
Query: 371 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 430
N +LL N GF+ ++N + ++++ L+ + R + LS + +V R E
Sbjct: 277 NMELLANRGFMMQNNGNNFVLMKFQLDRNSDMHASARESMMKQLNLSNPMTYV--VRYGE 334
Query: 431 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 490
+L LR+ +S E S +L PV+ E L + LA YP T+
Sbjct: 335 MPQGLLASLRIQSLSPV-EFGSYGKALA--TPVTLENEWRAYRLLISSCNSILAMYPTTI 391
Query: 491 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 529
EDE +LT R A L+R E+K++ ++ A+
Sbjct: 392 EEDEIVLTQTKTSRHLRAAV-LLRREEKLIYESIKTWAN 429
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 121/327 (37%), Gaps = 82/327 (25%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
+E + K E +L SW+ N + LK +HN + + +QA +
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
S+P V+TL+ V + ++L T K ++L A AL+L+ E+K+ S + YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKIL-TEKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P L+SE EL YL G+ + E+ E IK +Y + V
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV------ 297
Query: 241 LFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
IP F+FE F+ AF+ S V +
Sbjct: 298 --------IPEFKSEFSFEQFRWAFLCSHSRVFGI------------------------- 324
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 358
KV + +VPL L S + DDA + K
Sbjct: 325 ---------------KVKGVKTSVMVPLADMLNHKHSGQEDSEWVFDDATNCFTVKALKK 369
Query: 359 ---GESIVVWCGPQPNSKLLINYGFVD 382
+ I G + NSKL +NYGFVD
Sbjct: 370 IQRNQQIHFSYGSKCNSKLFLNYGFVD 396
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 135/329 (41%), Gaps = 63/329 (19%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 94 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 201
Query: 253 AFTFEIFKQAFVAVQSCVVHLQRC-GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNP 311
F + +C + R G T + FA + H S +
Sbjct: 202 EF-----------MHACALVGSRAWGSTKGLSLIPFADFVNHDGFSDSV----------- 239
Query: 312 KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 371
LL K L+ ++++ DR Y GE +++ G PN
Sbjct: 240 ------------------LLGDEDK---QLSESSSTLEVIADRNYAPGEQVLIRYGKFPN 278
Query: 372 SKLLINYGFVDEDNPYDRLVVEAALNTED 400
+ LL+++GF N YD++ ++ + D
Sbjct: 279 ATLLLDFGFTLPYNIYDQVQIQVNIPHHD 307
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 94/415 (22%), Positives = 165/415 (39%), Gaps = 67/415 (16%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
+L G+ VP L + + V + + L ++L ++ E +G S W
Sbjct: 52 NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE----- 159
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA----SHLVH 293
AG + D+ TF+ F AF ++S V P++ + A + L++
Sbjct: 160 AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVF--------PELRGDKLALIPFADLIN 209
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
T+ + + K K L D L
Sbjct: 210 HDGDITSKES----------------------------CWEIKGKGFLGR-DTVFSLRTP 240
Query: 354 RPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 412
K+GE I V + + N++L ++YGF + ++ D + ++ DP Y+DK +A+
Sbjct: 241 VDVKSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAEL 300
Query: 413 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSP 465
NG F V G + M+ YLRL + T V L PVS
Sbjct: 301 NGMGETAYFDVVLG--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSR 356
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
E ++ + + K+ LA Y T+ EDE +L +L + ++A ++ EKK+L
Sbjct: 357 DNEESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 97/462 (20%), Positives = 184/462 (39%), Gaps = 78/462 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A +D+ A F++P ++ E + I ++ +K L + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE QG S W Y L + ++P+ WSE EL L S + +I +A+
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKI-GKAD-- 687
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
A +F++ +P IF + V R G T
Sbjct: 688 -------------AEDMFKKTL--VPIIRSNPSIFNAENRSDYELVEIAHRMGST----- 727
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
++ L+ + + K + +VP+ +L ++ A +
Sbjct: 728 --IMAYAFDLENDEEEEEETEEWVEDREGKSMMG----MVPMAD-ILNADAEFNAHVNHE 780
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL 396
++++ + RP KAGE I+ + GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 781 EESLTVTSLRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVL 838
Query: 397 NT----------------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 440
E+ +++D ++ + G+++ + + D+ L+
Sbjct: 839 TNFGISSKILEQIRGEFEEEEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK 898
Query: 441 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
++ ++Q S P + +AVL + + AR YP ++SED+ +L
Sbjct: 899 -SFLKGIKKLQ---SDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQ 951
Query: 501 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 542
NL + R+AT + EKK+L + ++ + + D P
Sbjct: 952 NLDQRARMATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 993
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/418 (20%), Positives = 172/418 (41%), Gaps = 61/418 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
+ A+ +L+ G+ VP ++T E ++ + +++ + N LS L++ L+YE +
Sbjct: 51 LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA--EILERAEGIKREY 228
KKSFW PY+ + R + +L G+ K ++ + ++
Sbjct: 111 EKKSFWYPYLFHIPR----------------DYDLLATFGNFEKQALQVEDAVWATEKAT 154
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+ + W AGSL ++ +F ++ A ++ ++ V GC +G
Sbjct: 155 AKCQSEWKEAGSLMKELELKPKFRSFQAWLWASATISSRTLHVPWDSAGCLCPVG----- 209
Query: 289 SHLVHLQTSCTTTRTPLSP-THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-D 346
L + + TP P + N E+ L + +S + D +
Sbjct: 210 -DLFNYDAPGDYSNTPQGPESANNVEEAGLV-----------VETHSERLTDGGFEEDVN 257
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQ 404
A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 258 AYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWP 317
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 464
+ ++GKLS ++ LRL + + +SV+ + +S
Sbjct: 318 KDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQIS 362
Query: 465 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 522
E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 363 VKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 137/338 (40%), Gaps = 66/338 (19%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
G L W ++G P ++ L ++ + P +A+E LQ G AA S P + +
Sbjct: 10 FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65
Query: 135 TLERVLGNETIAELLTTNKLSELACL------------------------ALYLMYEKKQ 170
+ L + + +++ ++ A L +L+ E +
Sbjct: 66 SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
GK SFW PYI L Q+AV + P W + ++AYL G+ I E E +KRE+
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182
Query: 230 ELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
+ A L ++ + D+P A+T ++K AF C+ + SF
Sbjct: 183 Q-------ARKLLKEEDFPDLP--AYTQLLYKWAF-----CI-----------FTSRSFR 217
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
LV L + + L P + S+ + + P Y+ +V D
Sbjct: 218 PSLV-LSDATKRRLSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDT----TSVPDTC 272
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 386
+L+ PY+ G + G + NS+LL+ YGF+ + P
Sbjct: 273 RLICHDPYQPGTQVYNNYGLKTNSELLLAYGFILPETP 310
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 119/312 (38%), Gaps = 57/312 (18%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
EE+ L W +NG+ V K +P+ + R + A E + + + SVP L+
Sbjct: 47 EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99
Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T LE +GN + K S L L+LMYEK K SFW PYIR L
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
+P ++ EL +L + + E+ IK+ Y + + Q + +
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF 208
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTH 309
FE FK A+ V + V+ Q + +C PL
Sbjct: 209 -WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQ-----------PIDGNCAL--APLLDLL 254
Query: 310 NPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 369
N +K + RF SSK ++ V Y+ G + + GP
Sbjct: 255 NHCDKAEMCGRF----------NSSSK----------NYEINVITEYQKGTQVFINYGPH 294
Query: 370 PNSKLLINYGFV 381
N++L + YGFV
Sbjct: 295 DNTRLFLEYGFV 306
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 99/445 (22%), Positives = 177/445 (39%), Gaps = 65/445 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ E + + +L + L + L L +M
Sbjct: 43 ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
YE G +S W PYI L L ++P+ WSE EL+YL S T +I +A+
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKI-GKAD--- 151
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
A +F+ IP FV+ C G +MG+
Sbjct: 152 ------------AEEMFRTRL--IP----AIRGNPSVFVSSGDCSDE-DLIGLAHRMGS- 191
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
+ ++ L+ +E S+ A+ +L ++ A + D
Sbjct: 192 TIMAYAFDLENEEAENDEESDGWVEDREGKSMMGMVAMA----DILNADAEFNAHVNHGD 247
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALN 397
+ + + R KAGE I+ + GP PNS+LL YG++ E + P+D V+ +L
Sbjct: 248 EELTVTSIRDIKAGEEILNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLM 305
Query: 398 TEDPQYQD------KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-M 450
+E QD +RM ++ + V +G + P + G D E +
Sbjct: 306 SELGVPQDIMAETMERM--DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQL 363
Query: 451 QSVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 506
++ I L G + + + + + + + Y T++EDE +L +L ++
Sbjct: 364 KATIKLLQKLDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQ 423
Query: 507 RVATQLVRMEKKMLNACLQVTADMI 531
R+A Q+ EKK+L ++M+
Sbjct: 424 RMAVQVRLGEKKLLQEACDHFSEMV 448
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 103/460 (22%), Positives = 179/460 (38%), Gaps = 66/460 (14%)
Query: 83 SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
+W+ NG K+ L + N + A EDL + FSVP S ++T E
Sbjct: 16 NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E + + + LS + +AL + G +S W PY L ++ ++ + WS
Sbjct: 72 GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ EL +L GS +I + A A + F++ IP E +
Sbjct: 121 DRELRHLEGSTVVGKIGKEA----------------ADATFREQL--IP----VIERISK 158
Query: 262 AFVAVQSCVVHL-QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARR 320
A A ++ + R G T ++ L+TS + + +
Sbjct: 159 AKAADNEELLRMCHRMGST-------IMAYGFDLETSSDQAKNDGEEWEEDSDAGETLPK 211
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
+VPL L A + + A L DD V + +P KAGE + G P + LL YG+
Sbjct: 212 -GMVPLADMLNADADRNNAKLFYEDDKVVMKTIKPVKAGEELYNDFGSLPRADLLRRYGY 270
Query: 381 VDEDN---------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 431
+ DN P D + A L T+D ++ A+ G L A E+
Sbjct: 271 L-TDNYAQYDVVEIPADLIKERAGLRTQD--VDERWQYAEEQGVLDDGYDVSRASSEEGQ 327
Query: 432 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 491
+ L L +E + V + P + +L + Y R A YP +
Sbjct: 328 FPEELCVLLNLLALPRAEFEKVKNKDKIPKPDLTTNAKKLLRTILVY---RYAAYPGNVD 384
Query: 492 EDEAMLTDYNLHPKKR-VATQLVRMEKKMLNACLQVTADM 530
+ M +D +L+ ++R +A +++ EK++L + +++
Sbjct: 385 Q---MHSDVSLNDRRRKMAIVVIQGEKQVLQEAVDAISEI 421
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 106/243 (43%), Gaps = 44/243 (18%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGC 278
ER + + +Y D+ + G+ +P E +T+E++ A + S
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITS---------- 201
Query: 279 TPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCK 338
+F++ ++ + P P++ VS AL+PL L + K
Sbjct: 202 ------RAFSAKVL---------SGAVQPGDLPEDGVS-----ALLPL-IDLPNHRPMAK 240
Query: 339 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 398
A D + L+V + AG+ I GP+ N +LLINYGF NP D +V +
Sbjct: 241 VEWRAGDKDIGLLVLEDHSAGQEISNNYGPRNNEQLLINYGFCIAGNPTDYRIVHLGVKP 300
Query: 399 EDP 401
+ P
Sbjct: 301 DSP 303
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/329 (22%), Positives = 123/329 (37%), Gaps = 61/329 (18%)
Query: 81 LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L W H +G P ++ EK + + +P +A+E L + A + P S+ ++
Sbjct: 12 LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67
Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
L + I LT N L YL+ + +GK S W PYI
Sbjct: 68 NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
L + A+ P W+E ++ L G+ I E + +K EY + + GS
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185
Query: 241 --LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
+ Q Y+ FT F+ + + +S +++R
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER----------------------- 222
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 358
L P + S+ + + P YS L + A +L+ Y+
Sbjct: 223 ------LLPEGAKIDDFSILQPLYDIGNHSPEAEYSWN----LTSEPSACELICRNSYEP 272
Query: 359 GESIVVWCGPQPNSKLLINYGFVDEDNPY 387
G+ + G + NS+LL+ YGFV E+N Y
Sbjct: 273 GQQVFNNYGKKTNSELLLGYGFVTENNDY 301
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/183 (28%), Positives = 83/183 (45%), Gaps = 14/183 (7%)
Query: 354 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
R A + +VVW G +PN +LL+ G + ++N D L A L D Y
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 464
K V + G + + F V+A R +L YLRL V+D + + + +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280
Query: 465 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
E +L L + RLA Y + ED + +L PK+R+A +L EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340
Query: 525 QVT 527
+
Sbjct: 341 EAV 343
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
+VP+ +L ++ A + +D + +V RP KAGE I+ + GP PNS+LL YG+
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129
Query: 381 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 416
V + P+D +V++ L + DP+ ++D ++ + +G+
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187
Query: 417 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
+ + +E +++ L+ + ++++ G + P + +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
+ LA YP T EDEA+L NL ++R+A ++ EK++L LQ+
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQMDG 292
>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 161/403 (39%), Gaps = 56/403 (13%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L +M+E+ +G +S W Y+ L + ++P+ W E +L L G+ + +
Sbjct: 88 LILCMMWEESRGPESKWSGYLAALPEK-------FDTPMFWPEDDLKELQGTAV-VDKIG 139
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC--G 277
RA+ +R+Y+E + S +P D F+ E + + S H++R G
Sbjct: 140 RADA-ERDYHE--KLIPAVKSRPDLFPEDKLERYFSLERYHVNGSRILSRSFHVERWKGG 196
Query: 278 CTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR----------------RF 321
T G E + + + E+V +
Sbjct: 197 HTEDQGVEDDEADGNGMDVDPQEPPVDTEQGGDATEQVEEVQLEGDEELGDEDQEDPADV 256
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
A+VP+ L A A L + +++V +P +AGE I G PNS LL YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316
Query: 382 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 427
D NP D + V A L ++ +Y + V + VF +
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374
Query: 428 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 487
+ E +++ + RL ++S+ +++ S P V P VL D ARL YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430
Query: 488 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 525
++ EDE +L+ +L+ K V +L EK++L L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 102/247 (41%), Gaps = 52/247 (21%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
ER + +++Y+ +LD + F E +T+E++ A + S
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD------------CEQYTWELYLWASTIITSRAF--- 204
Query: 275 RCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYS 334
S + P P++ VS AL+PL L +
Sbjct: 205 ----------------------SAKVLSGAVQPDDLPEDGVS-----ALLPL-IDLPNHR 236
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 394
K A D+ + L+V + AG+ I GP+ N +LLINYGF NP D +V
Sbjct: 237 PMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNEQLLINYGFCIAGNPTDYRIVLL 296
Query: 395 ALNTEDP 401
+ + P
Sbjct: 297 GVKPDSP 303
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/314 (21%), Positives = 114/314 (36%), Gaps = 66/314 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +DL+ + +P V++ + +IA +L +L E ++ LMYE +G
Sbjct: 44 IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ + ++ P+LW + + YL G+ + ++E E ++ +Y E
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---------------------- 270
+ + +P FT E FK A V S
Sbjct: 151 ----DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGESLVPLADIFNHKT 206
Query: 271 ----VHLQRCG-CTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE---KVSLARRFA 322
VH++ G Q G H P P++ K + R
Sbjct: 207 GRENVHVEADGNVCKQCGELDGCEHKKKKGGKKVVKGAPSLKKATPQDIEKKTTFKDRIE 266
Query: 323 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
L+P D++ + + +P + G NS LL YGF++
Sbjct: 267 LLP-------------------KDSLFITIVKPVNKDCEVFNTYGDHDNSLLLSKYGFLE 307
Query: 383 EDNPYDRLVVEAAL 396
DNP D L ++ L
Sbjct: 308 MDNPCDLLRIDRQL 321
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 111/440 (25%), Positives = 179/440 (40%), Gaps = 75/440 (17%)
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
+ RP+ + AS D Q GD FSVP+S ++ E V + +L L +AL L
Sbjct: 65 QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ ++ KS Y + +++PLLWSE EL L G+ ++L+ G
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
+ T + LF P P FT F A AV+S
Sbjct: 171 LTFFRS--TFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRS---------------- 212
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL---LAYSSKCKAML 341
SH P PK AL PL + A +SK
Sbjct: 213 ---RSH---------------PPLDGPK--------IALAPLTELVSHRRAANSKLSVRS 246
Query: 342 AAVDDAVQLVV---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALN 397
A + Q++V R + GE + + GP + + +L++YG +D +P + +
Sbjct: 247 AGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMP 306
Query: 398 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVIS 455
D DK + + N V+++ +++ +ML +LRL + SD ++S+
Sbjct: 307 DSDRFIDDKLDILESNDLPQSVVYNLTP--DEQPTIEMLAFLRLMQLKGSDAFLLESIFR 364
Query: 456 S--LGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK--KRVAT 510
+ G + PVS E AV + L++ +A L GY T+ +D A L K +R A
Sbjct: 365 NDVWGFMQEPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAELRAQGSRAKGSRREAA 424
Query: 511 QLVRM-EKKMLNACLQVTAD 529
L+R+ EK+ L+A + D
Sbjct: 425 LLIRLGEKEALDAVARFFED 444
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 132/325 (40%), Gaps = 57/325 (17%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+E L WM ++G CK + ++ R + ++L+ GD+ +P L+V
Sbjct: 40 DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95
Query: 135 TLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + +L E + ++ K + + +L+ E+ +GK SFW PYI L +
Sbjct: 96 TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+P S T+ A PT A R+ I + ++ + A LF+ P
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIA----RSRAIN-QLQDIRAAFESASCLFEDIERTFPQY 201
Query: 253 A--FTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
F+ + F A+ + S V+++ GC E+F +
Sbjct: 202 RIFFSLDSFVWAWFVINSRSVYIEPSGC------EAF----------------------D 233
Query: 311 PKEKVSLARRFALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 368
PK + FAL P LL +S ++ A V + ++ Y A + + + GP
Sbjct: 234 PKA----SDDFALAPF-LDLLNHSPGAEVTAGFDPVSNCYRIKTLDSYHAYDQVFIHYGP 288
Query: 369 QPNSKLLINYGFVDEDNPYDRLVVE 393
N LL+ YGFV NP+D + E
Sbjct: 289 HDNVNLLLEYGFVIPSNPHDAVSFE 313
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 72/155 (46%), Gaps = 15/155 (9%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDK-------RGLFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W + EL + S E + R I+ E+NE+ +V
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSV 178
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 33/58 (56%)
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
QL D+ Y AGE + + GP N+ L +++GF NP+D++ + ++ D ++K
Sbjct: 223 QLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 110/484 (22%), Positives = 178/484 (36%), Gaps = 63/484 (13%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
+ L KV + E P H A +D+ F++P L ++
Sbjct: 15 LQHGNLDTEKVGIVEFPEHGRG------AIALQDIPEDYTLFTIPRELTLSTRTCSLPTL 68
Query: 145 IAELLTTNKLSE-LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
+ + + L E A L L +++E+ +G S W Y+ L + ++P+ W +
Sbjct: 69 MGQAWKEHGLHEGWAGLILCMIWEESRGSDSKWSGYLATLPS-------SFDTPMFWGQE 121
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNEL------DTVWFMAGSLFQQYPYD--------I 249
+L L G+ + + R E + + +L T F A L Q + + I
Sbjct: 122 DLNELQGTAV-VDKVGRDEAERDYHGKLVPAITGRTDLFPADKLSQYFSLERYHLMGSRI 180
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTH 309
+ +FT E +K T M +S + TT P+ +
Sbjct: 181 LSRSFTVEQWKGEHHDGGDEHDGDDEAADTSAMDVDS--QEAPSTDAAETTEPAPVDGSD 238
Query: 310 NPKE-------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 356
P+E A+VP+ L A A L + +++V +P
Sbjct: 239 GPEELQIEGDGEDSDDEDHEDPGDVAMVPMADMLNARFESENAKLFYDERELKMVSTKPI 298
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVD----------EDNPYDRLVVEAALNTEDPQYQDK 406
KAGE I G PNS LL YG VD NP D + V A L + K
Sbjct: 299 KAGEQIWNTYGDPPNSDLLRRYGHVDLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVK 358
Query: 407 RMVAQR-NGKLSVQVFHVHAGREK-EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 464
+ +R + L V R E +++ ++RL + ++ S P
Sbjct: 359 HDLKERVDWWLEEADDDVFVLRTDCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----K 414
Query: 465 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLN 521
P +++ VL D + RL YP TL EDEA+ L KR A + EK++L
Sbjct: 415 PKLDKDVLTIAVDVLEKRLKDYPTTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILR 474
Query: 522 ACLQ 525
L+
Sbjct: 475 GTLK 478
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 96/462 (20%), Positives = 181/462 (39%), Gaps = 78/462 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A D+ A F++P + +E + I ++ +K L + L L +
Sbjct: 43 IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE +G S W Y L + ++P+ WSE EL L S + +I +A+
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKI-GKAD-- 152
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
A ++F++ +P IF + V R G T
Sbjct: 153 -------------AENMFKKTL--VPIIRSNPSIFNAENRSDSELVEIAHRMGST----- 192
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
++ L+ + K + +VP+ +L ++ A +
Sbjct: 193 --IMAYAFDLENDEEEEEETEEWVEDRDGKSMMG----MVPMAD-ILNADAEFNAHVNHE 245
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL 396
++++ + RP KAGE I+ + GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 246 EESLTVTSLRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVL 303
Query: 397 NT----------------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 440
+ +++D ++ + G+++ + + D+ L+
Sbjct: 304 TNFGISSKILKQIRGEFEGEEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK 363
Query: 441 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
S ++ V S P + +AVL + + AR YP ++SED+ +L
Sbjct: 364 ----SFLKGIKKVQSDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQ 416
Query: 501 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 542
NL + R+AT + EKK+L + ++ + + D P
Sbjct: 417 NLDQRARMATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 458
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/412 (19%), Positives = 162/412 (39%), Gaps = 65/412 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +Q G+ +P +L+ L++ L E +E L+ L+E LA+ + E+ G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W Y L +++ + W +++ +L GS E IK ++ ++
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
F L P + F + ++ A + S + LQ + F +H
Sbjct: 210 IFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNMKKIALVPYADFINH---- 259
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
P S ++ +K++ + ++ + + D+
Sbjct: 260 --------NPFSTSYINSKKIAFSE-------------------------NNEIVMYADK 286
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQ 411
Y + I G + N +LL+ YGF+ E NP+D + + AL+T+D Y K +
Sbjct: 287 DYNKFDQIFTTYGQKTNLELLVLYGFIIERNPFDSIELRVALSTKDELYNKKEKFINDCE 346
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
+ +++ VF+ +E + ++RL S+L + +E+ +
Sbjct: 347 KTEQITFPVFYYKYPKE------LYEFMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKII 400
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTDYN---LHPKKRVATQLVRMEKKML 520
+ K L Y T++E++ + N L ++ + + + EKK+L
Sbjct: 401 RKTVIFACKKNLKAYNKTINEEKILNNLSNIIVLTKNQKTSIKQRKCEKKIL 452
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 103/455 (22%), Positives = 180/455 (39%), Gaps = 68/455 (14%)
Query: 111 HYVAASEDLQAGDAAFSVP-----NSLVVTLERVLGNETIAELLTTNKLSELAC----LA 161
H + A+ED++AG FS+P NS ++T+ G L T+ +++ L
Sbjct: 37 HGLIATEDIEAGTILFSIPRPPVSNSPLLTI----GTSDFLSKLGTSDAEKISRNWIPLL 92
Query: 162 LYLMYEKKQGKKSF------WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
+ +M+E+ +G W PY + + ++ + WS+ EL L S
Sbjct: 93 MTMMWERARGYDQSVPSHMSWRPYFEMMPTE-------FDTLMFWSDDELKELQASTVLG 145
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
+ E AE +Y++L + S P + +T++ F + + L R
Sbjct: 146 K--EEAEA---DYHQLVAPLIRSRSDLFPIPTSNQGKVWTWDDFYGLQIYHLMGSLALSR 200
Query: 276 ---CGCTPQMGTESFASHLVHLQT--------SCTTTRTPLSPTHNPKEKVSL--ARRFA 322
P+ + ++ + T S T +E+V L + A
Sbjct: 201 SFEVDVVPETDNHDEEEEVDDDKSIEDQDENGDISMTAASASSTGIIEEEVGLNTSEGVA 260
Query: 323 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
+VPL L A S A L + + + + GE I PN+ LL YG VD
Sbjct: 261 MVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHVD 320
Query: 383 EDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV--------H 424
++NP+D RL E +L+ DPQ Q+ + K +++V +
Sbjct: 321 DENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPTK 379
Query: 425 AGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 483
+ RE KE + D L + +S E Q+ S G + P M + RL
Sbjct: 380 SQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNRL 436
Query: 484 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 518
Y T+ D+ +L D +L ++++ + VR+ +K
Sbjct: 437 NQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 455
K V R G + VF +H+ E + +L +LR+ +++ S + + +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 335 EKEILEKAVKSAA 347
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 382 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 440
DN D + ++ D +++ + + SV F + G E ISD +L +LR
Sbjct: 78 CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 441 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 494
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 495 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
+L + L K++A QL EK++L+ L+ I
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALEYVEQWI 230
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 125/318 (39%), Gaps = 61/318 (19%)
Query: 81 LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L +WM NG K+ L++ P V A E L G+ +P SL+++
Sbjct: 31 LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84
Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L +++ T KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 85 L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ T L + E+ R I+R + +L + + G + ++ + FT+
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTW 192
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
F A+ AV T C + S + + +
Sbjct: 193 NNFVWAWTAV----------------------------NTRCIFAQGSNSSSLWENDHCA 224
Query: 317 LARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 374
LA P L + KA + A V + +++ + + A E + + GP N +L
Sbjct: 225 LA---------PFLDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISYGPHSNRRL 275
Query: 375 LINYGFVDEDNPYDRLVV 392
++YGFV DNP D +VV
Sbjct: 276 FLDYGFVLPDNPNDVVVV 293
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 111/447 (24%), Positives = 177/447 (39%), Gaps = 63/447 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
A D+ G+ F++P SL+++ L +LG E N L + A L L +M+E+
Sbjct: 847 ALRDIYEGEKLFTIPRSLLLSTRTSSLPFLLGEEDW------NALGDGWAGLILCMMWEE 900
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++S W Y+ + + + + W++ EL L GS +I RA G +++Y
Sbjct: 901 ARAEESPWRGYLESMPTE-------FSTLMFWTDEELGLLKGSLVLDKI-GRA-GAEKDY 951
Query: 229 NE------LDTVWFMAGSLFQ-QYP---YDIPTEAFTFEIFKQAF-VAVQSCVVHLQRCG 277
NE A SLFQ +Y Y I I ++F V+ S V
Sbjct: 952 NEKVLPLLQKRTDLFAPSLFQTRYTLQNYHIQGS----RILSRSFTVSPWSGAVPENDED 1007
Query: 278 CTPQMGTESFASHLVHL--------QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPP 329
P++ S AS L S E+ A+VP+
Sbjct: 1008 EAPELVDTSMASAGDELMGEVNVLGDDSEGNIDMGDMDEDEEDEEREKTEDVAMVPMADM 1067
Query: 330 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE------ 383
L A A L D +Q++ +P GE I G PNS LL YG+VD
Sbjct: 1068 LNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYVDALTLPDG 1127
Query: 384 -DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 439
+P D + + A E + YQD+ G V V ++ D + L
Sbjct: 1128 VGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYSVPDEMLSL 1182
Query: 440 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 499
+ + + + S P P ++ + L + RLA YP +L+E E ML
Sbjct: 1183 VRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLTEQEGMLRS 1239
Query: 500 YN-LHPKKRVATQLVRMEKKMLNACLQ 525
N L+ K+R A + E+++L+ L+
Sbjct: 1240 SNELNEKRRNALIVTTGEQRILHKTLE 1266
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 22/199 (11%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+D L W+ +NG ++L + P V ++ +L+ GD S+P +L++T
Sbjct: 4 DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57
Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
V+ N + + + T K L+ L+LYL+ EK +GK SFW PYI+ L +
Sbjct: 58 TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTS 109
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+P +S E+ L +A + R + ++ Y L T LF +
Sbjct: 110 YTTPSYFSTAEVDALPALVREATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTV 163
Query: 254 FTFEIFKQAFVAVQSCVVH 272
FT + ++ A+ V + V+
Sbjct: 164 FTLKSYRWAWATVYTRSVY 182
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 455
K V R G + VF +H+ E + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 456
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224
Query: 457 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 285 EKEILEKAVKSAA 297
>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 513
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 108/445 (24%), Positives = 173/445 (38%), Gaps = 53/445 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A +D+ G FSVP SL ++ L +++G E + L NK L L +M+E+
Sbjct: 34 ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ S W Y L R A ++P+ W+ EL L G+ +I + E +R+Y
Sbjct: 91 RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141
Query: 230 E-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCG-CTPQMGTESF 287
E L+ LF P I + ++ E + + S H+++ TP G++S
Sbjct: 142 EILNPAVRTRPDLFD--PGHIAS-FYSLENYHVMGSRILSRSFHVEKWKEQTP--GSQSR 196
Query: 288 ASHLVHLQTSCTTTRTPLS----------PTHNPKEKVSLARRFALVPLGPPLLAYSSKC 337
AS +H C S + ++ A+VP+ L A
Sbjct: 197 ASSELHENGDCMDIDDESSNLSAVGAENGGDDDSDDEAENPSDIAMVPMADMLNAQYGSE 256
Query: 338 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD----------EDNPY 387
A L + +V +P + GE I G PNS LL YG VD E NP
Sbjct: 257 NAKLFYEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGHVDLVPLPGVPWKEGNPA 316
Query: 388 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS------DMLPYLRL 441
D + + A L R+ A+ + K + + G + + D++
Sbjct: 317 DVVEIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVFVLGTDLELPDVMISFLK 375
Query: 442 GYVSDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 500
+ E + S P P + + + + RLA YP TL DEA+L+
Sbjct: 376 LLLLSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRLAKYPTTLEHDEALLSGQ 435
Query: 501 NLHPKKRVATQLVRM-EKKMLNACL 524
P +VR+ EK +L C+
Sbjct: 436 TSLPYNVRNAIIVRIGEKHILVGCM 460
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 107/296 (36%), Gaps = 82/296 (27%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ G+ SVP SLVV + R+L E N L
Sbjct: 59 VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQRCGC 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV + G
Sbjct: 163 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGL 216
Query: 279 TPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCK 338
P A F + P + + K
Sbjct: 217 VP------------------------------------WADMFNMAP-------ETEQVK 233
Query: 339 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYDRLV 391
+ AVD + P K GE I V G N++LL++YGFV E+NP+D +V
Sbjct: 234 VAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHDAVV 289
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 100/467 (21%), Positives = 178/467 (38%), Gaps = 84/467 (17%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + ++ LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + K +S L L +L+ EK G++S W PY+ L + P
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCP 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + Y + SLF + I F +
Sbjct: 140 VC-LEQDVVQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYS 194
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
+ A+ + + ++++ H Q C + L P + +L
Sbjct: 195 ALEWAWCTINTRTIYMK------------------HSQRECFS----LEP-----DVYAL 227
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
A L+ P + + KA ++ + K E + + GP N +LL+
Sbjct: 228 APYLDLLNHSP-----NVQVKAAFNEQSRNYEIQTNSQCKKYEEVFICYGPHDNQRLLLE 282
Query: 378 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 437
YGFV DNP+ + V + + DK Q+N KLS+ H D+L
Sbjct: 283 YGFVAVDNPHSSVYVSSDTLLKYFPSLDK----QKNAKLSILKEH-----------DLLE 327
Query: 438 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM- 496
L G+ + + + + L C RA+ L D AR +E +A+
Sbjct: 328 NLTFGWDGPSWRLLTALKVLSLGGDEFTCWRRAL---LGDVISAR--------NEQQALN 376
Query: 497 ----LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 539
+ + + + V Q+ +M++ N Q+T + L D+ +
Sbjct: 377 ITTKICHFLIEETQHVLLQISQMKRDKENLKEQLTLVEALRLEDLKI 423
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 74/293 (25%), Positives = 106/293 (36%), Gaps = 82/293 (27%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ AG+ SVP SLVV + R+L E N L
Sbjct: 45 VVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENAL------ 98
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 99 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHYFARQR 148
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQRCGC 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV + G
Sbjct: 149 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKSGL 202
Query: 279 TPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCK 338
P A F + P + + K
Sbjct: 203 VP------------------------------------WADMFNMAP-------ETEQVK 219
Query: 339 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYD 388
+ AVD + P K GE I V G N++LL++YGFV E+NP+D
Sbjct: 220 VAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHD 272
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVIS 455
K V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 124/302 (41%), Gaps = 40/302 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + I L L + LMYE+ +GK
Sbjct: 34 VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L RQ P LWSE E+ L G+ + E +K ++ E
Sbjct: 90 GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 141
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIF--KQAFVAVQSCVVHLQRC-GCTPQMGTESFA 288
L ++ P + P + FTFE + ++ V+ +S + + G P
Sbjct: 142 -----NIAPLTKEDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHGYGMVP-------L 189
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
+ L + +T L+ + + + L L + C+ +++ +
Sbjct: 190 ADLFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLA---------NGDCRE-ISSDKSVL 239
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDK 406
++V+ + AG I G N+ LL YGF + +NP+D L ++ L ++Q K
Sbjct: 240 EMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKK 299
Query: 407 RM 408
R+
Sbjct: 300 RV 301
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 382 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 440
+N D + ++ D +++ + + G SV F + G E ISD +L +LR
Sbjct: 78 YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 441 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 494
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 495 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
+L + L K++A QL EK++L L+ I
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALEYVEQWI 230
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 100/242 (41%), Gaps = 16/242 (6%)
Query: 297 SCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 356
S TR L P+ + + AL+P+ + + A D + R +
Sbjct: 11 STVMTRQNLVPSPDGSRMIH-----ALIPMWDMCNHENGRITTDFNATSDRCECYALRNF 65
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 416
+ GE I + GP+ NS ++ GFV DN D + ++ D +++ + +
Sbjct: 66 QKGEQIFISYGPRTNSDFFVHSGFVYMDNEQDGFKLRLGISKADSLQKERTELLGKLDLP 125
Query: 417 SVQVFHVHAGREKEAISDM-LPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMER 469
SV F + G E ISDM L +LR+ + +E+ + S C + +E
Sbjct: 126 SVGEFLLKPG--TEPISDMLLAFLRV-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEE 182
Query: 470 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 529
V L + +A YP TL ED +L + L K++ QL EKK+L L+
Sbjct: 183 NVRKFLLTRLQLLIANYPTTLKEDLELL-ETTLPQVKKMTVQLRVTEKKILLGALEYVEQ 241
Query: 530 MI 531
I
Sbjct: 242 WI 243
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 102/424 (24%), Positives = 166/424 (39%), Gaps = 73/424 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELACLALYLMYEKKQG 171
+ A+ DL G+ VP L + + V ++ A L +AL L+ E +G
Sbjct: 83 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGGLRPWVAVALLLLSEVARG 142
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--N 229
S W PY+ L RQ S A L S + ++L G+K EY +
Sbjct: 143 ADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGVK-EYVQS 189
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA- 288
E D+V S + D+ + TF+ F AF ++S V P++ + A
Sbjct: 190 EFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVF--------PELRGDKLAL 237
Query: 289 ---SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
+ LV+ T+ + ++ K K + +
Sbjct: 238 VPFADLVNHSPDITSEGS----------------------------SWEIKGKGLFGR-E 268
Query: 346 DAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
L K+G+ I + + + N++L ++YGFV+ + D V ++ DP Y
Sbjct: 269 PMFSLRTPVDVKSGQQIYIQYDLDKSNAELALDYGFVESNPSRDSYTVTLEISESDPFYG 328
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSL 457
DK +A+ N F + ++ MLPYLRL + T SV L
Sbjct: 329 DKLDIAELNELGETAYFDIIL--DEPLPPQMLPYLRLLCIGGTDAFILEALFRNSVWGHL 386
Query: 458 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 517
P+SP E ++ + D K+ LA Y T+ EDE + NL P+ +A + EK
Sbjct: 387 E--LPLSPDNEESICQVMRDACKSALAAYHTTIEEDEELSERENLQPRLTIAIGVRAGEK 444
Query: 518 KMLN 521
K+L
Sbjct: 445 KVLQ 448
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 114/256 (44%), Gaps = 18/256 (7%)
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+SF S TR PT + +V+LA L+PL +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGS-RVTLA----LIPLWDMCNHTNGLITTGYNLE 400
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 401 DDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 460
Query: 405 DKRMVAQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV-------- 453
K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 461 MKAEVLARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDR 519
Query: 454 ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 512
I +LG PVS E + L D L Y T ED+A+L + +L + ++A +L
Sbjct: 520 IFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKL 579
Query: 513 VRMEKKMLNACLQVTA 528
EK++L +Q A
Sbjct: 580 RLGEKEILEKAVQSAA 595
>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
Length = 489
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 78/317 (24%), Positives = 131/317 (41%), Gaps = 33/317 (10%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
S +EL L GS I+E+ E + L+T+ + + +P ++ +
Sbjct: 129 SASELEELQGSA----IVEKIGKQGAEESILETIAPIVRANPALFPPIDGVASYDGDAGT 184
Query: 261 QAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARR 320
QA ++HL MG+ A + + E+ + +
Sbjct: 185 QA-------LLHLAHT-----MGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEEEEQSSK 232
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
+VPL L A + + A L ++ + + +P KAGE I G P S LL YG+
Sbjct: 233 -GMVPLADLLNADADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDYGEIPRSDLLRRYGY 291
Query: 381 VDEDNPYDRLVVEAALN 397
V DN VVE +L+
Sbjct: 292 V-TDNYAQYDVVELSLD 307
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 70/330 (21%), Positives = 136/330 (41%), Gaps = 72/330 (21%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 93 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 200
Query: 253 AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPK 312
F + +C + +G+ ++ S
Sbjct: 201 EF-----------MHACAL----------VGSRAWGS----------------------- 216
Query: 313 EKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQP 370
+ +L+P + + ++L +D +++ DR Y GE +++ G P
Sbjct: 217 -----TKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVLIRYGKFP 270
Query: 371 NSKLLINYGFVDEDNPYDRLVVEAALNTED 400
N+ LL+++GF N YD++ ++ + D
Sbjct: 271 NATLLLDFGFTLPYNIYDQVQIQVNIPHHD 300
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 72/316 (22%), Positives = 117/316 (37%), Gaps = 74/316 (23%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
+ A+E+++ G++ +P S ++T+ER + + L E + LA +L +
Sbjct: 24 LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
S + Y+R L R+ G L W E ++ L GSP++ +ER +
Sbjct: 82 AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
+E + +P P + AF + S ++ L G
Sbjct: 135 IDE----------IRASFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA-------- 170
Query: 288 ASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 347
ALVP +L + C A + A
Sbjct: 171 ---------------------------------LALVPWAD-MLNHRPGCDAYIDDTGGA 196
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQD 405
V L DR YK GE + GP+P+S+LLI+YGF +NP D V ++ D
Sbjct: 197 VCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGENPDDEFEVVLGIDPNDRHADA 256
Query: 406 KRMVAQRNGKLSVQVF 421
K +R G V+ F
Sbjct: 257 KADALRRIGLSPVEAF 272
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 86/171 (50%), Gaps = 9/171 (5%)
Query: 357 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 415
K+GE +++ + + N++L ++YGF+++ + + ++ DP + DK +A+ NG
Sbjct: 293 KSGEQVLIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGS 352
Query: 416 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERA 470
F + G MLPYLRL + T ++S+ + G + P+S E
Sbjct: 353 GETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEEL 410
Query: 471 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 521
+ + D K+ L+GY T+ EDE L +L+P+ +A + EKK+L
Sbjct: 411 ICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQ 460
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 138/344 (40%), Gaps = 63/344 (18%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK+ K LKEK + K RP + + ++ LQ G+ S+P ++T + V
Sbjct: 29 HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + T +S L L +L+ EK +KS W PY+ L + S
Sbjct: 89 L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + ++ + +F LF P D+ + F ++
Sbjct: 140 PVCLEQKIVNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQ 195
Query: 258 IFKQAFVAVQSCVV---HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
FK A+ + + V H QR + T + A +L L HNP +
Sbjct: 196 AFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLL-------------NHNPTVQ 242
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 374
V +KC + V + + E + + GP N +L
Sbjct: 243 VKAG------------FNEKTKCYEITT---------VTQCHHYNEVFICY-GPHDNQRL 280
Query: 375 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 418
L+ YGFV DNP+ + V ++ +DK QR KLS+
Sbjct: 281 LLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDK----QRPKKLSI 320
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 64/143 (44%), Gaps = 14/143 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
++ + D L +W+ + G V ++ P + A+ D+ GDAA V
Sbjct: 73 ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P +L +T E ++ I L LAL L++EK G S W YIR L R
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185
Query: 189 RGQLAVESPLLWSETELAYLTGS 211
A+++PL WS ELA L G+
Sbjct: 186 ----ALDAPLFWSSEELAELAGT 204
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 115/263 (43%), Gaps = 60/263 (22%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
T++ + L+L+L+ EK +GK SFW PYIR L + +P+ ++E+EL L+
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282
Query: 211 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S + E+A +K+E +N+L+ F+ L + FTF+ F+ A+ +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329
Query: 267 QSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPL 326
++ ++ + C R+P P+ LVP+
Sbjct: 330 KTRTLYQEDC-------------------------RSPYLSNKEPQTS-------TLVPM 357
Query: 327 GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 384
L+ +S KA ++ V PY+ + + + G + N++L++ +GF +
Sbjct: 358 -LDLINHSPSAKARFGYNVNTSCYEVRVLEPYRKYDQVFISYGFEENTELMLKFGFFVPE 416
Query: 385 NPYDRLVVEAALNTED-PQYQDK 406
NP D + + + E PQ D+
Sbjct: 417 NPKDFMKINLSEMLESLPQINDE 439
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/371 (21%), Positives = 145/371 (39%), Gaps = 79/371 (21%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+H++G V + E + + A++D G +VP ++++ E+
Sbjct: 88 SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140
Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
++E + + L + LAL+L+ EK SFW PYI L + + L
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT---- 251
++ ELA L SP L+ I R+Y N++ T+ D+P
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNL 239
Query: 252 -EAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
+ FTF+ ++ A V + R Q + + + L C ++ N
Sbjct: 240 QDIFTFDNYRWAVSTV------MTRQNNIVQGTAFTLTNAFIPLWDMCNHKHGKITTDFN 293
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 370
+ L R +C A+ + Y+ E I ++ G +P
Sbjct: 294 ----LELNR---------------GECYAL-------------QDYRRDEQIFIFYGARP 321
Query: 371 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 430
NS L ++ GFV DN YD L + ++ D K + + G V F ++ G
Sbjct: 322 NSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGASPI 381
Query: 431 AISDMLPYLRL 441
++ ++L ++R+
Sbjct: 382 SV-ELLAFIRI 391
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 133/321 (41%), Gaps = 48/321 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
V A D+ AG +VP +L++T E+ +ET +L+T+ L +EL+ L
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
++L+ E+++G S+W PYI +L QR R PL ++E EL L SP E +
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQ 281
+ R+Y + ++QY + A E+ + A Q + +R P
Sbjct: 282 RNVVRQY-----------AAWKQYLMMLEL-ARAAELPSGSGDAHQKILDQRRRAQAMPV 329
Query: 282 MGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVP--------LGPPLLAY 333
E L +S TR + + AL+P G Y
Sbjct: 330 RYNE-LTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGGASSFY 388
Query: 334 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
++ +++ + V P GE +++ G + + N FV D+P D+ ++
Sbjct: 389 DTQTRSL--------ECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLIL 440
Query: 394 AALNTEDPQYQDKRMVAQRNG 414
A+ +DP ++ K + Q G
Sbjct: 441 LAVGKQDPLFKSKSTILQALG 461
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 104/409 (25%), Positives = 163/409 (39%), Gaps = 40/409 (9%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 98 DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L V+ PL W E+E+ + L GSPT + + RA + EY +
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREEYASIRAAIA 210
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQT 296
+ Y+ TE F A V + V L LV L
Sbjct: 211 ADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANVY----------ALVPLVD 256
Query: 297 SCTTTRTPLSPTHNPKEKVSLARRFAL-VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 355
P P NP + A L +G +++C A+++A +DA Q P
Sbjct: 257 LLPVVGAP-PPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSA-NDARQTA---P 311
Query: 356 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG- 414
+++ G L ++ G V+ + D L + D Y K+ + + G
Sbjct: 312 VVCADALGRNAG-----DLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGY 366
Query: 415 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 474
Q F V A R + Y+R V + SE+ +V I VSP E +L
Sbjct: 367 SADAQAFPVFADRMP---LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQL 421
Query: 475 LADYFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 522
L + LA Y + E E + L + N+ ++ A +L EK+++NA
Sbjct: 422 LMGDAREMLAEYENSSEEFELLQLKETNISERQMTAAKLRLGEKRLINA 470
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)
Query: 356 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 415
Y+ GE I ++ G + N+ L++ GFV DN + + +LN + Q++ ++ + ++ G
Sbjct: 74 YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133
Query: 416 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 466
S F+V G IS ++L + R LG+ +QS + L P C P
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188
Query: 467 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 516
+E ++ +++ Y RL TL +DEA+L + L K + Q +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248
Query: 517 KKMLNACLQ 525
KK+L+ L+
Sbjct: 249 KKILSEALE 257
>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 492
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 125/297 (42%), Gaps = 50/297 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVAVQSCVVHLQRCGCTP 280
D++ M + + P P+ EA + + + A + + +
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAY-------- 201
Query: 281 QMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAM 340
+F V + T + +++ S +VPL L A + + A
Sbjct: 202 -----AFDIEKVEDEDDENNDEEDGYVTDDEQDQSSK----GMVPLADILNADADRNNAR 252
Query: 341 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
L DD++ + +P + GE I G P + LL YG+V DN VVE +L+
Sbjct: 253 LFQEDDSLVMKAIKPIRVGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD 308
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 125/297 (42%), Gaps = 50/297 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVAVQSCVVHLQRCGCTP 280
D++ M + + P P+ EA + + + A + + +
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAY-------- 201
Query: 281 QMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAM 340
+F V + T + +++ S +VPL L A + + A
Sbjct: 202 -----AFDIEKVEDEDDENNDEEDGYVTDDEQDQSSK----GMVPLADILNADADRNNAR 252
Query: 341 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
L DD++ + +P + GE I G P + LL YG+V DN VVE +L+
Sbjct: 253 LFQEDDSLVMKAIKPIRVGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD 308
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 94/235 (40%), Gaps = 34/235 (14%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L+++ L L L L+YE +G S W PY+ L E+P+ W+ EL L
Sbjct: 96 LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148
Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
T ++ RE E DT + + + +P ++F+ +
Sbjct: 149 QAGATAGKV-------GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDE 190
Query: 269 CVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGP 328
+V L +MG+ A ++ ++ +VP+
Sbjct: 191 ALVALAH-----RMGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAM---MGMVPMAD 242
Query: 329 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 383
+L ++ A + D+ + + RP KAGE I+ + GP PNS+LL YG+V E
Sbjct: 243 -ILNADAEFNAHVNHGDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 122/300 (40%), Gaps = 64/300 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
FQ P+ FT++ F A CT +G+ ++ S
Sbjct: 178 -------FQ--PFCQSFGDFTWKDFMHA---------------CT-LVGSRAWGS----- 207
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVV 352
+ +L+P L + +A++ + DD ++
Sbjct: 208 -----------------------TKGLSLIPFA-DFLNHDGISEAIVMSDDDNKCSEVFS 243
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 412
DR Y GE +++ G N+ L++++GF N YD++ ++ + DP K + Q+
Sbjct: 244 DRDYVPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303
>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
Length = 524
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 113/454 (24%), Positives = 183/454 (40%), Gaps = 68/454 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVV-TLERVLGNETIAELLTTNKLSE-LACLALYLMYEKKQGK 172
A +D+ G F++P L + T L KL E A L L +M+E QG
Sbjct: 66 AVKDIPEGHVLFTIPRDLTLSTRTSSLPLRFGMGAWKNAKLHEGWAGLILCMMWEAAQGS 125
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L + ++P+ W+E ELA L G+ + E + RA+ K +L
Sbjct: 126 SSKWSGYFDIL-------PTSFDTPMFWTEEELAELRGT-SVVEKIGRADAEKDYKEKLI 177
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQ-----MGTESF 287
LF P DI T ++ E++ + S ++++ + G S
Sbjct: 178 PAINSRPELF--LPRDIHTR-YSVEMYHVMGSRILSRSFNVEKWAPDEEEVGDGAGDVSM 234
Query: 288 ASHL-VHLQTSCTTTRTPLSPTHNPKEKVSL------------ARRFALVPLGPPLLAYS 334
S + V L P PTH+ L + A+VP+ L A
Sbjct: 235 GSGMDVDLPDGA-----PAPPTHSSHGTDDLEHEGGEEEEQEDSSDIAMVPMADILNARY 289
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD--------EDNP 386
A L ++ ++++ RP K GE I G PN++LL YG VD + NP
Sbjct: 290 GSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGHVDVIQLPNGGQGNP 349
Query: 387 YD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 435
D L+V A L+T+D + + + VF ++ + E +
Sbjct: 350 GDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGD----DVFDLYF--DLEIPPSI 403
Query: 436 LPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 495
+ +RL + D E + + P P M+ L L + + RL YP ++ +DE
Sbjct: 404 ISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQRRLKEYPTSIQDDEQ 459
Query: 496 ML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
+L T +L+ + + +L EKK+L+ L TA
Sbjct: 460 LLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491
>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/292 (24%), Positives = 125/292 (42%), Gaps = 40/292 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I + EG +
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKIGK--EGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT----EAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
D++ M + + P P+ ++ E A + + + L +
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGLAHIMGSL--------IMAY 201
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
+F + T + +++ S +VPL L A + + A L +
Sbjct: 202 AFDIEKAEDEDDEDNDEEEGYVTDDEQDQSSK----GMVPLADILNADADRNNARLFQEE 257
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
D++ + +P AGE I G P + LL YG+V DN VVE +L+
Sbjct: 258 DSLVMKAIKPIHAGEEIFNDYGELPRADLLRRYGYV-TDNYAHYDVVELSLD 308
>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
Length = 658
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 26/143 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
V+A ++ A ++PN L+++ +VL +E ++++ T+K +E CLALY
Sbjct: 48 VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
+ Y K QG KSFW PY+ +++ + W +L L E +
Sbjct: 107 IFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLIDEFMYIQ-- 155
Query: 224 IKREYNELDTVWFMAGSLFQQYP 246
+E+D W+ L +YP
Sbjct: 156 -----SEMDKSWYKFKGLMNKYP 173
>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
Length = 518
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/370 (21%), Positives = 137/370 (37%), Gaps = 76/370 (20%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
R+ +S T++ + K+ ED G W G+ C L P
Sbjct: 9 RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68
Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
S H P +AA DL+ G+ VP S+++T + +L +E + + N S L
Sbjct: 69 HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128
Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
+ LA+ L+YE +GK S+W PY+ L R S LA + +K
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR---------------SYDVLASFKKAVSK 173
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 273
A K E+ E A SL + + TF + A + S +H+
Sbjct: 174 A---------KSEWKE-------ANSLMD--ALKLKPQLLTFRAWIWASATISSRALHIP 215
Query: 274 -QRCGCTPQMGT-ESFAS---------HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFA 322
GC +G ++A+ ++VH + + + LS + +
Sbjct: 216 WDEAGCLCPVGDLFNYAAPGEESNDLENVVHWMNASSLEDSSLSNGETTDDFIG------ 269
Query: 323 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
P + A + YK G +++ G N +LL +YGF+
Sbjct: 270 ----DQPDIGLERLTDGGFDENMAAYCFYARKNYKKGTQVLLGYGTYTNLELLEHYGFLL 325
Query: 383 EDNPYDRLVV 392
+NP D++ +
Sbjct: 326 NENPNDKVFI 335
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + ++ GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVIS 455
K V R G + VF +H + + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
PVS E + L D L Y T+ ED+++L +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412
Query: 516 EKKMLNACLQVTA 528
EK++L ++ A
Sbjct: 413 EKEILERAVRSAA 425
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 73/160 (45%), Gaps = 18/160 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 96/234 (41%), Gaps = 31/234 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
W NG + L+ P + AA +D+Q G VP +++T +G
Sbjct: 11 WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61
Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+ LL + K E L+++L+ E G SFW PYI L R + + L
Sbjct: 62 KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+S EL LTGS E L+ I R+Y + F L + PY + FT+++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDL 170
Query: 259 FKQAFVAV---QSCVVHLQRCGCTPQM---GTESFASHLVHLQTSCTTTRTPLS 306
++ A AV Q+ V CG G+ S + LV L C + +S
Sbjct: 171 YRWAVSAVMTRQNAVPRAVVCGGADDACARGSGSGVAALVPLFDLCNHSDGKVS 224
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 97/397 (24%), Positives = 163/397 (41%), Gaps = 53/397 (13%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A +A L++ K+QG +S P+I +L G P+LW E ++A L A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC 276
+ E+ +RE+ +L +G+ P + +A + E F A V+S
Sbjct: 188 VKEQ----QREWQQLYGDLVRSGT-----PAGV--QAPSREDFFWAMSCVRSRTFSGPYI 236
Query: 277 GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLA--------------RRFA 322
G T Q + + LV + + T P +++ +++A
Sbjct: 237 GSTLQDRLRT--AGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYA 294
Query: 323 LVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
+ PL L +SS ++ +A D+ +V R +K GE + + G Q N L+ YGF
Sbjct: 295 ICPL-IDLFNHSSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGF 353
Query: 381 VDEDNPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 438
+ +NP D V+ L T R+ A + L+ + V R S+ L
Sbjct: 354 AEANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQA 412
Query: 439 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 498
+R +D SE + +SS SP E + + +A+ + L ++L ED A+L+
Sbjct: 413 VRFLLAAD-SEAGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLS 469
Query: 499 DYNLHPKKR---------VATQLVRMEKK-MLNACLQ 525
R VA R+EKK +L A LQ
Sbjct: 470 STGASAGGRKGGTAAAAAVAAVAFRVEKKRLLTAVLQ 506
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/266 (22%), Positives = 109/266 (40%), Gaps = 54/266 (20%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA L+ EKK G+KS W+PYI L + + S + W E EL+ + S E ++
Sbjct: 2 LAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVK 56
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
+ I+++++ F+A + Q P I TE E F A+ V
Sbjct: 57 QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALV------------- 95
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
G+ ++ + + R L P + L+ L L +S+
Sbjct: 96 ---GSRAWEN----------SKRISLIPFADFMNHDGLSASIVLRDEDNQLSEFST---- 138
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
+Q+ DR Y G+ + + G N+ L++++GF N +D + ++ + +
Sbjct: 139 --------LQVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPND 190
Query: 400 DPQYQDKRMVAQRNGKLSVQ---VFH 422
DP K + Q + +V+ +FH
Sbjct: 191 DPLRNMKLGLLQTHHTRTVKDINIFH 216
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 111/301 (36%), Gaps = 77/301 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
+ A+E ++ G+ +P ++T+E L + E +L E + LA +L
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
E K + Y++ L R G S L W E+++ L GSP+ LER +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
E+ + Q+ +DI +F
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDI--------LF------------------------- 257
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAM--LA 342
S L+ L+ SL ALVP +L + C+A L
Sbjct: 258 ----SRLIRLE--------------------SLGGNLALVPWAD-MLNHQPGCEAFIDLD 292
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTED 400
V L DR Y+ GE + G +P+S+LLI+YGF DNP D + ++ ED
Sbjct: 293 RGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFAPAVGDNPDDEYALNLQIDEED 352
Query: 401 P 401
P
Sbjct: 353 P 353
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 113/288 (39%), Gaps = 78/288 (27%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 77 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
KRE F+A ++A A + R C M
Sbjct: 182 KRE--------FLA--------------------IRKALEAFPEII---DRISCDDFM-- 208
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
H L TS T E VS L+P L + +AML
Sbjct: 209 -----HAYALVTSRAWRST---------EGVS------LIPFA-DFLNHDGASEAMLLND 247
Query: 345 DDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
DD ++V DR + GE +++ G N+ L++++GF N +D++
Sbjct: 248 DDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQV 295
>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
Length = 689
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
+ A +D+ A +PNS ++++ RV + + ++L+ ++ ++ CLA++LM
Sbjct: 74 IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
E QG++SFW PYI ++ ES LL W + E+ L + E +
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183
Query: 224 IKREYNELDTV 234
I+ E+N+L +
Sbjct: 184 IEDEWNKLSKI 194
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 64/284 (22%), Positives = 120/284 (42%), Gaps = 51/284 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+ A+ DLQ G+ S+P+S ++T E VL + + T + +S L L +L+ E+
Sbjct: 62 LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
++S W PY+ L + P+ W E+E+ L +P + + LE+ +K + E
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTE- 172
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---RCGCTPQMGTESFA 288
W SL + +I T+ +T+ + A+ V + V+++ R G + Q + A
Sbjct: 173 --SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMA 229
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
+L L H+P +V A +C
Sbjct: 230 PYLDLL-------------NHSPAVQVEAA------------FNEERRC----------Y 254
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
++ + + + + GP N +LL+ YGF+ +NP+ + V
Sbjct: 255 EIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFIAANNPHRSVYV 298
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 103/433 (23%), Positives = 164/433 (37%), Gaps = 86/433 (19%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
FS+P S V L + L + +L+E + LAL ++ + Q S W PY+
Sbjct: 55 FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-----------EGIKREYNEL 231
L Q + S + WS++ELA L S +I ++ +G++ E+
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
S+ Y +DIP + G T E A L
Sbjct: 162 ---CHKVASVIMAYAFDIPDPS----------------------EGPTSGGKGEEAADDL 196
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
V EK L+ ++PL L A + + A L ++ +++
Sbjct: 197 VS--------------DDGEDEKTILS----MIPLADMLNADADRNNARLICDNEDLEMR 238
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL-------NTEDPQY 403
+P GE I G P S LL YG+V D + YD + A L P
Sbjct: 239 AIKPIAKGEEIFNDYGQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSL 298
Query: 404 ----QDKRMV----AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
QDK V A+R G VH+ ++ +I D L + D S +++++
Sbjct: 299 PKLTQDKLKVRLDLAEREGVYDESFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILD 358
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 515
S + S LA +AR Y T+ EDE +L + +L + +A Q+
Sbjct: 359 SESSLPSRSKLTTELAGQVLAILLQARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSG 418
Query: 516 EKKMLNACLQVTA 528
EKK+L A ++ A
Sbjct: 419 EKKVLRAAIREAA 431
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 95/223 (42%), Gaps = 35/223 (15%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 90 LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQ 281
+ +KRE+ E + + Q+ D +E T E++ A+ C+
Sbjct: 148 DDLKREFREAKAMLLAS----QKDAEDDFSELLTRELYNWAY-----CI----------- 187
Query: 282 MGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPL---GPPLLAYSSKCK 338
+ SF + LV T P E VS+ L+PL G +A + +
Sbjct: 188 FSSRSFRASLV--MTEAQQQALP--------EDVSVDDFSVLLPLFDIGNHDMAVDVRWE 237
Query: 339 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
A A QL V R ++ G+ I P+ N++LL+ YGF+
Sbjct: 238 LDAANSGAACQLRVGREHQPGQQIFNNYSPKTNAELLLGYGFM 280
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 21/159 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ +P S ++T +I+ L K+ + ++ L+YE G
Sbjct: 59 VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ L L+V+ P+LW++ +L L G+ + + E E + YN+
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK-- 165
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
++ L +P F+ + FK+A SC+V
Sbjct: 166 ---YIKSKLIANHPDVFNEHVFSLDNFKRA-----SCLV 196
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 104/473 (21%), Positives = 188/473 (39%), Gaps = 83/473 (17%)
Query: 88 NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
+ LPP L+ +H+ K P+ V A++ + + FS+P L+++ E
Sbjct: 75 DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131
Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ TI + + LS++ +AL + + + KSFW PY+ L + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+ ++ E+ L GSP + L I R+Y+ +F SL Q+
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYS-----YFY--SLLQK------------ 224
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
V + +L+ +F + S TR L P+ +E++S
Sbjct: 225 --------NVDPALSNLR----------ANFTYNDYRWAVSTVMTRQNLIPS---QEEIS 263
Query: 317 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL-------VVDRPYKAGESIVVWCGPQ 369
+ L P+ L+ C QL R + GE + ++ G +
Sbjct: 264 GNDKDQLPPVNA-LIPLWDFCNHQDGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTR 322
Query: 370 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ------VFHV 423
++ I+ GFVD +N +D L ++ L+ DP + + + LS + F +
Sbjct: 323 TCAEQFIHNGFVDINNAHDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQL 382
Query: 424 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC-MERAVLDQLADYFKAR 482
AG + +L +LRL ++ S + + S C +E V D+ + KAR
Sbjct: 383 KAG-PQPVDGKLLAFLRLFCMTKDSLDRWLQSDNASNLMHEECGIETEVDDKSWSFLKAR 441
Query: 483 ----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
L YP T D ML + +L +R+ L EK++L + ++ A I
Sbjct: 442 CQLLLQLYPTTKEADLKMLEE-DLSSHRRMCVLLRLAEKRILLSAIECAAQRI 493
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 121/286 (42%), Gaps = 56/286 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V D+ + S P++L + + + A+ +T ++ A L ++L E +GK
Sbjct: 43 VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW PY+ L R+ ++PL +S+ +L +L G +A +E + I RE E
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
A S+ Q+ YD TE +T+E++ A S SF L+
Sbjct: 151 ----AAVSILQREGYD--TEYYTWELYLWASTIFTS----------------RSFPGKLM 188
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA---VDDAVQ 349
+++ + +P+ PL+ + A + D +++
Sbjct: 189 DW------------------DRI-IVHEDDTMPILFPLIDSLNHYPATIITWQPSDTSLR 229
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
++ AG + GP+ N +LL+ YGF NP+D +++++
Sbjct: 230 IISGVGVSAGAEVYNNYGPKANEELLMGYGFTLLQNPFDSFLLKSS 275
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 100/481 (20%), Positives = 177/481 (36%), Gaps = 89/481 (18%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ NG K+ L+E + + R +H + + L G+ +P ++T+E
Sbjct: 44 ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95
Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+G +T I L + +A L ++L+ + ++ + SF+ Y L
Sbjct: 96 MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
P+ WS+ EL +L GS +I ER I+++Y+ + V D F
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARF 196
Query: 255 TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEK 314
+ + F A + V C+ G
Sbjct: 197 SLDRFSWARMIV-----------CSRNFGL-----------------------------T 216
Query: 315 VSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
+ + ALVP L Y + DA + G + G + N +
Sbjct: 217 IDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHR 276
Query: 374 LLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAG 426
L+NYGF EDN + ++++ L+ D Q + DKR +G ++ +
Sbjct: 277 FLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKRAYLHESGIYTMDA-RLSCS 335
Query: 427 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARL 483
+ + RL + T + S + P P+S E L L D +L
Sbjct: 336 HSDANTREGFSFARL--IVATEDEFSSMKMKSPAHSSPPISFDNEIRALQYLRDLMTHQL 393
Query: 484 AGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSP 541
+ Y T+ ED +L Y L + A +R EK++ Q AD ++ L + ++
Sbjct: 394 SLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCR-YFQELADKVIQLFSLPLAE 452
Query: 542 C 542
C
Sbjct: 453 C 453
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 125/318 (39%), Gaps = 57/318 (17%)
Query: 79 GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
G L +WM NG K+ L++ P V A E L G+ +P +L+++
Sbjct: 30 GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83
Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L + + ++ + KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 84 TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 136
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ L + E+ I+R + +L + + G + ++ + FT+
Sbjct: 137 PVFLRRKVFESLP-KDLREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTW 193
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
F A+ AV T C + S + + +
Sbjct: 194 NNFVWAWTAVN----------------------------TRCIFAQGSNSSSLWEDDHCA 225
Query: 317 LARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 374
LA P L + KA + A V + +++ + + A E + + GP N +L
Sbjct: 226 LA---------PFLDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISYGPHSNRRL 276
Query: 375 LINYGFVDEDNPYDRLVV 392
++YGFV DNP D +VV
Sbjct: 277 FLDYGFVLPDNPNDVVVV 294
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/419 (21%), Positives = 168/419 (40%), Gaps = 47/419 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + + V +E + +AL+L+ EK + +
Sbjct: 80 LVAQRDISRNEVVLEIPKKLWINPDVVAASEIGN---VCGGVKPWVSVALFLIREKLK-E 135
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L + S + WSE ELA L G+ + L ++RE+ +++
Sbjct: 136 DSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYLRREFLKVE 188
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ Q +P + T + F AF + R ++ ++ L+
Sbjct: 189 EEILVPHK--QLFPSPV-----TLDDFSWAF--------GILRSRSFSRLRGQNLV--LI 231
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
L C T L N +++ Y K + + D L
Sbjct: 232 PLADLCNFLHTWLLDQVNHSPDITIEDG-----------VYEIKGAGLFSR-DLIFSLRS 279
Query: 353 DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAGE +++ N++L ++YGF++ + + + ++ DP + DK +A+
Sbjct: 280 PISLKAGEQVLIQYNLNLSNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAE 339
Query: 412 RNGKLSVQVFH-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCM 467
NG + F V + L + LG +D+ ++S+ + G + PVS
Sbjct: 340 TNGLGEIADFDIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRAN 398
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 526
E + + D K+ L+GY T+ EDE L L+P+ +A + EKK+L ++
Sbjct: 399 EELICRVVRDACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 99/415 (23%), Positives = 173/415 (41%), Gaps = 64/415 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
A+ +L++ FS+P SLV+++ +++ +E+ T + + CL MYE+
Sbjct: 39 ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNE 230
S W Y+ + +S + WS+ EL L GS +I E AEG Y+
Sbjct: 94 PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSI 144
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASH 290
L +F+ P EA++ ++ RCG + + SF H
Sbjct: 145 LVPYLSKHADIFK------PLEAYSLALY--------------HRCGSL--ILSRSF--H 180
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
+ + S + ++ E V +VP+ L A S A L DA+ +
Sbjct: 181 VSNQDDSASDASDDDDAAYHEVETV------GMVPMADVLNAKSGSANACLVYHPDALVM 234
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMV 409
+ AGE I PN+ LL YG VDE N D + + A L +D
Sbjct: 235 TTTKEIAAGEQIFNTYNDPPNADLLRRYGHVDEVNLNDNVEISADLIGCKD--------- 285
Query: 410 AQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 468
+R L ++ V+ + E + D + +++ + SE + + + P ++
Sbjct: 286 LERVDWLLDRLDDVYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLD 341
Query: 469 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 522
A ++ + + RLA Y +T+ EDE++L + + A LVR+ EK++L A
Sbjct: 342 EATAMRVREILQMRLAQYSSTIEEDESLLASSTMLTSRSRAALLVRLGEKRILAA 396
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 155/386 (40%), Gaps = 66/386 (17%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+ L + +M+E +G +S W PY+ L + E+P+ WS EL L S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QRC 276
K + E+ F A L P EIF + +V L R
Sbjct: 596 G------KADAEEM----FHAKVL--------PVIRANHEIFPSSQSYSDDELVQLAHRM 637
Query: 277 GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSK 336
G T S+ Q +E S +VP+ +L ++
Sbjct: 638 GST-------IMSYAFDFQNEDEEDEEDEEEWVEDRESKST---MGMVPMAD-ILNADAE 686
Query: 337 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYD------R 389
A + DDA+ + R KAGE I+ + GP PNS+LL YG+V + + YD +
Sbjct: 687 YNAHVNYGDDALTVTALRTIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWK 746
Query: 390 LV---VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 446
LV + A+L + Q + R + V RE + + + SD
Sbjct: 747 LVENSLAASLGLSEQQLDNAREYLDMDEIEDTFVL----DRESDEPNPDGTFTGSARFSD 802
Query: 447 TSE-----MQSVISSLGPICPVSPCMER-------AVLDQLADYFKARLAGYPATLSEDE 494
E ++ ++ ++ + P S +R +VL + D ++ YP T+ EDE
Sbjct: 803 IPEDLRDQLKLLLKAVRKVDPSSVADKRKRDEIQHSVLIKALDALASQ---YPTTIPEDE 859
Query: 495 AMLTDYNLHPKKRVATQLVRMEKKML 520
+L+ ++ ++R A + EK+++
Sbjct: 860 RILSGSDISERQRAAVTVRLGEKRLI 885
>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 477
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 114/286 (39%), Gaps = 46/286 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A++DL + FS+P + ++T NET L N EL L L +++E
Sbjct: 45 VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G KS W PY L +S + WS+ EL L GS +I + +
Sbjct: 97 LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKES------- 142
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
A S F + IP A IF+ A + + R G T ++A
Sbjct: 143 ---------ADSTFTEQL--IPLIAQHANIFQTAGRSNDELLSLCHRMGST----IMAYA 187
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L + S + + +SL + ++PL L A + A L DD V
Sbjct: 188 FDLEKPEPSQPPNQQDDEEWEEEESAISLPK--GMIPLADMLNANADHNNAKLFYQDDKV 245
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE 393
+ +AGE + GP P S LL YG+V D+ YD VVE
Sbjct: 246 VMKTLHAVRAGEELFNDFGPLPRSDLLRRYGYVTDQYAKYD--VVE 289
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 99/492 (20%), Positives = 176/492 (35%), Gaps = 83/492 (16%)
Query: 65 AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
A S ++V E L + W+ G CKV + S + A++D++
Sbjct: 56 ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
G+ +P++L +T +L ++L + L+++L+ E S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELSN-PTSEWGP 168
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ L +Q + W E GSP + G R+Y ++
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY------CYLY 215
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
+ + +P +FT++ F A VQS ++ G S + L+ C
Sbjct: 216 SMIDRTQSNIMPISSFTWDAFVWAISTVQS-----RQNPVYAGNGNGSIMA-LIPFWDFC 269
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 358
+ T T S C A D +K
Sbjct: 270 NHSSTGSKIT--------------------SFYHMDSNCMTSGAIKD----------FKK 299
Query: 359 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKL 416
GE + ++ GP+ N++LL++ GF + N +D E L + ++ ++ +R +
Sbjct: 300 GEQVYMFYGPRDNTQLLMHAGFATKTNLHDSYPFELHLLEGNHEIRHDKVHLLEERGIRD 359
Query: 417 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS---------------SLGPIC 461
V V E +++P+ R+ Y E +++ L P+
Sbjct: 360 GVVVNLNQNPTSNELPLELIPFYRI-YALSEQETRAIAPPQVPGEHNHHHGHQLELKPLA 418
Query: 462 --PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 519
++ E L K +LA YP TL EDE L N +R EKK+
Sbjct: 419 FKIITQENEEKAYSNLVQALKGKLASYPTTLEEDEQELKK-NPPANQRFILYTKINEKKI 477
Query: 520 LNACLQVTADMI 531
L+ ++ +I
Sbjct: 478 LDRNIKYLESLI 489
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 21/200 (10%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V G+R + + +L W+H G+ I + S + V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427
Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
G +P L++ + L + I +L+ ++ + L LY+++EK G S W
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
P+ L SPLL+ TEL L G+ E E +G++ + L
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHESL------ 534
Query: 238 AGSLFQQYPYDIPTEAFTFE 257
G L + YP PT+AFT+E
Sbjct: 535 -GPLAEAYPALFPTDAFTYE 553
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 94/243 (38%), Gaps = 59/243 (24%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
R E I E+ ++ + LF + + E FK + V
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATV------------- 206
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
F+ L + T P+ N A FA + LL Y+
Sbjct: 207 -------FSRSLA-IGEDSTLVMIPMLDFFNHN-----ATSFAKLSFN-GLLNYAV---- 248
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
+ DR Y + I + G N++L ++YGF +NPYD E L T+
Sbjct: 249 ----------VTADRAYTENDQIWINYGDLSNAELALDYGFTVPENPYD----ETDLLTQ 294
Query: 400 DPQ 402
P+
Sbjct: 295 FPE 297
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)
Query: 330 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 389
LL +SS+ +++ +++V ++ + GE++V+ GP N LL++YGFV NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177
Query: 390 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 432
LV + +++ +DP ++ + N G S Q+ V G +
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235
Query: 433 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 488
+L +R+ + D E+ V + + P+ ER + L LA +P
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295
Query: 489 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
+ ED++ L ++ R+A Q ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 145/360 (40%), Gaps = 57/360 (15%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
+R V EE W+H+NG C+ + + E R + A++ L+ +
Sbjct: 19 TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72
Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+P L++T++ ++ + +A + ++L+ LA++LM EK + +KSFW PYI L
Sbjct: 73 VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P ++E + L S ++ E +E+ EL + M LF
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNSLRGKAKAKKYE-CHKEFMELAPFFKMLADLFPD 183
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTP 304
+AF F+ FK A+ A+++ R P G + L + + T P
Sbjct: 184 -----QEDAFNFKDFKWAWSAIKT------RAFDVPLGGETCY--RLRDSEDTSNPTMFP 230
Query: 305 LSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 364
L + N + + R Y+ K + + + + Y+ ++
Sbjct: 231 LVDSINHAAQAKIRHR------------YNEKRRCLESRTETV--------YRRHAEVMN 270
Query: 365 WCGPQPNSKLLINYGFVDEDNPYDRL---VVEAALNTEDPQYQD----KRMVAQRNGKLS 417
G N LL+ +GFV NP D + +V+ L P+ + K M RN +S
Sbjct: 271 SYGRADNDNLLLEFGFVVPGNPADTVTFHLVQDVLEYLQPENNELLERKIMFLARNNLIS 330
>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 485
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 104/480 (21%), Positives = 185/480 (38%), Gaps = 72/480 (15%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ L+++++ +
Sbjct: 24 LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
+ L GS +I + A DT+ L Q P+ P +
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPHHFPPRPNMPPLNSPDSQN 186
Query: 266 VQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVP 325
C+ H R G ++A + + T T + E +VP
Sbjct: 187 ALLCLAH--RMGSI----IMAYAFDIEKADEADEDTAEDGYMTDDEDEPAK-----GMVP 235
Query: 326 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 385
L A + + A L + + + + +GE I G P + LL YG+V DN
Sbjct: 236 LADIFNADAQRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYV-TDN 294
Query: 386 PYDRLVVE---------AALNTEDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI 432
VVE A L P + R+ N + + + + G K+AI
Sbjct: 295 YAQYDVVEFSLDGICKVAGLPDSKPSSTNPRLELLDNLDMLEEGYSIPRVPPNGTLKDAI 354
Query: 433 -SDMLPYLRLGY--VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 489
D L LR V D + +++ + P S E ++L L + +R + YP +
Sbjct: 355 PEDFLVLLRALTLPVEDLNRLKARNKAPKPEFSAS---EASLLRSLVN---SRQSEYPTS 408
Query: 490 LSEDEAML---------TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 540
+ EDE +L + ++ ++++A Q+ + EK++L L + D ++ PD S
Sbjct: 409 IQEDENILRYLEQQNGYINDSILIRRKMAVQVRKGEKEILTQILSLL-DTHLVQPDQNGS 467
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 91/431 (21%), Positives = 164/431 (38%), Gaps = 71/431 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+E+ + + +L + + LS LA+YL++ +
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS A + + ++
Sbjct: 87 ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
+Y +L L Q+ P + FT E +K A ++ S + G T
Sbjct: 137 DDYRQLLV------PLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSGTTSVRLVA 190
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
A L H+P K +C A
Sbjct: 191 PLADML----------------NHSPDVK---------------------QCHAYDPTSG 213
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 405
D + ++ + Y+ G+ + ++ G PN++LL YGFV DNP D + + P Y+
Sbjct: 214 D-LSILAAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQ 272
Query: 406 KRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICP 462
K + G S + K+ + +++L YLR+ + D S + + L G
Sbjct: 273 KERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGK 328
Query: 463 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKML 520
VS E VL L D + L G+ L + EA L DY A + E+++L
Sbjct: 329 VSDGNEMQVLQFLVDSIGSLLEGFGIPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVL 388
Query: 521 NACLQVTADMI 531
+ D++
Sbjct: 389 TRAKRTAEDLL 399
>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Galdieria sulphuraria]
Length = 546
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 93/399 (23%), Positives = 156/399 (39%), Gaps = 67/399 (16%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
E +L++W+ NG+P +K KP HN + A L+ G+ ++P
Sbjct: 71 EKTEELENWLFDNGVPS----IKGKPVLSPHNCRT-----FRAKIPLKLGEEVLAIPERF 121
Query: 133 VVTL---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T E++LG + LS+ +A L+ E + + SFW P+I L
Sbjct: 122 WLTKQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDEL 174
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-D 248
L+WS E YL S T +IL E + EL+T LF ++ Y
Sbjct: 175 HHF-----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT------ELFPKFLYPQ 223
Query: 249 IPTEAFTFEIFKQAFVAVQSCVVH--LQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLS 306
+ FT F A VQS ++ + C G E + ++ L T + R
Sbjct: 224 YDVKYFTLPYFTWALSIVQSFGLYDIMDSCPLVIVPGLE-WLTYKYSLITEESFFRQYFH 282
Query: 307 PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGES 361
++ VSL R +GP ++ + + + A +D V LV + ++
Sbjct: 283 ISN-----VSLIR------VGP---FFTQERRLKITASEDLKVGEPVSLVYEGNVSLIDT 328
Query: 362 IVVWCGPQPNSKLLINYGFVDEDN--PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 419
W ++ G +DE+ + A+ T D + DK + +Q
Sbjct: 329 FCRWGWK-------LDLGALDEEQLLKMGSYEISFAVTTTDQFFDDKEDILDAQRLELLQ 381
Query: 420 VFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS 456
F + KE + +LP+LRL + D ++SV S
Sbjct: 382 TFELRYDMSKELLQRILPFLRLICLKDKDSFILESVFRS 420
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 120/296 (40%), Gaps = 46/296 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ASED++ + F +P S +++E + I ++ KL+ L LY+M K G
Sbjct: 42 VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+ W PY L Q ++S ++W++ EL L GS +I G + +Y E
Sbjct: 97 MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQE-- 145
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ +P E FK +++S R G + SF +
Sbjct: 146 ----KLKPIIDAHP----------EYFKDCDTSLES----FHRMGGL--IMAYSFDAPDS 185
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-AVQLV 351
+ +N L + A+VPL L A++ C A L A DD +
Sbjct: 186 FSEDEEDDEDIEHDDLYNE----GLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMT 239
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 406
+P K GE + G PN L YG+V +E +D +VE +++ Y +K
Sbjct: 240 AIQPIKKGEQVYNTYGELPNCDFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 77/159 (48%), Gaps = 18/159 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVV 271
F+A + Q P I TE E F A+ + C+V
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALGEKVLCIV 217
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 122/318 (38%), Gaps = 80/318 (25%)
Query: 78 LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+G+L SW+ + G P V+ L P+ + +++AGD +P++L++T
Sbjct: 21 IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74
Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ V EL ++ LS L ++++ E+ G+KS W PYI +
Sbjct: 75 DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI------------ 117
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P ++ + P + A+ I R E V+ F+ D+
Sbjct: 118 -PDVFDGLQCRKSVRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----N 166
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
FE F A+ AV + ++++ G T L+P +
Sbjct: 167 FETFSWAWSAVNTRCIYVEGHGST-------------------------LAPFLD----- 196
Query: 316 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 375
LL + K + V++ + + Y+AG + + G N L
Sbjct: 197 --------------LLNHHWKASIETSFVNNHFIIRSNVGYEAGSEVFIGYGSHDNRTLF 242
Query: 376 INYGFVDEDNPYDRLVVE 393
+NYGFV ++NP D + VE
Sbjct: 243 LNYGFVLDENPNDCITVE 260
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 98/449 (21%), Positives = 162/449 (36%), Gaps = 68/449 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEK 168
A +D+ G FS+P L ++L T+ LL ++ E L L +M+E+
Sbjct: 43 ALQDIHEGTTLFSLPRELTLSLR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEE 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G +S W Y+ L ++P+ WS +L L G+ +I +++Y
Sbjct: 99 SRGVESKWDVYLSSLPS-------TFDTPMFWSAEDLEELKGTAVPDKI--GRNDAEKDY 149
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
E + S +P D ++ E + + S H+++ P+
Sbjct: 150 RE--KLVPAVQSRPDLFPLDTLDRFYSVERYHIMGSRILSRSFHVEKWEGGPEDEEMGDV 207
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARR--------------------FALVPLGP 328
+ + + + PT A+VP+
Sbjct: 208 NGEPQAENGSAPSGEAVPPTEEAASGDDGEGPGDGEGADEEDNEEDAEDPADVAMVPMAD 267
Query: 329 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD------ 382
L A A L + A+++ R + G+ I G PNS LL YG VD
Sbjct: 268 LLNARYGSSNAKLFYEEHALEMRTTRFIRRGDQIWNTYGDPPNSDLLRRYGHVDLVPLAQ 327
Query: 383 --EDNPYDRLVVEAALNTEDPQYQDKRM-VAQR------NGKLSVQVFHVHAGREKEAI- 432
NP D + V A L T+ + VA+R G V V + I
Sbjct: 328 GGLGNPADVVEVRADLVTDVVSASGSSIPVAERIDWYLEMGGDDVFVLETDLDIPEPLIV 387
Query: 433 -SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 491
+L L + T E + P P M+ AV+ L + RL YP T+
Sbjct: 388 LVRLLQLLEPDW-EKTREKEK---------PPKPKMDGAVIGVLVEVLHRRLKEYPTTID 437
Query: 492 EDEAMLTDYNLHPKKRVATQLVRMEKKML 520
EDEA+L + + +VRM +K++
Sbjct: 438 EDEALLYKEDTLSINKKNAIIVRMGEKII 466
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 156/430 (36%), Gaps = 68/430 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +SW+ GL ++L+ R + AS L G+ +P+ LV+T ER
Sbjct: 24 EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
G ++ LL LA ++ + W PY+ L ++ G + L
Sbjct: 78 AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPG-------TLLD 124
Query: 200 WSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W E+ L GSP + + EL+ + G P +P E
Sbjct: 125 WPAKEVQQLLRGSPLQRLADSITSAASASWRELEPL-IAQGRADGLVPEHVPLSKGDLEW 183
Query: 259 FKQAFVAVQSCVVHLQRCG-------CTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNP 311
AF + S + L G Q+ + A HL S P
Sbjct: 184 ---AFGVLLSRCIRLPSRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVA----GPAVPG 236
Query: 312 KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 371
++ A + ALV L DRPY AG+ + V GP+ +
Sbjct: 237 GDRAGGATKGALV-------------------------LRADRPYAAGQQVYVSYGPKSS 271
Query: 372 SKLLINYGFV--DEDNPYDRLVVEAALNTE-DPQYQDKRMVAQRNGKLSVQVFHVHAGRE 428
+LL++YGF NP+ + A++ + DP K R+G S F +
Sbjct: 272 GELLLSYGFCPPPASNPHQDCRLRVAVDRQGDPLADLKEQALARHGLPSELEFPLKLEGI 331
Query: 429 KEAISDMLPYL--RLGYVSDTSEMQSVISSLGPICPVSPCMERAV--LDQLADYFKARLA 484
E + L +L R +T E+ SV+ G P+ + V L L++ A L
Sbjct: 332 PEGLLQYLAFLDARPKVAQETFELASVLFESGGF-PLLDGQDTLVLALRGLSNRCTAALK 390
Query: 485 GYPATLSEDE 494
YP ++ D+
Sbjct: 391 AYPTSMEADQ 400
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 86/395 (21%), Positives = 162/395 (41%), Gaps = 64/395 (16%)
Query: 81 LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L+ W+ + +G+ KV L+ P V A+ L+ G+ F +P S + E V
Sbjct: 29 LRIWLEEEHGVDMSKVDLQRSPLEGLG------VFANRRLEPGETLFMIPKSCCIYPELV 82
Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + + + KL+ E+ LA +L EK +G +S + P+I L
Sbjct: 83 FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PLLW++ E+ L G+ EIL E ++ + V G ++Q+ I TE
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKG--WKQFFQTIETE 191
Query: 253 AFTFEIF----KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL--QTSCTTTRTPLS 306
T E F + AF +V S + G + E L+ + S + T +
Sbjct: 192 KMTPEEFGFMMRGAFASVLSRAFD-SKIGRGDKGLEERVVIPLLDIFNHGSYGPSITFDT 250
Query: 307 PTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 366
EK G P+ + K K++ + GE + +
Sbjct: 251 ALERDNEK------------GFPVRV-ADKGKSI----------------EEGEELFGFY 281
Query: 367 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHA 425
G +PN +L YGFV + + +++ +DP + K + + G ++V Q+F +
Sbjct: 282 GDKPNWNMLTTYGFVSPNPKCQETTLSVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR- 340
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 460
+ + + ++ Y R+ +S+ +++ V ++ G +
Sbjct: 341 -HDTDPMGPLINYFRIREISNEADLTKVQTNYGEM 374
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 83/192 (43%), Gaps = 10/192 (5%)
Query: 358 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 417
AG ++ + G N+KLL +YGFV +N + + DP ++ K+ V N
Sbjct: 235 AGSAVHISYGQYSNAKLLYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTK 294
Query: 418 VQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 476
Q + H + + +L LR+ +++ Q + I V E AV + L
Sbjct: 295 EQTYDFHGTLLSNDVDERLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQ 352
Query: 477 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLP 535
+ +L+ Y TL EDEA+L + K R+A + VRME K QVT +I L
Sbjct: 353 STCRRKLSNYATTLEEDEAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLE 406
Query: 536 DVTVSPCPAPYA 547
S P A
Sbjct: 407 QWKQSLASKPDA 418
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 7/101 (6%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A L +G VP L + E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
G +SF+ P+I + + P+ W+E EL L G+
Sbjct: 95 GGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGT 128
>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 97/411 (23%), Positives = 151/411 (36%), Gaps = 65/411 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L +M+E+ QG S W Y+ L + ++P+ W+ +L L G+ +I
Sbjct: 81 LILCIMWEEAQGSSSKWSGYLSSLPS-------SFDTPMFWNSADLQELQGTAVVDKI-- 131
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ----- 274
E +R+YN + + S +P + +E +T + + S H++
Sbjct: 132 GKEQAERDYN--GKLLPVVQSRVDLFPPESLSEHYTLHRYHLTGSRILSRSFHVEPWMGG 189
Query: 275 ---RCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE-----------------K 314
Q E+ A V + S T T + KE
Sbjct: 190 DDDVSDDGSQGPDEAVAKMDVDAERSVVETGTIPGSSETTKELNDELEGPTDDEDDEDED 249
Query: 315 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 374
A+VP+ L A A L + +++V +P KAGE I G PNS L
Sbjct: 250 ADDPADVAMVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDL 309
Query: 375 LINYGFVD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSV 418
L YG VD NP D L V AA + QDK
Sbjct: 310 LRRYGHVDLVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDDT 369
Query: 419 QVFHVHAGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 477
V G + + +++ + RL ++ D E S L P ++ VL D
Sbjct: 370 FVI----GTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAED 420
Query: 478 YFKARLAGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 525
R+ Y T+ +DEA+L N P K+ A + EK++L+ LQ
Sbjct: 421 VLSRRINEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 122/286 (42%), Gaps = 62/286 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP S + + +L +++LL +++ +A LA+ L+ ++K G++S
Sbjct: 51 ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q G + S + WS++EL + S E +++ I++++ + V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+ +P + + TF+ F A+ V+S + G T + FA L H
Sbjct: 163 -------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSLIPFADFLNHD 208
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
S ++ ++VS ++ DR
Sbjct: 209 GFSEAVV------LNDEDKQVS--------------------------------EVAADR 230
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
Y E +++ G N+ LL+++GF N +++ VE +N D
Sbjct: 231 NYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQ--VEIQINIPD 274
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
DD + V + + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 42 DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 456
K V R G + VF +H E + +L +LR+ +++ + +I +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160
Query: 457 LG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 511
LG PVS E +L + +AR L Y T+ ED++ L +L +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216
Query: 512 LVRMEKKMLNACLQVTA 528
L EK++L ++ A
Sbjct: 217 LRLGEKEILEKAVKSAA 233
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 93/404 (23%), Positives = 155/404 (38%), Gaps = 81/404 (20%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQRC 276
R + + +EL + + P +I A F A AV S +
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIH-- 206
Query: 277 GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSK 336
G T ++ ++ ++ L C + P N + L+R V
Sbjct: 207 GVTNKL----CSAMMLPLIDMCNHSFQP-----NAHIEEDLSRDAQDVSF---------- 247
Query: 337 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
+++V R + G +I + GP N LL++YGFV DNP+DR+ L
Sbjct: 248 -----------LKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPDNPHDRI----EL 292
Query: 397 NTEDPQYQDKRMVA--QRNGK------LSVQVFH--------------VHAGREKEAISD 434
+ ++ RM+A R G S QV V G +E
Sbjct: 293 RYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLKQLGLADSGESQKVTLGGPEEVDGR 352
Query: 435 MLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 490
+L LR+ + E + ++S + G VS E VL L + T+
Sbjct: 353 LLAALRILHAESQEPLERRELVSLQAWGVESMVSSDNEERVLRTLCGLGAIVFNQFKTTI 412
Query: 491 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 534
EDEA L+D +L R+A Q +K+++ L+ +M L
Sbjct: 413 EEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLESLKKRLMDL 456
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 92/232 (39%), Gaps = 46/232 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTP 280
+K +Y + T AG ++Y +D+ A T + + V S V+
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVI--------- 236
Query: 281 QMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAM 340
T P+E VS+ F +L + K
Sbjct: 237 -------------------------PDTQLPEENVSVLLPFI------DILNHRPLAKVE 265
Query: 341 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
A V +V AG+ I GP+ N +L++NYGF +NP D +V
Sbjct: 266 WRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYGFCLPNNPCDYRIV 317
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 146/360 (40%), Gaps = 80/360 (22%)
Query: 97 LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
L++ SH EK + +Y A A+ D++ G+ VP ++TLE + G +
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213
Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
+ L +S + + L+ Y+M EK++ + S W YI L + P+ ++E E
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRKPE-SQWQIYIDILPKNFSN------FPIFFTEEER 266
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
+L GSP +ILE+ E IK +Y+ + ++ P + F + + +
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYD----------LICKEVPEYV---QFPIREYSEIRMM 313
Query: 266 VQSCVVHLQRCGCTPQMGTESFASHLVH---LQTSCTTTRTPLSPTHNPKEKVSLARRFA 322
V S + +Q G G ++A L H QTS T T
Sbjct: 314 VSSRIFGIQIEGVKTD-GFVAYADMLNHKRPRQTSWTYT--------------------- 351
Query: 323 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
K ++ A++D + GE + G + NS+ +NYGF++
Sbjct: 352 -----------DEKQGFIIEAMEDI---------QRGEQVYDSYGKKCNSRFFLNYGFIN 391
Query: 383 EDNPYDRLVVEAALNTEDPQYQDKR-MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
+N + + ++ +T+D Q K+ M+ + + F V E + + +LR
Sbjct: 392 LNNDANEVPIKVYYHTDDQLKQVKQDMIVDHS---EFKKFRVVENLEDRVMQEFFSWLRF 448
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)
Query: 356 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 410
++ GE +++ G + N +L ++YGFV+ + + D + ++ DP + DK +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 463
+ NG + F + G + ML +LRL + T SV L PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251
Query: 464 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
S E A+ + D ++ L+GY T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 90/213 (42%), Gaps = 30/213 (14%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
+++V R + G +I + GP N LL++YGFV DNP+DR+ L + ++ R
Sbjct: 248 LKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENAR 303
Query: 408 MVA--QRNGK------LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVS 445
M+A R G S QV V G +E +L LR+ +
Sbjct: 304 MIAGLSRTGSPPFSSPASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAE 363
Query: 446 DTS--EMQSVIS--SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 501
E + ++S + G VS E VL L + T+ EDEA L+D +
Sbjct: 364 SQEPLERRELVSLQAWGVESMVSSDNEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKS 423
Query: 502 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 534
L R+A Q +K+++ L+ +M L
Sbjct: 424 LAETSRIAVQFRLTKKRLVVRVLESLKKRLMDL 456
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 92/232 (39%), Gaps = 46/232 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTP 280
+K +Y + T AG ++Y +D+ A T + + V S V+
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVI--------- 226
Query: 281 QMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAM 340
T P+E VS+ F +L + K
Sbjct: 227 -------------------------PDTQLPEENVSVLLPFI------DILNHRPLAKVE 255
Query: 341 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
A V +V AG+ I GP+ N +L++NYGF +NP D +V
Sbjct: 256 WRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYGFCLPNNPCDYRIV 307
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/241 (25%), Positives = 90/241 (37%), Gaps = 41/241 (17%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +L+ + G KS+W PYI L Q E+ LLW L G+ KA
Sbjct: 128 AFFLLEQLVLGDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFA 180
Query: 219 ERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC 276
A K Y + G L Q + A+T+E F+ A
Sbjct: 181 AEAARWKEMY--------LKGMHQLKQSQWENAVNGAYTWERFRWAMTI----------- 221
Query: 277 GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFA-----LVPLGPPLL 331
G+ SF S ++ + ++ L FA L+PL +
Sbjct: 222 -----FGSRSFTSQVLDATLPADKALLQQYRHDDGRDLCVLGELFAQHFGVLLPL-VDIS 275
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 391
+ K A V L V PY++G+ I GP+ N LL+ YGF DNP+D +V
Sbjct: 276 NHKPGAKVEWQARYSFVGLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVV 335
Query: 392 V 392
+
Sbjct: 336 I 336
>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
Length = 1709
Score = 53.1 bits (126), Expect = 3e-04, Method: Composition-based stats.
Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 37/148 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
+AA +D+ ++PN L+++ ++V G + +++ N+ EL C
Sbjct: 954 IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013
Query: 160 -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
LALYLMYEK +G++SFW PY EL+++ + L WS ELA S
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLF 242
I +E N+ + ++F+ S+
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 102/444 (22%), Positives = 175/444 (39%), Gaps = 88/444 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-LALY-LMYEKKQ 170
+ A D+ + F++P V+ LG+ +L E+ C LAL ++ + Q
Sbjct: 42 LVAQSDIGEDEVLFTIPRDAVLNTTTALGSADNPAIL------EMPCWLALTAIILTEGQ 95
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY- 228
+ S W PY+ L + ++S + WSE+EL L S +I AE + E+
Sbjct: 96 QEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEHI 148
Query: 229 ------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQM 282
N + S+ Y +DIP K+ +S P+
Sbjct: 149 SPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPES-----------PED 189
Query: 283 GTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLA 342
G + LV + N +E+ ++ +++PL L A + A L
Sbjct: 190 GDD-----LV---------------SDNEEEENTI---LSMIPLADMLNADADGNNARLC 226
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------A 394
++ +++ +P GE I+ G P S LL YG++ D+ YD V E A
Sbjct: 227 CDNEELEMRSIKPISKGEEILNDYGQLPRSDLLRRYGYISDKYAAYD--VAELSTQSLLA 284
Query: 395 ALNTEDP-------------QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
+L+TE P + + + +AQR G H G + +I D L L
Sbjct: 285 SLSTEQPLLAGGTLQPLSREKLEQRVELAQREGVYEDSYDLTHPGPDDPSIPDELLALLY 344
Query: 442 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 501
+ D + ++ +S + S V L ++R Y T+ D+A+L N
Sbjct: 345 ILLLDNENLAAIETSHASLPSRSKLATSLVGQILTKILESRKQEYATTIEADQAILQADN 404
Query: 502 LHPKKRVATQLVRMEKKMLNACLQ 525
L +KR+A ++ EK +L +Q
Sbjct: 405 LPSRKRMAVEVRLGEKLVLEKAIQ 428
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 164/434 (37%), Gaps = 100/434 (23%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVV-----TLERVLGNETIAELLT----TNKLSEL 157
HR + + A+ED++ G+ F+VP L++ TL+ L NE L T N S
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRL-NEFEKWLDTHGKSLNDSSGW 183
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAE 216
L + LM+E Q K SFW Y+ + G PL W E E G P +
Sbjct: 184 LPLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLND 237
Query: 217 ILERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSC 269
I+ E I+ EY E ++ + GSL E ++ E FK+ AFV S
Sbjct: 238 IIVDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSF 287
Query: 270 VVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPP 329
T SP ++VP+
Sbjct: 288 -------------------------------TEDEESP--------------SMVPMA-D 301
Query: 330 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYD 388
+L + S A L +Q++ R K GE + G N++LL YG+V+ N YD
Sbjct: 302 ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKLGNTELLQMYGYVEIPSNQYD 361
Query: 389 RLVV-----------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 437
L++ + +DP K + R G V F + D++
Sbjct: 362 SLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAEVDAFFMFDKNGLRCGPDLIQ 421
Query: 438 YLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQLADYFKARLAGYPATLSEDE 494
+L++ + SD E++ ++ + P S + + L + + K L ++ED+
Sbjct: 422 FLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSKKTE--KNSLGMTVIDITEDD 478
Query: 495 AMLTDYNLHPKKRV 508
+ N + +K V
Sbjct: 479 TEMDIENFNKRKNV 492
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 131/325 (40%), Gaps = 57/325 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
AA+ + AGD A ++P + T+ L G A + L E AL+L+ E+
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLW-SETELAYLTGSPTKAEILERAEG 223
G+ S W I L G G+ + +P+ W E A L G+P A+ + +E
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQM 282
+ R++ L F A ++ D+ P A+T + F++A
Sbjct: 306 LARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAH------------------- 339
Query: 283 GTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLA 342
E++ S+ + +Q S P P V++ AL P ++ YS
Sbjct: 340 --EAWNSYGMTVQAS------PGEPAATCLPPVAMLCNHALWPH---VVRYSRL------ 382
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDP 401
D ++L V R AGE + V G + N++LL+ YGF NPYD + + L E
Sbjct: 383 -RDGTLRLPVARSVHAGEEVFVSYGAKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVA 441
Query: 402 QYQDKRMVAQRNGKLSVQVFHVHAG 426
R A L++ V AG
Sbjct: 442 DVTKAREAALARAGLTLSPHAVRAG 466
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 93/494 (18%), Positives = 186/494 (37%), Gaps = 80/494 (16%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE 117
S L S+ + + + +L W+ KV ++ K +E +R + AS+
Sbjct: 20 DSESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQ 73
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSF 175
++ G+ VP + ++LE V + I +++ N + + + + + ++ + + SF
Sbjct: 74 FIRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSF 133
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L + P + + A L GSPT ++ + + + EY+ L
Sbjct: 134 WKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNL---- 183
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQ 295
+ V QR G T + V +
Sbjct: 184 -------------------------------KEAVKEFQRYGYT--------YNDFVKFR 204
Query: 296 TSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD-- 353
T + P+ N ++++ LVPL + + + DA +
Sbjct: 205 TLTISRSFPVYIGENEQQQL-------LVPLAD-FINHDNNGFLQYGYSPDADGFFMQAV 256
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
R + GE + G N +NYGF NP ++ + L+ D ++ K + N
Sbjct: 257 RNIQKGEELFYNYGQWSNKYFFMNYGFASLTNPMNQFDFDICLDRNDRMFKMKVELTGGN 316
Query: 414 ---GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP---VSPC- 466
G V R+ A + + ++ + D +++ + + P +P
Sbjct: 317 ICWGNRLVNETDHDTFRQSLAT---VRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKT 373
Query: 467 --MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
+E+A L D + LA + +T+ +D+ L D + +R L EK+++ +
Sbjct: 374 IELEKATFKALRDLLVSELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNI 433
Query: 525 QVTADMIMLLPDVT 538
+V DM++ + D T
Sbjct: 434 EV-CDMMLSVIDKT 446
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 166/441 (37%), Gaps = 49/441 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV-LGNETIAELLTTNKLSE-LACLALYLMYEKKQGK 172
A D+ G FS+P +L ++ L + E +L + L L LM+E G
Sbjct: 36 ALRDIPDGHTLFSIPRALTISTRTCSLPTKFGLEAWRKAQLHQGWVGLILCLMWETAAGS 95
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-L 231
S W Y+ L + ++P+ W+E +L L G+ E L + + + +YNE L
Sbjct: 96 SSKWAGYLDILPNR-------FDTPMFWTEYDLLELKGTSV-VEKLGKLDA-QADYNEKL 146
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
LF I + I ++ V + S S +
Sbjct: 147 IPAVKSRPDLFLPEHLSIHYTLERYHIMGSRILSRSFIVEKWNEDEDGNEAANTSLGSAM 206
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARR----------FALVPLGPPLLAYSSKCKAML 341
++ SC TP P A+VP+ L A A L
Sbjct: 207 -DVEPSCNE-ETPSVPMETEDNSDDNDDDDDDDDEEASDVAMVPMADILNARYQTENAKL 264
Query: 342 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD--------EDNPYDRLVVE 393
D +++V +P K GE I G PN++LL YG VD NP D + ++
Sbjct: 265 FHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHVDFLSLPSGGHGNPGDVVEIK 324
Query: 394 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--------MLPYLRLGYVS 445
A L V K + + + G E I D M+ +++L ++
Sbjct: 325 ADLIISAVS-STPEAVKDDEAKERID-WWLEEGGEDVFILDYEYDLPPVMISFVKLLLLT 382
Query: 446 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT-DYNLHP 504
++ S P +E + D L + RLA YP T+ D+A+LT D L+
Sbjct: 383 QADWEKAREKS----KPPKSRLEGILYDILISTLEKRLAEYPTTIETDKALLTNDTPLNN 438
Query: 505 KKRVATQLVRMEKKMLNACLQ 525
K + +L EK++L+ LQ
Sbjct: 439 KNAIIVRL--GEKEILHGILQ 457
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 81/227 (35%), Gaps = 45/227 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + A+ P W E ++AYL G+ I E
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC---VVHLQ 274
+K E+ + L F A + Q Y FT F+ + V + + L
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224
Query: 275 RCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYS 334
GC Q+ S L + T R +P
Sbjct: 225 PPGC--QLDDFSILQPLFDIANHSMTARYAWDVASDPA---------------------- 260
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+ QLV Y+ GE + G + NS+LL+ YGF+
Sbjct: 261 ------------SCQLVCHDAYQPGEQVYNNYGLKTNSELLLAYGFI 295
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 94/223 (42%), Gaps = 16/223 (7%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
+VP+ +L ++ A + DD + + RP AGE I+ + GP PNS+LL YG+
Sbjct: 228 LGMVPMAD-ILNADAEFNAHVNHGDDELTVTALRPIPAGEEILNYYGPHPNSELLRRYGY 286
Query: 381 VDEDN--------PYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EK 429
V + P+D + V L D ++ + V V +G +
Sbjct: 287 VTPKHSRYDVVEIPWDLVQASVSEHLKIGDDVWKQVQEYVDPEELEDVFVLERESGEPDS 346
Query: 430 EAISDMLPYLRLGYVSDTSEMQSVISSL----GPICPVSPCMERAVLDQLADYFKARLAG 485
E + +R ++++V+ ++ G + P + + + L+
Sbjct: 347 EGQFRTVAEVREISAELEEQLKAVLKAIKKINGDLIPDKRKRDEVFHAVIVSTLQKILSQ 406
Query: 486 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 528
YP + EDEA+L +L ++R+A + EKK+L L+
Sbjct: 407 YPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLKEALEFAG 449
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 117/289 (40%), Gaps = 54/289 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET---IAELLTTNKLSEL------ACLALYLM 165
A+ED+ G+ S+P LV+T E +E +A L + L A L YL+
Sbjct: 32 ATEDILPGEELCSIPVRLVLTTEIARKSEVGRLVAAHLNAVQGERLRVSAGRAILCAYLI 91
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
+++ + +FW PY+R L + R + ++ +L G+ + E+ + I+
Sbjct: 92 HQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQIR 139
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
++ + +L +P P + FT++ F F A S
Sbjct: 140 ESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSS----------------R 177
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLL--AYSSKCKAMLAA 343
SF LV T+ T+ +P + + + L+P G +L Y K L
Sbjct: 178 SFPQTLVQQPTATTSAHA------DPYDLLEIDE--CLLP-GLDMLNHQYRKKITWALDP 228
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
++ V + + G GP+ N +LL+ YGF EDN D +++
Sbjct: 229 STGRLKFVTEDTVEKGTEAFNNYGPKGNEELLMGYGFCIEDNEQDYVMI 277
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V A++ + AG A ++P L++T++ L + E L L E L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555
Query: 171 GKKSFWLPYI 180
G+ S W P++
Sbjct: 556 GQASPWYPFL 565
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 86/213 (40%), Gaps = 20/213 (9%)
Query: 78 LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
LG+ WM G+PP ++L + E + + G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ L A ++E +AL L+ E+ G SFW PYI L
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTE 252
V L+W E +LA L SP A + E+ L+ + A S +F
Sbjct: 227 VNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMRARSDVFD-------PS 279
Query: 253 AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
FTFE + AF+ + S + ++ G G E
Sbjct: 280 VFTFEAYLWAFINIFSRAIRVKIGGKRGPSGEE 312
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 78/189 (41%), Gaps = 46/189 (24%)
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQT 296
M +F +YP P E F E FK +F + S +V L P M ++
Sbjct: 73 MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRL------PSMDGKN---------- 116
Query: 297 SCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDR 354
ALVP ++ +S + + L + +V DR
Sbjct: 117 -------------------------ALVPWAD-MMNHSCEVETFLDYDKSSKGIVFPTDR 150
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQR 412
PY+ GE + + G + N +LL++YGFV ++ NP D + + +L D Y++K + ++
Sbjct: 151 PYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKK 210
Query: 413 NGKLSVQVF 421
G Q F
Sbjct: 211 YGLSGSQCF 219
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 133/340 (39%), Gaps = 49/340 (14%)
Query: 83 SWMHKNG-LPPCKVILKEKPSHNEKHRPIHY--VAASEDLQAGDAAFSVPNSLVVTLERV 139
+W KN L P + S EK Y V A+ D+ + D +P T+ V
Sbjct: 76 AWRVKNNILAPNVEVAYVGGSEKEKGGDDLYRGVKATSDIASEDDLVRLPRE--ATMLVV 133
Query: 140 LGNETIAELLTTNKLSELAC-------LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
G E E +N+L A +AL L+YEK G +S + YI +L +
Sbjct: 134 EGQENPHEEYISNELWAKAGDERWALRVALVLLYEKSLGSRSKFYEYIEQLPK------- 186
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+ E+ W+E E+ L S + E+ +R NE A L Q+Y D +
Sbjct: 187 SFENLGTWTEEEVRELQYS-----VGEKFAKEQRLENE------KACELIQEYARDGGLK 235
Query: 253 AFTFEIFKQAFVAVQSCVVH--------LQRCGCTPQ---MGTESFASHLV----HLQTS 297
E A V+S V LQR P+ +GT FAS L L+
Sbjct: 236 TIEREEVIWALDVVRSRVFSGKIADQEALQR-KLLPRALSVGT-VFASFLTAQTTELKWL 293
Query: 298 CTTTRTPLSPTHNPKEK-VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 356
C L + KE V + L+PL + + K + L + Y
Sbjct: 294 CVFALLALVVFDSTKENDVKTDTAYVLMPL-IDAFNHQTMLKTEFEFTNSEFALKSPKSY 352
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
K GE +++ G PN +LL+ YGFVD+ N D E L
Sbjct: 353 KKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLL 392
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 74/332 (22%), Positives = 124/332 (37%), Gaps = 61/332 (18%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFAS 289
+ G + Q Y+ FT F+ + V +S +++R
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER-------------- 222
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 349
L P + S+ + L +G ++ + L + A +
Sbjct: 223 ---------------LLPEGTKIDDFSVLQ--PLYDIGNH--SWDASYTWNLTSEPSACE 263
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
L+ + Y G+ + G + NS+LL+ YGF+
Sbjct: 264 LICNDSYGPGQQVFNNYGFKTNSELLLGYGFI 295
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
R AL+P+ L ++ C + +A + DR Y+AGE + G N LL YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650
Query: 380 FVDEDNPYDRLVVEAAL 396
FV DNP+D+L ++ L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 123/329 (37%), Gaps = 55/329 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176
Query: 234 VWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ + E F + + Q C+ + SF LV
Sbjct: 177 I--------------LKKEGFPDYRDYTQVLYNWAYCM-----------FTSRSFRPSLV 211
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
L S L P + + S+ + L +G ++ + L + A +L+
Sbjct: 212 -LSESAREYVERLLPEGSKIDDFSILQ--PLYDIGNH--SWDASYTWNLTSEPSACELIC 266
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+ Y G+ + G + NS+LL+ YGF+
Sbjct: 267 NDSYGPGQQVFNNYGFKTNSELLLGYGFI 295
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/350 (21%), Positives = 132/350 (37%), Gaps = 86/350 (24%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+L L SW ++G K+ L++ + + L G+ S+P SL +T+
Sbjct: 27 ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ VL AL+LM E+++G SFW Y+R L V++
Sbjct: 82 DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114
Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL W +E E L G T +L R +++++ E L + +P +
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG 166
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSP--- 307
TFE + A ++ S Q G + C +R + P
Sbjct: 167 --GVTFEDYLWAMSSIWSRSFDYQEPGPD---------------DSPC--SRRAMVPVIN 207
Query: 308 --THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAG 359
H+P SL+ ++ + ++ + + + +++ R Y A
Sbjct: 208 AANHDPSAADSLSE----------MIEFQAQEGGLSMGIGEPGRARGTLRVSAGRDYAAR 257
Query: 360 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 409
E + G N+KLL +YGFV NPY L + DP + K+ +
Sbjct: 258 EQFFILYGRYSNAKLLYSYGFVLASNPYGGLDYWVRVPQTDPGFAWKQAL 307
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/354 (21%), Positives = 131/354 (37%), Gaps = 57/354 (16%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + LQ G
Sbjct: 22 SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
S+P S +++ + V+ + +T K S L L +L+ EK G +S W PY+ L
Sbjct: 76 SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P+ W E E+ L P +A+ E+ ++ + + LF +
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTP 304
+I FT+ A+ V + V+L+ H Q C +
Sbjct: 187 AVENI----FTYSALLWAWCTVNTRAVYLR------------------HRQLRCFSAE-- 222
Query: 305 LSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 364
+ +LA L+ P + + KA ++V + E + +
Sbjct: 223 -------PDTCALAPYLDLLNHSPDV-----QVKAAFNEKTRCYEIVAVSSCRKHEEVFI 270
Query: 365 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 418
GP N +LL+ YGFV NP+ + V + + DK+M N K+S+
Sbjct: 271 CYGPHDNHRLLLEYGFVSTRNPHACVYVSRDILVKYLPSTDKQM----NKKISI 320
>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
Length = 496
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 125/308 (40%), Gaps = 49/308 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
A +DL G F++P +L ++ L + G E L KL + A L L +M+E
Sbjct: 42 ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QGK+S W Y+ L A ++P+ W+E +L+ L G+ ++ + E +R+Y
Sbjct: 98 AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148
Query: 229 N-ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
+ ++ LF Q ++ E + + S +++ + +
Sbjct: 149 DSKIKPAIAKRPELFAQGEV-----YYSLERYHTMGSRILSRSFTVEKRDGEEEEEDGAN 203
Query: 288 ASHLVHLQTSCTTTRTPLSPTH--------------NPKEKVSLARRFALVPLGPPLLAY 333
A + H ++ + +P+ + E A A+VP+ L A
Sbjct: 204 ADNAKHAESENDSMDVDETPSKANDLEQEENEDEDEDNDEGEDDASDIAMVPMADMLNAR 263
Query: 334 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD--------EDN 385
A L + +++ +P + GE I G PN++LL YG VD E N
Sbjct: 264 FGTENAKLFHEKEVLKMTTTKPIRKGEQIWNTYGDLPNAELLRRYGHVDLLKLPDGREGN 323
Query: 386 PYDRLVVE 393
P D VVE
Sbjct: 324 PGD--VVE 329
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 90/406 (22%), Positives = 159/406 (39%), Gaps = 51/406 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ + F++P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G +S W Y + L R+ ++ + W+ +EL L GS +I +
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGAD------ 151
Query: 230 ELDTVWFMAGSLFQQYPYDIP----TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
+++ M + + +P P ++ + QA ++HL MG+
Sbjct: 152 --ESILEMIAPIVRAHPSLFPPVDGLPSYDGDAGTQA-------LLHLAHT-----MGSL 197
Query: 286 SFASHL-VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
A + +E+ L++ +VPL L A + + A L
Sbjct: 198 IMAYAFDIEKPEDEDEEGDGEGGYMTDEEEEQLSK--GMVPLADLLNADADRNNARLFQD 255
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AAL 396
++A+ + +P GE I G P + LL YG+V D PYD + V A L
Sbjct: 256 ENALVMKAIKPIAKGEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGL 315
Query: 397 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVIS 455
+ DP+ Q L +E + ++D+LP L + + T + +
Sbjct: 316 SDSDPEKQPPLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQ 375
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 501
P P A LA + + A Y T+++D+ +L+ N
Sbjct: 376 QRSKNKPPKPAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLN 421
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/292 (21%), Positives = 113/292 (38%), Gaps = 59/292 (20%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EYN + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
P+ + F+ F A+ V V + ++ F H
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRPFVDFCNHD 173
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
S T +H+ CK DA ++ +R
Sbjct: 174 PVSHATV------SHD-------------------------TCK-------DAT-IIAER 194
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
Y GE + + G + N+ L ++YGFV +N D+ + + DP + K
Sbjct: 195 DYTKGEEVFISYGKRSNAVLAVDYGFVLPNNLSDQAELWMEIPWNDPLREKK 246
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 120/293 (40%), Gaps = 42/293 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A + G+ FS+P LV++ E N + LL+ + L EL L L ++YE
Sbjct: 50 VVAQSAIVEGEELFSIPRDLVLSTE----NSKLKSLLSQD-LGELGPWLSLMLVMIYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
++S W PY R ++ + WS EL L GS + I R+
Sbjct: 105 LREQSAWAPYYRIFPEN-------FDTLMFWSPAELQELQGSAI-------VDKIGRQGA 150
Query: 230 ELDTVWFMAGSLFQQYPYDIP----TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
E +++ M + + P P ++ E QA + + + L +
Sbjct: 151 E-ESILQMIAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHVMGSL--------IMAY 201
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
+F V+ + T +++ S +VPL L A + + A L +
Sbjct: 202 AFDIEKVNDEDDEDNEGEDGYMTDEEEDQSSKG----MVPLADILNADADRNNARLFQEE 257
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 397
D++ + +P AG+ I G P S LL YG+V D PYD V+EA+L+
Sbjct: 258 DSLVMKAIKPIAAGDEIFNDYGELPRSDLLRRYGYVTDNYAPYD--VIEASLD 308
>gi|336371990|gb|EGO00330.1| hypothetical protein SERLA73DRAFT_89272 [Serpula lacrymans var.
lacrymans S7.3]
Length = 499
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 100/449 (22%), Positives = 161/449 (35%), Gaps = 58/449 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVV-TLERVLGNETIAELLTTNKLSE-LACLALYLMYEKKQGK 172
A D+ G F++P SL + T L + AE KL E A L L +M+E+ Q
Sbjct: 34 AVSDIPEGHTLFTLPRSLTLSTRTSYLPSNMGAESWKKFKLDEGWAGLILCMMWEEAQES 93
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W Y+ L + +P+ WS +L L G+ +I E +R+Y D
Sbjct: 94 KSKWSEYLASLPS-------SFTTPMFWSSEDLFELRGTAVVDKI--GREDAERDY--YD 142
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC-GCTPQMGTESFASHL 291
+ S +P + + ++ + + S +++ G + +S
Sbjct: 143 KLLPAIQSRLDLFPPTLIDQHYSLAKYHVMGSRILSRSFQVEKWDGPASGEHDANVSSDS 202
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKV----------------SLARRFALVPLGPPLLAYSS 335
++ + T + +N E + A+VP+ L A
Sbjct: 203 TAMEVDGEVSVTQVHGDYNDDEDISRDIDDGSSDDDDDETEDPSDVAMVPIADLLNARYG 262
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD--------EDNPY 387
A L + ++++ +P AGE I G PNS LL YG VD E NP
Sbjct: 263 SENAKLFYEERHLRMISTKPIMAGEQIWNTYGDLPNSDLLRRYGHVDLVSLPEGGEGNPA 322
Query: 388 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--- 444
D + + A L V+Q +G L+ E D + + V
Sbjct: 323 DVIEIRADLVVS--------AVSQLSGTLTSDSNQTRIDWWLEEGGDDVFTVETDLVIPE 374
Query: 445 --------SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 496
SE + P V+ + F RL YP TL D+ +
Sbjct: 375 ALISLLRTLLLSEADWEKARNKSKPPKGKLDNPQVISLVTAVFHQRLNAYPTTLEADDEL 434
Query: 497 LTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
L +L KR A + EKK+L ++
Sbjct: 435 LRQ-DLSLNKRHAIMVRCGEKKILTGVIK 462
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/350 (22%), Positives = 129/350 (36%), Gaps = 62/350 (17%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +G V +KE S +++ + AS+D+ +P++++++ V
Sbjct: 36 LVDWGRMHGANIENVEIKETASDDDR-KLTRGAYASKDIPPNSEICFIPSTILLSESDVR 94
Query: 141 GNETIAELLT--------TNKLSE---------LACLALYLMYEKKQ-GKKSFWLPYIRE 182
+E +LT K+S+ L +A +++++ S WLPY+
Sbjct: 95 ASEIGKAILTYIDEHQDAKQKISDKIKHPHAEILLAMAAFIVHQVSLPTADSHWLPYLAS 154
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYNELDTVWFMAGSL 241
L + PL+W+ + L G + ++ ER E I+ N V G
Sbjct: 155 LPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTKVVENACGHY 204
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTT 301
F PT A T + + A ++ S + Q G+ + L C
Sbjct: 205 F-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWIGLSEICLF- 256
Query: 302 RTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 361
P+ N K + R + G + CK G
Sbjct: 257 --PILDMFNHKRGYRVEWR--MTEKGVSFITPDGICK--------------------GSE 292
Query: 362 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
++ GP+ N LL NYGFV E+NP D V L EDP Y K+ V +
Sbjct: 293 LLNNYGPKGNENLLSNYGFVIENNPEDYFKVFLGLQQEDPLYTAKKAVLE 342
>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
nidulans FGSC A4]
Length = 484
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/292 (23%), Positives = 122/292 (41%), Gaps = 40/292 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
V A D+ + F++P LV++ N + +LL+ + +L L L +++E Q
Sbjct: 50 VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G KS W PY + L + ++ + WS EL L GS E I ++ E
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGSAV-------VEKIGKQGAE 151
Query: 231 LDTVWFMAGSLFQQYPYDIP----TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES 286
+++ + + + P P ++ ++ QA + G MG+
Sbjct: 152 -ESILKLIIPVVRANPALFPPINGLASYDGDVGAQALL------------GLAHTMGSLI 198
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 346
A + ++T + E+ + + +VPL L A + + A L ++
Sbjct: 199 MA-YAFDIETPENEDEREGEDGYLTDEEEEQSSK-GMVPLADMLNADAYRNNARLFQEEE 256
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 398
++ + +P +AGE I G P S LL YG+V DN V+E +L+T
Sbjct: 257 SLVMKAIKPIRAGEEIFNDYGEIPRSDLLRRYGYV-TDNYASYDVIELSLDT 307
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 88/196 (44%), Gaps = 21/196 (10%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ +G K+ E PS+ + I A +D+ + + S+P L++T + L +
Sbjct: 74 AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
I ++ N S+ L + LM+E+ +G+ SF+ PY+ L R ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TE 252
+E L++L E+ R + Y L AG LF + P D +
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGAD 242
Query: 253 AFTFEIFKQAFVAVQS 268
++ E F+ A++ +Q+
Sbjct: 243 PYSLENFRFAWLTIQA 258
>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
Length = 488
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 101/478 (21%), Positives = 187/478 (39%), Gaps = 73/478 (15%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ LV++++ +
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
L GS +I + A DT+ L Q P P +
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPRHFPPRPNMPPLNSSDSQN 186
Query: 266 VQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVP 325
C+ H +MG+ A + ++ + + ++ A+ +VP
Sbjct: 187 ALLCLAH--------RMGSIIMA-YAFDIEKTDEVDEDTAEDGYMTDDEDEPAK--GMVP 235
Query: 326 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 385
L A + + A L + + + + +GE I G P + LL YG+V DN
Sbjct: 236 LADIFNADAQRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYV-TDN 294
Query: 386 PYDRLVVEAALNT---------EDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI 432
VVE +L++ +P + R+ N + + + + G K+ I
Sbjct: 295 YAQYDVVEFSLDSICKVAGLPDSEPSSTNPRLELLDNLDMLEEGYSIPRIPPNGTLKDTI 354
Query: 433 -SDMLPYLRLGY--VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 489
D L LR + D + +++ + P S E ++L L R + YP +
Sbjct: 355 PKDFLVLLRALTLPIEDLNRLKARNKAPKPEFSTS---EASLLRSLV---TCRQSEYPTS 408
Query: 490 LSEDEAML---------TDYNLHPKKRVATQLVRMEKKMLNACLQV--TADMIMLLPD 536
+ EDE++L + ++ +K++A Q+ + EK++L L + T D ++ PD
Sbjct: 409 VQEDESILRCLEQQNGYINDSIPIRKKMAVQVRKGEKEILTQILTLLDTQDTHLVQPD 466
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/275 (21%), Positives = 115/275 (41%), Gaps = 35/275 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYEKKQ 170
V A ED++ + F VP + ++ +E ++ E+ + + L + L++E K
Sbjct: 41 VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G+KS W PY++ L ++ QL + W++ EL L S ILER K +
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSL----ILERVGADKAKEM 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFAS 289
+ V + S ++ +++ ++ + F+ V S ++ S
Sbjct: 152 FENVVDIINKSTLKE------KDSYILKVTWENFLLVASIIM-----------------S 188
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 349
+ +Q + E ++ ++PL L + + KC A L + ++
Sbjct: 189 YSFDVQDYVEEKEGGTDEEEDDNESENVRSLKCMIPLADTLNSNTHKCNAHLIHGSNLLE 248
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 384
+ + K GE I G PNS++L YG+++ D
Sbjct: 249 MRSIKAIKKGEQIYNIYGDHPNSEILRRYGYIEPD 283
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 124/317 (39%), Gaps = 76/317 (23%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
+ A ED++ G+ +P++ ++T+ER + + +L E + LA +L + +
Sbjct: 25 LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82
Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
G +S + Y++ L R+ G L W E ++ L GSP++ ER +
Sbjct: 83 NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
E+ ++P P + AF + S ++ L P G E
Sbjct: 136 IEEIRA----------EFPQLTPG------ALRWAFDVLFSRLIRL------PNRGGE-- 171
Query: 288 ASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 347
ALVP +L + C A +
Sbjct: 172 ---------------------------------LALVPWAD-MLNHKPGCNAYIDDSGGK 197
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQD 405
V L DR YK GE + G +P+++LLI+YGF E +NP D + ++ D +Y D
Sbjct: 198 VCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGENPDDEYEITLGIDPND-RYAD 256
Query: 406 KRMVA-QRNGKLSVQVF 421
+ A ++ G V+ F
Sbjct: 257 AKAAALEKIGLRPVESF 273
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 17/150 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL G +P + V++++ N IA++L +++ L + +MYE GK
Sbjct: 4 VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW Y+ EL ++ PL W+E E + L G+ + E E + ++
Sbjct: 60 ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFET-- 110
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L +Q+ + ++FT E F+ A
Sbjct: 111 ----HVPPLVEQHADRLRADSFTLESFRVA 136
>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
Length = 497
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 130/309 (42%), Gaps = 24/309 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
K S W PY+ L + ++ E E L ++ E ++ +
Sbjct: 115 KTSRWHPYLMHLPH-------TYDVLAMFGEFEKHAL-------QVDEAMWVTEKAMLKA 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL--QRCGCTPQMGTESFAS 289
+ W A SL Q + + FTF+ + +A + S +H+ GC +G + F
Sbjct: 161 KSEWKEAHSLMQDLMF--KPQFFTFKAWVRAAATISSRTLHIPWDEAGCLCPVG-DLFNY 217
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-DAV 348
++ S L + + + L +V L ++S + D +A
Sbjct: 218 DAPGIEPSGIEDLDRLLSNTSIPDTIVLNGDKNIVVDAEQLDSHSWRLTDGGFEEDANAY 277
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDK 406
YK G+ +++ G N +LL +YGF+ ++NP D++ + E AL + + +
Sbjct: 278 CFYAREHYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSS-TSWSKE 336
Query: 407 RMVAQRNGK 415
+ NGK
Sbjct: 337 SLYIHHNGK 345
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 120/301 (39%), Gaps = 60/301 (19%)
Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
V A+E++ GD FS+P + ++T + T + EL ++ + + L +L
Sbjct: 45 VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
E+++G S W PYIREL +PL WS E L G+ R G
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT--------RLGGA 149
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
+ F +L Q +P AF+A+ R T
Sbjct: 150 TK---------FHDCALLQLTEVCVP-----------AFIAI-------LRAQLILSANT 182
Query: 285 ESFASHLVHL-QTSCTTTRTPLSPTHNPKEKVSL----ARRFALVPLGPPLLAYSSKCKA 339
++ AS + L Q + + R S + SL R ALVPLG +L +S +
Sbjct: 183 KAIASGAISLAQDALSPDRLAWSHSCVSSRAFSLFLNGQRTIALVPLG-DMLDHSPDAQI 241
Query: 340 MLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
D A Q ++ DR AG + G + N +L++ YGF + + + L V A+
Sbjct: 242 EWRTDDTAGQFLIISHDR-LPAGSIMFNNYGAKSNEELILGYGFFMKSSVLETLYVRLAV 300
Query: 397 N 397
+
Sbjct: 301 D 301
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 55/113 (48%), Gaps = 5/113 (4%)
Query: 316 SLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
S+ + ALVP +L +S + L + DRPY+ GE + + G + N +
Sbjct: 20 SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTDRPYQPGEQVFISYGKKSNGE 78
Query: 374 LLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 424
LL++YGFV + NP D + + +L D Y++K + ++ G Q F +
Sbjct: 79 LLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLKKYGLSGSQCFPIR 131
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 121/316 (38%), Gaps = 63/316 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L +P EIF++ L C E + S+ + +
Sbjct: 265 ------LCTNFP----------EIFRKDVCTWDD---FLWAC--------ELWYSNSMMI 297
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLV 351
S T L P + P +L Y VD+A ++
Sbjct: 298 VLSSGKLSTCLVPVAGLLNH----------SVSPHILNY--------GRVDEATKSLKFP 339
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVA 410
+ RP AGE + G P S LL YGF+ DNPYD + ++ D D+ + A
Sbjct: 340 LSRPCDAGEQCFLSYGKHPGSHLLTFYGFLPRGDNPYDVIPLDL-----DTSADDEDITA 394
Query: 411 QRNGKLSVQVFHVHAG 426
Q + S Q H+ G
Sbjct: 395 QSSATTS-QTTHMVRG 409
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 80/416 (19%), Positives = 163/416 (39%), Gaps = 96/416 (23%)
Query: 139 VLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+L NE I E+ + +N + LA+ L+ E + KKSFW PYI L
Sbjct: 108 ILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFPYIGILP 167
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ L W EL ++ GS + +K +Y ++ +FQ+
Sbjct: 168 EEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNK------EVFQR 216
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTP 304
P++ F ++ ++ + + S + LQ T +
Sbjct: 217 NRLLYPSKIFNYQNWEWSMSILLSRTISLQE------------------------TKKVV 252
Query: 305 LSP-----THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYK 357
L P HNP ++ R +PL D+ ++VV D+
Sbjct: 253 LIPYIDLLNHNPFSSSFISYR--KIPLS------------------DSKEIVVYSDKNCN 292
Query: 358 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 417
+ + + G + N +LL YGF+ E NPYD +++ +++ +D +++K+ N K
Sbjct: 293 KFDQLYISYGQKSNLELLNLYGFIAERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFF 352
Query: 418 VQVFHVHAGREKEAISDMLPYLRL----GYVSDTSEMQSVISSLGPICPVSPC----MER 469
+ + + + +M+ ++++ ++D + + I + + C +E+
Sbjct: 353 YNSYPIFLYKYPD---EMIEFIKICLFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEK 409
Query: 470 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
++ DY R L E+ ++D ++++ + +EKK+LN L+
Sbjct: 410 SLNSNYNDYENLR----NIMLKENLLHISD-----NQKISIKYNALEKKILNRFLE 456
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 99/402 (24%), Positives = 149/402 (37%), Gaps = 73/402 (18%)
Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
DA + LV + ER + L E LAL L+YE+++G KS W P+I
Sbjct: 64 DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHI-- 121
Query: 183 LDRQRGRGQLAVESP----LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
L +P L WSE ELA L GS + LE A + + +
Sbjct: 122 --------ALLPATPPHALLRWSEAELAELAGS----DALELANRWRSQVS--------- 160
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSC 298
S F + + KQ AV++ + P + E F+ + + + C
Sbjct: 161 -SDFSEIVDKSRAAVEESDPGKQLSAAVKASLRF-------PWLDLEGFSWAVSMIWSRC 212
Query: 299 TTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAY--------SSKCKAMLAAVDDAVQL 350
VS++R+ A PP+ A+ DDA
Sbjct: 213 ----------------VSVSRKGA-----PPIKAFLPVVDMHNHDPGAPENHGFDDARDG 251
Query: 351 VVDR---PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
V R K G+ + + PN+ LL+ YGF + + + A L+ E P Y+ KR
Sbjct: 252 FVLRRTGNAKKGDELKLCYDGLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKR 311
Query: 408 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 467
+ KL + A + A D LP RL ++ ++ + L + S
Sbjct: 312 AALE---KLGLGATADGAAPFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEAT 365
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 509
RA L A LA Y + ED A L D P+ R+A
Sbjct: 366 ARAAAGDLVAACDALLAAYRGSEDEDAAALADPATPPRLRLA 407
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 93/433 (21%), Positives = 168/433 (38%), Gaps = 75/433 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+++ + + +L + LS LALYL++ K
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R L + R +A + + +++ EL GS A + + +
Sbjct: 87 ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
+Y +L +L Q+ P + FT E +K A ++ S +
Sbjct: 137 DDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAM-------------- 176
Query: 286 SFASHLVHLQTSCTTTRTPLSP--THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA 343
FA +T+ PL+ H+P K +C A
Sbjct: 177 DFAVS----ETASVRLVAPLADMLNHSPDVK---------------------QCHAYDPT 211
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 403
D + ++ + Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y
Sbjct: 212 SGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLY 270
Query: 404 QDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPI 460
+ K + G S + K+ + +++L YLR+ + D S + + L G
Sbjct: 271 EQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLRLVNGTD 326
Query: 461 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKK 518
V+ E VL L D + L G+ L + EA L DY A + E++
Sbjct: 327 GKVNDGNEIQVLQFLVDSIGSLLEGFGIPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQR 386
Query: 519 MLNACLQVTADMI 531
+L + D++
Sbjct: 387 VLTRAKKTAEDLL 399
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 88/195 (45%), Gaps = 9/195 (4%)
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 390
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 253 AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRHAY 312
Query: 391 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 449
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 313 TLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTDAF 370
Query: 450 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 505
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL +
Sbjct: 371 LLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLDSR 429
Query: 506 KRVATQLVRMEKKML 520
+A + EK +L
Sbjct: 430 LAIAVGIREGEKMVL 444
>gi|336384741|gb|EGO25889.1| hypothetical protein SERLADRAFT_437599 [Serpula lacrymans var.
lacrymans S7.9]
Length = 499
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 72/309 (23%), Positives = 118/309 (38%), Gaps = 38/309 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVV-TLERVLGNETIAELLTTNKLSE-LACLALYLMYEKKQGK 172
A D+ G F++P SL + T L + AE KL E A L L +M+E+ Q
Sbjct: 34 AVSDIPEGHTLFTLPRSLTLSTRTSYLPSNMGAESWKKFKLDEGWAGLILCMMWEEAQES 93
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W Y+ L + +P+ WS +L L G+ +I E +R+Y D
Sbjct: 94 KSKWSEYLASLPS-------SFTTPMFWSSEDLFELRGTAVVDKI--GREDAERDY--YD 142
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC-GCTPQMGTESFASHL 291
+ S +P + + ++ + + S +++ G + +S
Sbjct: 143 KLLPAIQSRLDLFPPTLIDQHYSLAKYHVMGSRILSRSFQVEKWDGPASGEHDANVSSDS 202
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKV----------------SLARRFALVPLGPPLLAYSS 335
++ + T + +N E + A+VP+ L A
Sbjct: 203 TAMEVDGEVSVTQVHGDYNDDEDISRDIDDGSSDDDDDETEDPSDVAMVPIADLLNARYG 262
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD--------EDNPY 387
A L + ++++ +P AGE I G PNS LL YG VD E NP
Sbjct: 263 SENAKLFYEERHLRMISTKPIMAGEQIWNTYGDLPNSDLLRRYGHVDLVSLPEGGEGNPA 322
Query: 388 DRLVVEAAL 396
D + + A L
Sbjct: 323 DVIEIRADL 331
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 106/501 (21%), Positives = 189/501 (37%), Gaps = 132/501 (26%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
D + G ++ +++D ++++W + P + EK + +S D++
Sbjct: 12 DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
G+ S+P +++ +V + + L NK+ +L A LY Y +
Sbjct: 61 EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W + E+ + S L +S+ E L GSP +++ + K +
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASH 290
L SLF++ + A T++ K A+ V S ++
Sbjct: 167 L-------KSLFKEQQFS--KCAMTYDRLKWAYSVVDSRKIY------------------ 199
Query: 291 LVHLQTSCTTTRTP-LSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 345
T P L NP V LA P L Y + + AA D
Sbjct: 200 ----------TEAPNLDANGNPFITVVLA----------PFLDYFNHAEDAQAAYDFDYD 239
Query: 346 -DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLV---VEAALNT-- 398
A+++V +P K GE I + G Q NS LLI+YGF+D+ + V VE LNT
Sbjct: 240 ESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIP 299
Query: 399 -EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
DPQ +K + + + + +++F E IS L Y
Sbjct: 300 ASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKISKYLSYKNF-------------- 345
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQL 512
++L L ++ YP T+ ED A++ T++ ++ + +
Sbjct: 346 --------------SLLPYLKSLIDMKMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSII 391
Query: 513 VRMEK----KMLNACLQVTAD 529
+R+++ K + A +QV D
Sbjct: 392 MRLQEKETLKEIGALIQVKID 412
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 117/292 (40%), Gaps = 36/292 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FSVP ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
QG+KS W PY R L + ++ + WS EL L S T E + R+ + N
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157
Query: 230 ELDTVWFMAGSLF--QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
+ + LF Q +A + + + + + + G E
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217
Query: 288 ASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 347
A+ ++ + P +VPL L A + + A L + A
Sbjct: 218 ANDESYMTDDEEEEQLPK----------------GMVPLADLLNADADRNNARLYQEEGA 261
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNT 398
+ + +P + GE I G P + LL YG+V ++ YD V+E +L T
Sbjct: 262 LVMKAIKPIQQGEEIFNDYGEIPRADLLRRYGYVTDNYAVYD--VLELSLET 311
>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 489
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 67/144 (46%), Gaps = 16/144 (11%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILER-AEG 223
S +EL L GS +I ++ AEG
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEG 152
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 154/397 (38%), Gaps = 78/397 (19%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F YP + A + ++HL
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIF--YP-------------EGAAQPTEDELLHL 188
Query: 274 -QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLA 332
R G T ++ L+ + + + L +VP+ L A
Sbjct: 189 AHRMGST-------IMAYAFDLENDDENENEEDGWVEDREGRTML----GMVPMADTLNA 237
Query: 333 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLV 391
+++ A + + + KAG+ I+ + GP P S+LL YG+V E + YD +
Sbjct: 238 -NAEFNAHINHGESLEATAIRADIKAGDQILNYYGPLPTSELLRRYGYVTPEHSRYDVVE 296
Query: 392 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSE- 449
V L E V + LS + + V + + E I D R D+ E
Sbjct: 297 VPWTLVKE---------VIVSSLSLSAEAWKQVESQIDDEEIEDYFVIER-----DSGEP 342
Query: 450 -------MQSVISSLGP---------ICPVSPCMERAVLDQ----------LADYFKARL 483
+V+ + P + V + D+ +A+ K RL
Sbjct: 343 GPDGRFTAPAVLREVSPELVEQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRL 402
Query: 484 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
A YP ++ DE +L + +L ++R+A + EKK+L
Sbjct: 403 AQYPTSIETDEKLLAEADLPARRRMAVVVRLGEKKLL 439
>gi|66819805|ref|XP_643561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
gi|60471605|gb|EAL69561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
Length = 526
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/314 (20%), Positives = 126/314 (40%), Gaps = 46/314 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V + ++L+ D +P S+++++ +I+ +L K+ ++ L++E G+
Sbjct: 69 VISLKELKVDDIVAKIPKSIILSIHT----SSISNILEKYKIENNIGTSIALIHEASLGE 124
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W YI L R+ V+ P+LW L G+ + + + I + Y ++
Sbjct: 125 KSKWYGYISSLPRK-------VDVPILWDSESRKLLKGTAIEDVLNDDDILINQVYADV- 176
Query: 233 TVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQS---CVVHLQRCGCTP--QMGTES 286
+ L + +P E ++ E FK A + S CV P +
Sbjct: 177 ----IESILSKNHPEIFGDKELYSIENFKIANSIISSRAFCVDSYHGDSLVPLADIFNHQ 232
Query: 287 FASHLVHLQTS------CTTTRT----PLSPTHNPKEKVSLAR--RFALVPLGPPLLAYS 334
A VH++++ C + +T + H+ R + A +P +
Sbjct: 233 TAREHVHIESNGDVCNKCGSIKTCKHRKVVTQHHTVNSTKGKRTHKVAGIPSSKKHIHKG 292
Query: 335 SKC------------KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
+ C ++ D+ + + V + +A + + G N+ LL YGF++
Sbjct: 293 NCCSTTTTKTNEEDKDTIIEEDDEHLYIKVVKGVEANKEVYNTYGDHDNAILLSKYGFLE 352
Query: 383 EDNPYDRLVVEAAL 396
DNP DRL ++ L
Sbjct: 353 MDNPCDRLSIDKQL 366
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 112/292 (38%), Gaps = 60/292 (20%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
+ A D+ A F++P +S++ T L NE + E T++
Sbjct: 49 IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L L L+YE QG S W PY+ L A ++P+ WS TELA L S
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161
Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
++ E A+ + R D V+F G Q+ D FE+ + A+ +
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFELAHRMGSAIMAY 214
Query: 270 VVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPP 329
L++ + +E Q R + +VP+
Sbjct: 215 AFDLEKDDDANEEASE---------QDEWVDDREGRT-------------MLGMVPMA-D 251
Query: 330 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+L ++ A + D++ R KAGE I+ + GP PN +LL YG+V
Sbjct: 252 MLNADAEFNAYINHGADSLTATALRTIKAGEEILNYYGPLPNGELLRRYGYV 303
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 115/280 (41%), Gaps = 51/280 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
+ AS+D+ + F +P S ++ N T ++L T KL EL+ L + +
Sbjct: 90 IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142
Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAE 222
MYE K + +S W Y L L + W++ EL++LT S + + AE
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQM 282
+ R LDT+ ++ DI TE I + F+ + S ++
Sbjct: 198 TMYRRI--LDTI--------NEFNEDILTEKLG-SISWEEFLYIPSIIMAY--------- 237
Query: 283 GTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLA 342
SF + + P+ S+ +PL L A + KC A L
Sbjct: 238 ---SFDVEIKNDDDENEGDEEFDEKEEEPELLKSM------IPLADTLNADTHKCNANLT 288
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
D+++++ +P K GE + G PNS+LL YG+V+
Sbjct: 289 YDKDSLKMLAIKPIKKGEQVYNTYGELPNSELLRKYGYVE 328
>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 478
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 133/321 (41%), Gaps = 44/321 (13%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ +G P KV K + + V A D+ G+ F++P V++ + N
Sbjct: 22 SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ +LL+ + +L L L ++YE G +S W Y + L R+ ++ + W
Sbjct: 76 SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP----TEAFTF 256
S +EL L GS I++R I +E E +++ M + + P P ++
Sbjct: 129 SPSELQELQGSA----IVDR---IGKEGAE-ESILEMIAPIVRANPSLFPPVDGLASYDG 180
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVS 316
+ QA + + H+ MG+ A + + ++
Sbjct: 181 DAGTQALL----NLAHV--------MGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQ 228
Query: 317 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 376
L++ +VPL L A + + A L + + + +P AG I G P + LL
Sbjct: 229 LSK--GMVPLADLLNADADQNNARLFQEETGLVMKAIKPISAGAEIFNDYGEIPRADLLR 286
Query: 377 NYGFV-DEDNPYDRLVVEAAL 396
YG+V D +PYD VVE +L
Sbjct: 287 RYGYVTDNYSPYD--VVELSL 305
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 114/281 (40%), Gaps = 45/281 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQG 171
+ A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 62 LMATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAG 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
++S W PY+ + P+ W E E+ +L +P + + LE+ K E EL
Sbjct: 122 ERSQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KTEVQEL 169
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
T + Q D + +T++ + A+ V + V+++ T Q L
Sbjct: 170 HTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKH---TQQ-------DRL 219
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
+ Q C H+P+ +V +A + ++
Sbjct: 220 LAQQDVCALAPYLDLLNHSPEVQV----------------------EAEFSKDRRCYEIR 257
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
+ + + + GP N +LL+ YGFV +NP+ + V
Sbjct: 258 TNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANNPHRSVYV 298
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 126/308 (40%), Gaps = 47/308 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FS+P ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
QG+KS W PY R L + ++ + WS EL L S T E + R+ + +
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSNAEESIRD 157
Query: 230 ELDTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
+ + LF P I +A ++ + + +++ G
Sbjct: 158 SIAPILAKRPDLFPPPPGLASWEGIAGDAALIQVGHVMGSLIMAYAFDIEKAEDDDDEGE 217
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+ S++ + +E+ L + +VPL L A + + A L
Sbjct: 218 VNDESYMT-----------------DDEEEEQLPK--GMVPLADLLNADADRNNARLYQE 258
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRL------VVEAA-L 396
+ A+ + +P + G+ I G P + LL YG+V ++ YD L + EAA L
Sbjct: 259 EGALVMKAIKPIQKGDEIFNDYGEIPRADLLRRYGYVTDNYAVYDVLELSLETICEAAGL 318
Query: 397 NTEDPQYQ 404
DP+ Q
Sbjct: 319 ANADPESQ 326
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/232 (22%), Positives = 94/232 (40%), Gaps = 48/232 (20%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
+K EY+ +A S+ + Y + E++T++IF C
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIF----------------CWAY 198
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSK--- 336
+ + +F S ++ S P+ +E+ + PL+ +S+
Sbjct: 199 TLIASRAFTSRVLDAYLSN-------HPSLKQEEEFQIML---------PLVDFSNHKPL 242
Query: 337 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
K A ++L V P GE + GP N +L+ YGF DNP D
Sbjct: 243 AKIEWQAEATEIRLKVVEPTFTGEEVHNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
Length = 1233
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 92/413 (22%), Positives = 150/413 (36%), Gaps = 119/413 (28%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H A+ D+ G+ VP SL +T V G + E+L + SEL LAL+LM E+ +
Sbjct: 871 HGFVAARDVGQGEVLLQVPGSLAITAVDV-GKDAQLEVLARGR-SELVGLALWLMQERAK 928
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
A +P+LW + E L GSP E R + +++E+
Sbjct: 929 ----------------------ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQ 966
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFAS 289
++ + + YP + E QAF+ S V+ +
Sbjct: 967 DIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVL-----------------A 1001
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDA 347
H +L PK A+ FAL+PL L S A+L +A
Sbjct: 1002 HAAYL----------------PK-----AQCFALLPLVGGLCRTGSSSGALLDYDLEREA 1040
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
V +V R G+ + ++C L + A+L D Y KR
Sbjct: 1041 VTVVAQR--TPGQEVALYC-----------------------LFMAASLVAADRLYTTKR 1075
Query: 408 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 467
+ + G F + ++ A ++ + E ++I SP
Sbjct: 1076 EILEELGLGVKAEFPIF--EDRLATQQLINF----------EQDTII---------SPEN 1114
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
E +L L + R+ Y +D L +L P++R+A QL EK++L
Sbjct: 1115 EYEILQLLMGDLRDRIQAYATEFDDDIKDLQRTDLTPRQRLAAQLRLGEKRIL 1167
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 88/195 (45%), Gaps = 9/195 (4%)
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 390
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 209 AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRHAY 268
Query: 391 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 449
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 269 TLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTDAF 326
Query: 450 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 505
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL +
Sbjct: 327 LLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLDSR 385
Query: 506 KRVATQLVRMEKKML 520
+A + EK +L
Sbjct: 386 LAIAVGIREGEKMVL 400
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 88/195 (45%), Gaps = 9/195 (4%)
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 390
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 205 AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRHAY 264
Query: 391 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 449
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 265 TLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTDAF 322
Query: 450 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 505
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL +
Sbjct: 323 LLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLDSR 381
Query: 506 KRVATQLVRMEKKML 520
+A + EK +L
Sbjct: 382 LAIAVGIREGEKMVL 396
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 126/310 (40%), Gaps = 64/310 (20%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
+Q G S+P S ++T VL + + L + K S L L ++L+ E+ +G+ S W
Sbjct: 68 IQPGGMLVSLPESCLLTTSTVL-HSYLGPFLKSWKPRPSSLVALCVFLVCERHRGEASDW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDTVW 235
PYI L + P +++T +A L +G +AE E+ EG++ Y + +
Sbjct: 127 FPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQHLY-AVHQDF 176
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQ 295
FM+ +P P E T+E + A+ ++ + V + R + G +++A
Sbjct: 177 FMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPDNYA------- 226
Query: 296 TSCTTTRTPLSP-----THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
L+P H P +V +S C + +
Sbjct: 227 ---------LAPFLDLLNHRPDVQVKAG------------FNRTSGCYEIRSISG----- 260
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE----DPQYQDK 406
V R ++A + G N +LL+ YGFV NP+ + VE L E D +K
Sbjct: 261 -VQRYHQA----FINYGSHDNQRLLLEYGFVSSCNPHSVIYVEEDLLCEVLRGDESLDEK 315
Query: 407 RMVAQRNGKL 416
+ NG L
Sbjct: 316 MKFLRENGFL 325
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 61/287 (21%), Positives = 108/287 (37%), Gaps = 53/287 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
+ AS +++ G+ VP+ V+ E VL E + + ++ E+ L + +M+E
Sbjct: 68 LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ +G +S W PY+ L PL W E L G+ ++L RA+
Sbjct: 128 RWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDA 181
Query: 228 YNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE 285
++ +W + G ++P +P +E+++ A AV S L +
Sbjct: 182 PTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGDDKYQAMVPVW 241
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
+H+ T R L + SK +
Sbjct: 242 DLLNHI----TGDVNVR----------------------------LHHCSKRHVL----- 264
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
Q++ R AG +V G N++LL YGFV+ N Y+ + V
Sbjct: 265 ---QMIAMRDIVAGSELVNNYGELSNAELLRGYGFVERANRYNHIPV 308
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 102/418 (24%), Positives = 162/418 (38%), Gaps = 73/418 (17%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L +++E+ QG S W + L ++P+ W + +L L G+ +I
Sbjct: 88 LILCMLWEEAQGVSSKWHGFFPILPD-------GFDTPMFWDDADLEELKGTAVVDKI-- 138
Query: 220 RAEGIKREYNELDTVWFMA-GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC-- 276
G N T A S +P D +T E + S H+ R
Sbjct: 139 ---GKDDAENNFKTKLLPAVESRPDLFPPDTVATTYTLERHHLMGSRILSRSFHVLRWKA 195
Query: 277 -GCTP----QMGTESFASHLVHLQTSC-----------------TTTRTPLSPTH---NP 311
G P ++G E+ AS ++++ + TT S H N
Sbjct: 196 YGEQPDEEAEIG-EADASSVLNVDVAASESSVMDIDASNDEVENTTAEAADSAGHEAENG 254
Query: 312 KEKVSLARRF------ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVV 364
K+ R+ A+VP+ L + A L D+ V +++ KAGE I
Sbjct: 255 KDDDDDDDRYEDPADVAMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWN 314
Query: 365 WCGPQPNSKLLINYGFVD----------EDNPYD------RLVVEAALNTEDPQYQDKRM 408
G PNS LL YGF+D NP D LVVEAA + QD+
Sbjct: 315 TYGDPPNSDLLRRYGFIDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR-- 372
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 468
V + VF V G + E +M+ RL + +E + + G + P M+
Sbjct: 373 VDWWLEEAEDDVFVV--GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMD 426
Query: 469 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 525
+ D ++RL YP ++ EDE +L D + L +++A + EK++L L+
Sbjct: 427 TTIAAIAMDVLQSRLKEYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 92/430 (21%), Positives = 161/430 (37%), Gaps = 69/430 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D++AG+ SVP L+ + E L E +L N + L + L+ EK +G S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEE--LLPEKQRQLFR-NFPTHLK-VTYTLIMEKLRGADS 261
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 262 PWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARLYASM--- 311
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-SFASHLVH 293
+K AF+ + V+ G + T+ L
Sbjct: 312 ------------------------YKCAFMQLDDSVM-----GGMANLFTDYGLCYELYR 342
Query: 294 LQTSCTTTRTPLSPTHN-PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
S TTR L P P + +L AL+P S K + ++
Sbjct: 343 WAVSTVTTRQNLVPRQEIPSDAANLPIS-ALIPYWDMANHRSGKITSFYDQAAGQMECTA 401
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV--- 409
YK+GE ++ G + N+ L++ GFVD NP D + + L+ D + + ++
Sbjct: 402 QEAYKSGEQYFIYYGDRSNADRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAE 461
Query: 410 --AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEMQSVISSLGPIC 461
+R +L V H E +L ++R+ +S S+++ + L C
Sbjct: 462 LNIERKAELRVLPAPEHISGE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDC 515
Query: 462 PVSPCMERAVLDQLADYFKARLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRM 515
+ +E L K L ATL E EA+ + + Q R+
Sbjct: 516 ALETDLETRTWQYLYQRLKLLLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRL 575
Query: 516 EKKMLNACLQ 525
E+++L+ LQ
Sbjct: 576 ERRILSDALQ 585
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 70/346 (20%), Positives = 129/346 (37%), Gaps = 66/346 (19%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE+ +L W+ + G KV + + + A+ + G+ +P + ++
Sbjct: 24 EEEYDELVDWLKQCGATVDKVAVDHFNGMGQG------LKATAEAAPGETLLRIPEACML 77
Query: 135 --------TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
TL + ++T+ +L+ L+ + + + SFW PYI L
Sbjct: 78 SEESARRSTLGAYMDSDTMLKLMPNVTLA-------FHLLLELHDLDSFWRPYIACL--- 127
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQY 245
++ PL W +L L GS E + + + R+Y L + + A +
Sbjct: 128 ----PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCF 183
Query: 246 P--YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRT 303
P + EAFTFE ++ A V + R PQ G +
Sbjct: 184 PLTLGLSPEAFTFEDWRWAVATV------MTRQNSIPQAGPDG----------------- 220
Query: 304 PLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 363
+ PT AL+PL + + + + ++ V P K G I
Sbjct: 221 QMKPT------------LALIPLWDMINHANHPMSTQFDSERECLEFVCPAPAKPGSQIT 268
Query: 364 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 409
+W G + N + L++ GF + D + V +L+ D Y+ K ++
Sbjct: 269 MWYGDRNNGQFLLHQGFFFAGHANDYVNVPFSLDETDSLYKIKALL 314
>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
Length = 692
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/229 (22%), Positives = 86/229 (37%), Gaps = 42/229 (18%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
+K EY+ ++ G L E+++++IF A+ + S T
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIASRAF-------T 207
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
++ F++H Q PL + N K + R +G
Sbjct: 208 SRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIG------------ 255
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
L V P GE I GP N +L+ YGF DNP D
Sbjct: 256 ----------LKVIEPTFTGEEIHNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 72/309 (23%), Positives = 121/309 (39%), Gaps = 52/309 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + IA L L + +MYE+ +GK
Sbjct: 10 VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L Q P LWSE E+ L G+ + E +K ++ E
Sbjct: 66 GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 117
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
L ++ P + P + FTFE +++A +S V +S
Sbjct: 118 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLV-----------------SSRS 151
Query: 292 VHLQTSCTTTRTPLSPTHNPK---EKVSL---ARRFALVPLGPPLLA----YSSKCKAML 341
+ PL+ N K E V A L+ + C+ +
Sbjct: 152 FEIDAEHGYGMVPLADLFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCRE-I 210
Query: 342 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTE 399
++ +++V+ + AG I G N+ LL YGF + +NP+D L ++ L
Sbjct: 211 SSDKSVLEMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVL 270
Query: 400 DPQYQDKRM 408
++Q KR+
Sbjct: 271 LSRFQKKRV 279
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 133/340 (39%), Gaps = 62/340 (18%)
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
+P ++ A S+ G A+ +P LVVT ++ G + + T + + L +YL Y
Sbjct: 28 KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKD--TYKNANHSSLLKIYLCYS 81
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ Q +SF PY+ L + A++SP +WS + A L G+ + E + E
Sbjct: 82 RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESF 287
W+ A +L P D+P F ++L+ + +
Sbjct: 135 -------WWNAINLL---PEDVPKPEQHF--------------INLKFYYENKFYTDDDY 170
Query: 288 ASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPP-------------LLAYS 334
S+ + TS T+ P + V +R F + P LL ++
Sbjct: 171 YSYFNEVDTSNWTSF----PNYLWASLVLKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHN 226
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 394
K K + D D +GE + G + N +LL+ YGF E+NP D A
Sbjct: 227 PKTKVQWSGTDGGFLFQSDDA-SSGEELFNNYGQKGNEELLLAYGFAIENNPADS----A 281
Query: 395 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 434
AL + P D ++ ++ + + H + + +SD
Sbjct: 282 ALKIKIP---DSKLQVVKDLGIKLPSIHDYTNSVIDQVSD 318
>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
Length = 653
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
AA +++ D VP +++T+ER L + + A + + + L ++L+YE
Sbjct: 56 AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++G +SFW PY +D G L P WS+ + L S K +I + + + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167
Query: 228 YNEL 231
++ L
Sbjct: 168 WDML 171
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 114/287 (39%), Gaps = 59/287 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
V L GD ++P SL++T VL + I + L +LS L ++L+ E+
Sbjct: 8 VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G SFW PY+ L + L W+ E+ L TK + + +N
Sbjct: 67 KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFAS 289
L + L +Q P AFT+++FK A+ +V + V++ + PQ
Sbjct: 119 RLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQ----PQ-------- 162
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-- 347
+ LSP K AL P LL ++ + A DD+
Sbjct: 163 ------------NSVLSPDEEDKS--------ALAPFL-DLLNHTVDVEVN-ARFDDSSK 200
Query: 348 ----VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
L +PY + + + GP N KLL+ YGF NP++ +
Sbjct: 201 SYKITTLTACKPY---DQVFINYGPHSNEKLLLEYGFTLPCNPHNNI 244
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 102/248 (41%), Gaps = 41/248 (16%)
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY L + ++ + W++ ELA LTGS +I + E ++++Y + +
Sbjct: 189 WAPYFDTLPEE-------FDTLMFWNQDELAELTGSTILDKIGK--EEVEKDYETV--IK 237
Query: 236 FMAGSLFQQYPYDIPT---EAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
M S +P T E + ++ + V S H++ +P G + H V
Sbjct: 238 PMIESRADLFPVPEGTSWEENYGIAMYHRMGSLVLSRSFHVE---ASPSNGEDPDEGHDV 294
Query: 293 HLQTSCTT---------TRTPLSPTHNPKEKVSLARR-------------FALVPLGPPL 330
++++ + + P+ T + ++ + A+VPL L
Sbjct: 295 SMESAHSLVPESVDNGHSTEPIQETLDHHSELGIEGEDEDEDEEREAVEDIAMVPLADLL 354
Query: 331 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
A + A L D +++ R K GE I G PNS LL YG VD+ N +D
Sbjct: 355 NAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGHVDDPNRFD-- 412
Query: 391 VVEAALNT 398
VVE ++ T
Sbjct: 413 VVEISIKT 420
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 75/337 (22%), Positives = 129/337 (38%), Gaps = 70/337 (20%)
Query: 81 LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
+ WM NG+ C++ +PS N ++ A ++ Q P L +T
Sbjct: 18 FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71
Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
VL + + +L+ ++ + + L+L+ E+ +G+ SFW PY+ L + G
Sbjct: 72 VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125
Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+PL +SE EL+ L G+ +A + G+ LD A S+F +IP
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDR----ANSVFWTRALNIP--- 177
Query: 254 FTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE 313
P FA L TT + P + +
Sbjct: 178 -------------------------CPHSFNNRFAVDL-----DSTTHKKPEESSAADTD 207
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAM-----------LAAVDDAVQLV--VDRPYKAGE 360
V + + L P + + KA+ + V +++ LV +D G
Sbjct: 208 DVKIPSSVWVEGLVPGIDFCNHDLKAVALWEVDGPEGSVTGVPNSMYLVTGLDVVISNGS 267
Query: 361 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
I + G + N +LL YGFV +NP D L+V + +
Sbjct: 268 EIFISYGNKSNEELLYLYGFVLVENPDDYLMVRSTIG 304
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 112/297 (37%), Gaps = 45/297 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
V A+++++ + F +P S V+++E +L E T + L L L ++YE G
Sbjct: 40 VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--- 228
S W PY L + + + WSE ELA L S A+I EG +
Sbjct: 98 DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKI--GREGANEAFLGQ 148
Query: 229 -----NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMG 283
E ++F S Q ++ E + K + + + ++ TP+
Sbjct: 149 LVPVIKEFAGIFFSGDSRAAQKAEEMRDEKNITLMHKMGSL-IMAYAFDIE--PATPRKD 205
Query: 284 TESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA 343
+ + PL+ N A + +C A L
Sbjct: 206 VDEEGFAEEEEDEALPKGMIPLADMLN---------------------ADADRCNARLFY 244
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE 399
+++ +P KAGE I GP P S LL YG+V E+ YD + V L +E
Sbjct: 245 EQKYLEMKALKPIKAGEEIFNDYGPLPRSDLLRRYGYVTENYAQYDVVEVPMELVSE 301
>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
Length = 761
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G+KSF+ PY++ + + + + WS+ EL Y+ E I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152
Query: 225 KREYNELDTVWFMAGSLFQQY 245
RE +L +W A +F ++
Sbjct: 153 VRE--DLKDLWNQAKEIFNEF 171
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 129/327 (39%), Gaps = 72/327 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT-------LERVLGNETIAELLTTNKLSE--LACLALY 163
V SE+L S P SL +T L+R+LG A+L N LSE L C L
Sbjct: 3 VHTSEELPPDAPVISAPFSLAITPTVAADALQRILGPG--ADL---NSLSERELVCTYLA 57
Query: 164 LMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
+ + K+ G + L + +D R QL +PL ++ ELA L G+ A +R
Sbjct: 58 MHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDR 115
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTP 280
+ E V G E T+E + A +
Sbjct: 116 EADWRSECERCRAVLGHWG------------EHLTWEHYLTA----------------ST 147
Query: 281 QMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL-LAYSSKCKA 339
+ + +F S L+ + + T P+ +P LVPL L A +
Sbjct: 148 HLSSRAFPSTLLSPEPALIPT-----PSSHP----------VLVPLIDSLNHARAHPVSW 192
Query: 340 MLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
++ D+ +V P AG ++ GP+PN++L++ YGF DNP D LV++ +
Sbjct: 193 SVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELVLGYGFALPDNPDDTLVLKVS- 251
Query: 397 NTEDPQYQDKRMVAQRNGKLSVQVFHV 423
D+R + + G L +F V
Sbjct: 252 -----GAADRREIWRAGGGLQRILFDV 273
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 100/242 (41%), Gaps = 38/242 (15%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A + L++ K+QG +S P+I +L G PL WS+ +LA L A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRC 276
+ E+ +RE+ L +G P P+ E F A V+S R
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAP---PSR----EEFWWAMGVVRS------RT 233
Query: 277 GCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSK 336
P +G S S + L LS + +++A+ PL L ++S
Sbjct: 234 FSGPYIG--STLSDRLRLAGLVAALVVILSRS---------LKQYAICPL-IDLFNHTSA 281
Query: 337 CKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 394
++ ++ D+ +V R +K GE + + G Q N L+ YGF + DNP D V+
Sbjct: 282 AQSEVSYNYFGDSYSVVASRDFKKGEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVISD 341
Query: 395 AL 396
L
Sbjct: 342 VL 343
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 78/336 (23%), Positives = 125/336 (37%), Gaps = 68/336 (20%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
A VS L L +W+ GL ++ +P N+ Y+ AS ++A
Sbjct: 156 AAAQLAAVSTPRGALARLTAWIDNAGL---EINSNARPGLNDVDE--LYLFASNPIEAAT 210
Query: 124 AAFSVPNSLVV--TLERVLGNETIAELLTTNKLSELA------CLALYLMYEKKQGKKSF 175
+VP LV+ T R L N I L + ++ LA+ L+YE + KS
Sbjct: 211 LVATVPAPLVMFETYLRTLENPMI--LAIDRRFKTMSVPDPSYALAMALLYESYE-PKSM 267
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDT 233
W +I L + ++S + WS E L P K +ILER +++ YN
Sbjct: 268 WREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA--- 315
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT-PQMGTESFASHLV 292
L +P+ +++E+FK A++ V S + T PQ+ L
Sbjct: 316 ---TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLH 372
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
H +T + +P+E + + L
Sbjct: 373 H-----DPVQTNIQLGVHPEEVLGF-----------------------------EISLKT 398
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
R K GE +V G PN +LL+ +G NPY+
Sbjct: 399 TRAIKKGEPLVRHIGELPNHQLLLRFGLAMPRNPYE 434
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 84/180 (46%), Gaps = 10/180 (5%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+VPL L A + +C A L D +++ +P +AGE I GP P S LL YG++
Sbjct: 223 GMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYI 282
Query: 382 DED-NPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKLSVQVFHVHAG---REKEAIS 433
E+ YD + + A L ++ D + +KR+ ++ + + A +E++S
Sbjct: 283 TENYAQYDVVEIPADLVSQALAHDGLWHEKRIEYLDEQEIVDTGYDIAASVPFSLEESLS 342
Query: 434 DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 493
L L + + E + + S G + P + M + L +AR+A YP TL +D
Sbjct: 343 PELVILVETMLLPSEEFER-LQSKGRL-PKAEKMTGKAAEILYKIVQARIAQYPTTLEQD 400
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 61/281 (21%), Positives = 112/281 (39%), Gaps = 45/281 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQG 171
+ A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 62 LMATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAG 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
S W PY+ + P+ W E E+ +L +P + + LE+ K E EL
Sbjct: 122 DCSQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KTEVQEL 169
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
T S Q D + +T++ + A+ V + V+++ T Q L
Sbjct: 170 HTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKH---TQQ-------DRL 219
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
+ Q C H+P+ +V +A + ++
Sbjct: 220 LAQQDVCALAPYLDLLNHSPEVQV----------------------EAEFSKDRRCYEIR 257
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
+ + + + GP N +LL+ YGFV +NP+ + V
Sbjct: 258 TNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANNPHRSVYV 298
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 50/229 (21%), Positives = 85/229 (37%), Gaps = 42/229 (18%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R S L + + +L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
++ EY+ ++ G L E++ ++IF A+ + S T
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIASRAF-------T 207
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
++ F++H Q PL + N K + R +G
Sbjct: 208 SRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIG------------ 255
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
L V P +GE + GP N +L+ YGF DNP D
Sbjct: 256 ----------LKVIEPTSSGEEVHNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 84/395 (21%), Positives = 149/395 (37%), Gaps = 72/395 (18%)
Query: 97 LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
LK++ + RP + + ++ LQ D S+P ++T + VL + + E +
Sbjct: 39 LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97
Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
K +S L L +L+ EK G+KS W PY+ L + P+ E ++
Sbjct: 98 MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVS 149
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L P + + E+ + Y + LF + I F + + A+ +
Sbjct: 150 LLPEPLRKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTIN 205
Query: 268 SCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLG 327
+ ++++ H Q C + L P + +LA L+
Sbjct: 206 TRTIYMK------------------HSQRKCFS----LEP-----DVYALAPYLDLLNHS 238
Query: 328 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 387
P + + KA + ++ + K E + + GP N +LL+ YGFV DNP+
Sbjct: 239 P-----NVQVKAAFNEQTRSYEIRTNSLCKKYEEVFICYGPHDNQRLLLEYGFVAMDNPH 293
Query: 388 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 447
+ V +A + DK QRN K+S+ H D+L L G+ +
Sbjct: 294 SSVYVSSATLLKYFPPLDK----QRNAKVSILKDH-----------DLLENLTFGWDGPS 338
Query: 448 SEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 482
+ + + L C R + L D AR
Sbjct: 339 WRLLTALKVLSLGADEFTCWRRTL---LGDVISAR 370
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 102/231 (44%), Gaps = 34/231 (14%)
Query: 322 ALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
A+VP+ L A Y S+ + +D ++++ +P GE I G PNS LL YG
Sbjct: 254 AMVPMADMLNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRRYGH 312
Query: 381 VDE--------DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSVQVF 421
VD NP D +VE A + + Q A+R G V +
Sbjct: 313 VDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDVFIL 370
Query: 422 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 481
+ E +++P+ RL S + ++ S P V + ++L +A+ +
Sbjct: 371 ETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANALER 422
Query: 482 RLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
RLA YP ++ ED+ +LT+ +L+ K V +L EK++L+ L + +
Sbjct: 423 RLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 38/68 (55%)
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 405
D +++ DR Y GE +++ G N+ L +N+GF N YD+ ++ + +DP Y+
Sbjct: 216 DVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFTLARNIYDQALIRIDMPVQDPLYKK 275
Query: 406 KRMVAQRN 413
K + Q++
Sbjct: 276 KLDIWQKH 283
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 63 ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PYI+ L + + + +LW EL + S E +E E K+E+
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 168
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 63/267 (23%), Positives = 103/267 (38%), Gaps = 44/267 (16%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
A +R E +K +Y + T AG F+ +A+T++++ A
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAG--FEG------ADAYTWDLYLWA--------- 174
Query: 272 HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLL 331
AS + S T +EK+S+ ++ +G
Sbjct: 175 ----------------ASMFISRAFSARVLSGVFPETDLSEEKLSVL--LPIIDMG---- 212
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 391
+ K A D V VV AG+ I GP+ N +L++NYGF NP D +
Sbjct: 213 NHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPRNNEQLMMNYGFCIPGNPCDHRI 272
Query: 392 VEAALNTEDPQYQDKRMVAQRNGKLSV 418
V P Y K Q L+V
Sbjct: 273 VSLRAPPGSPLYMAKSHQLQMYPDLAV 299
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 89/429 (20%), Positives = 167/429 (38%), Gaps = 74/429 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGK 172
A+ DLQ +VP L++++ + + + L + LAL+++ E+ +
Sbjct: 26 ATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLCERLH-E 84
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L R + + L +S ++ L GSP+ E L++ GI ++Y
Sbjct: 85 DSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY---- 133
Query: 233 TVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
V+F +P + +FTF+ F+ A V + R E+ + L
Sbjct: 134 -VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTV------MTRQNDVKVSSNETVKA-L 185
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
+ + C P + + K V+ +
Sbjct: 186 IPMWDMCNHCNGPFTTGFDDSTK--------------------------------EVKSL 213
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
+P +AG+ + ++ G + N+ L + GFV + D + ++ ++ D Y K +
Sbjct: 214 AFKPTRAGDQVFIFYGRRNNADRLFHNGFVYTEAEEDWVNIQLGVSKNDRLYAMKAQILA 273
Query: 412 RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVSDTSEMQSV--------ISSLGPICP 462
G L R E IS ++ +LR+ + +T E++ ++ L +C
Sbjct: 274 MVG-LDASGRSYRVLRGPEPISPELRIFLRV-FSMNTGELKPYLFNPEGLPVTPLAELCK 331
Query: 463 VSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRME 516
+ +L +F R L Y T EDEA+L+ D LH R +L E
Sbjct: 332 AEFTLSEENELKLWSFFHTRLQLILGQYKTTKQEDEALLSRDDNTLH--TRNCIRLRMSE 389
Query: 517 KKMLNACLQ 525
+ +L + L+
Sbjct: 390 RDILVSALE 398
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 11/114 (9%)
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYS--SKCKAMLAAV 344
+A H+V+ T C L P + E SLA +VPL +L +S S+C A+ +
Sbjct: 157 WAWHIVN--TRCIYRNNKLHPLIDNTEDDSLA----IVPL-IDMLNHSNDSQCCAIWDSK 209
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 398
+ +++V RP + GE I + G N L I YGF +DN D+ VE +L +
Sbjct: 210 FNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKDNICDK--VEISLGS 261
>gi|297816978|ref|XP_002876372.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297322210|gb|EFH52631.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 518
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 74/344 (21%), Positives = 138/344 (40%), Gaps = 49/344 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM NG+ C L N+ V A DL+ GD ++ + +T++
Sbjct: 9 FKRWMKANGVD-CSDALNLVDDQNDGVS----VRAFCDLKEGDVVANISKTACLTIK--- 60
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
E++ + L L++ LMYE+ G++S W Y++ L Q + PL+W
Sbjct: 61 -TSGAREMIESADLDGSLGLSVALMYERSLGEESPWAGYLQILPVQE-------DLPLVW 112
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
S +L + L+G+ + E I ++ E + + SL P ++ +++F + +
Sbjct: 113 SLQDLDSLLSGTELHKVVKEDHVLIYEDWKE--NILPLTSSL----PQNVDSDSFGIKEY 166
Query: 260 --KQAFVAVQSCVV-HLQRCGCTP-------QMGTESFASHLVHL------QTSCTTTRT 303
++ +A +S + G P + G E H H ++
Sbjct: 167 LAAKSLIASRSFQIDDYHGSGMVPLADLFNHKTGAEDV--HFTHESDTEADESENDDAAN 224
Query: 304 PLSPTHNPKEKVSLA--RRFALVP------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 355
+ +P K+S + + F VP + +ML +++++ +
Sbjct: 225 EATDEDDPSSKISSSPEQSFEDVPGENTDDEAKEEEEEDDENSSMLQNDQSGLKMIMVKD 284
Query: 356 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
AG + G N+ LL YGF + DN YD + ++ L TE
Sbjct: 285 VSAGTEVFNTYGLMGNAALLHRYGFTEFDNLYDIVNIDLELVTE 328
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
A +P +L VT V +E +A L EL LAL+LM E+++G++S W P++ L
Sbjct: 2 ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59
Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYN 229
A SP+LW E + L SPT E R +++E++
Sbjct: 60 PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD 99
>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 488
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 92/442 (20%), Positives = 170/442 (38%), Gaps = 54/442 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G S W PY + L ++ + W++ EL L GS I + A
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSA--------- 149
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIF--KQAFVAVQSCVVHLQRCGCTPQMGTESFA 288
A +F + +P + E+F ++ S +MG+
Sbjct: 150 -------AEEVFLRDL--LPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMGS-LIM 199
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
S+ ++ E+ L + ++PL L A + + A L D +
Sbjct: 200 SYAFDVKNDEAEEVEGEGGYVTDDEERQLPK--GMIPLADLLNADADRNNACLFQEDGYL 257
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTED 400
+ + + GE I G P ++LL YG+V D YD V A L +
Sbjct: 258 AMKSIKSIRKGEEIFNDYGELPRAELLRRYGYVTDNYAQYDEAEVPIQTICKVAGLKSST 317
Query: 401 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 460
P + R+ + ++ + + +++ LP L ++ + L
Sbjct: 318 PGPDEPRLEFLDDLEVLDDGYGIPRPDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQK 377
Query: 461 CPV-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML-----TDYNLHPKK----RVAT 510
V P + A L + + L YP T+++D+ +L + PK ++A
Sbjct: 378 SKVPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANNQGSTSPKSAGRLKMAL 437
Query: 511 QLVRMEKKMLNACLQVTADMIM 532
Q+ + EK++LNA L D I+
Sbjct: 438 QVRKGEKEILNAVLSELEDFIV 459
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 109/275 (39%), Gaps = 51/275 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +DL+ GD A +P S++++ E V + L + +S L L+ M EK
Sbjct: 178 ARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNCDSK 237
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L +S + L G+ EI++ + + +Y+EL
Sbjct: 238 FKI-YFDTLPEKFNTG-------LSFSIQAITMLDGTLLLEEIMQARQHLHAQYDEL--- 286
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
F A L +P P E +T+E F A E + S+ + +
Sbjct: 287 -FPA--LCNNFPDIFPPELYTWEKFLWA---------------------CELWYSNSMKI 322
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
S RT L P F L P ++ Y + ++++ + R
Sbjct: 323 MYSDGKLRTCLIPLAG----------FLNHSLCPHVMHYGK-----VDPATNSLKFCLSR 367
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
P ++GE + G +S L+ YGF+ + DN YD
Sbjct: 368 PCRSGEECCLSYGNFSSSHLITFYGFLPQGDNSYD 402
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 79/395 (20%), Positives = 145/395 (36%), Gaps = 65/395 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++D+ A VP + + E++ E + T + LA L+ EK +G S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L + + L ++ ++ L G+ + L + I R+Y +L
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTP-QMGTESFASHLVH 293
+ S + +E+++ A V + R P ++ T+ + +
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTV------MTRQNLVPREIATKDDGNSPIS 280
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
C H P + S Y S M +
Sbjct: 281 ALIPCWDM-----ANHRPGKITSF---------------YDSNAHQMECTAQEFC----- 315
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
KAG ++ G +PN+ LL++ GFVD +N D + + L+ D +A++
Sbjct: 316 ---KAGNQFFIYYGDRPNADLLVHNGFVDPNNNKDFVNIRLGLSPTDG-------LAEKR 365
Query: 414 GKLSVQVFHVHAGREK-----EAIS-DMLPYLRLGYVSDT------SEMQSVISSLGPIC 461
+L ++ H G + E IS +L ++R+ +S S+++ + L C
Sbjct: 366 SRLLDRLNIEHKGEFRVLPAPEYISGQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDC 425
Query: 462 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 496
+ +E FK L ATL E + +
Sbjct: 426 ALETDLETRTWQYFHQRFKLLLGVLEATLREADEL 460
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 88/218 (40%), Gaps = 33/218 (15%)
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFM--AGSLFQQYPYDIP 250
++ P W E+ L G+ + + I +E++ L D + G L++Q
Sbjct: 111 LDQPYFWKLQEVELLKGTDIYLLMKQNLRKIVKEWHVLLDQLKLKPEDGELYEQ------ 164
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
+EA F+I K C Q + S+ S + +L + T P
Sbjct: 165 SEAQDFDILKYI-------------CEYREQHKSISWKSFVGYLWATGIFTSRAF-PKLI 210
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 370
+EK S L PL LL + + K +D V V K GE + G +
Sbjct: 211 LEEKCSSINEAFLYPL-VDLLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKS 269
Query: 371 NSKLLINYGFVDEDNPYD--RL-------VVEAALNTE 399
N LL++YGFV + NPYD RL +++ ALN E
Sbjct: 270 NEDLLLSYGFVQDQNPYDLTRLTLRLTKEMIDEALNAE 307
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 85/377 (22%), Positives = 152/377 (40%), Gaps = 92/377 (24%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASED 118
D +V+ + +VV+ K+W+ +NG P E+ ++ V AS+D
Sbjct: 8 DGIVSANGDVVA--------FKNWLAENGAEFHPHAAFRTERSGYS--------VIASQD 51
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSE--LACLAL---YLMYEKK 169
L++ S P SL +T E + + LL T SE L C + +++
Sbjct: 52 LRSDTTVVSCPFSLAITPE--VSKNALTTLLGPTFTGQSWSERQLICSYICMHWILDPSA 109
Query: 170 QGKKSFWLPYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
+ + W PYIR L D+ R +PL +S+TEL L GS L+R +
Sbjct: 110 SSELAHW-PYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQS 160
Query: 227 EYNE-------LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
E+ + +D W E F++E + A +
Sbjct: 161 EWEQCQKTIATVDLTW---------------GEQFSWERYLSA----------------S 189
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
+ + +F S ++ S +T + ++ +R G P+ S +
Sbjct: 190 TYLSSRAFPSMVLSPNPSLVSTEESYPVLLPGIDSLNHSR-------GQPV----SWVVS 238
Query: 340 MLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 398
+ + D + + LV+ + AG ++ GP+PN++L++ YGF +NP D +V++ N+
Sbjct: 239 IGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELILGYGFSLPENPDDTIVLKIGGNS 298
Query: 399 EDPQYQDKRMVAQRNGK 415
Q K V RN +
Sbjct: 299 ASGLQQQKWEVG-RNAQ 314
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G S W PY + L ++ + W++ EL L GS I
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRI 145
>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
Length = 510
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 127/320 (39%), Gaps = 45/320 (14%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ NG+ K+ K K N V + D+Q + F +P ++++ E
Sbjct: 14 SWLTNNGV---KISPKLKVEDNRYKDEGRCVVTTTDIQKDELLFEIPRNVLLNCETSQLV 70
Query: 143 ETIAELLT---TNKLSE-------LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ I +LT T SE + CL Y MY K KS W PY L L
Sbjct: 71 KDIPAVLTELETFSGSEPLSWEPLILCL-FYEMYILKD--KSRWWPYFEVLPTLEDMNVL 127
Query: 193 AVESPLLWSETELA-----YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+LWS+ +LA Y+ K ++ + +KR F+ S +Q
Sbjct: 128 -----VLWSDEDLAALEPSYVLSCIGKEQVENMYQLLKR---------FIEASDHEQLKS 173
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSP 307
++ F+++ +F+ + S ++ ++ E ++ + T
Sbjct: 174 NL--NKFSWD----SFIRIGSLIMSYS-FDVGKEIHNEGKEGESMNENDNMTNGDEDEDE 226
Query: 308 THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 367
E V + + ++VPL L A + KC A L ++++ R +GE + G
Sbjct: 227 DEEDLE-VEMIK--SMVPLADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYG 283
Query: 368 PQPNSKLLINYGFVDEDNPY 387
NS+LL YG+V+ D Y
Sbjct: 284 ELSNSELLRRYGYVEWDGSY 303
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 127/322 (39%), Gaps = 38/322 (11%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P +LV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N ++ +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ W++ EL L+GS +L + E N L + + +P +F
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSP 181
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
K A ++ + H +MG+ A + + +E++S
Sbjct: 182 EGKAALLS----LAH--------RMGSLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSK 229
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
+VPL L A + + A L D + + +P + GE I G P + LL
Sbjct: 230 ----GMVPLADLLNADTDRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRR 285
Query: 378 YGFV-DEDNPYDRLVVEAALNT 398
YG+V D YD VE ++ T
Sbjct: 286 YGYVTDNYAQYDE--VEISMRT 305
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 105/275 (38%), Gaps = 52/275 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L + ++ + L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G + + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDELI-- 282
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L + + P E +T+E + L C E + S+ + +
Sbjct: 283 -----PLLSNHRHVFPPEHYTWEHY-------------LWAC--------ELYYSNSMQI 316
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ +T L P F + P ++ Y C +++ V R
Sbjct: 317 KFPDGKLKTCLIPVAG----------FLNHSIYPHIVKYGKVCVET-----SSLKFPVSR 361
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
P GE + G +S LL YGF+ + DNPYD
Sbjct: 362 PCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 396
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 37/71 (52%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
++PL S+K + V++ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 381 VDEDNPYDRLV 391
++ DNP+D +
Sbjct: 231 IEFDNPHDHFI 241
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G S W PY + L ++ + W++ EL L GS I
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRI 145
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 100/231 (43%), Gaps = 27/231 (11%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+VPL L A + + A L + A+ + +P KAG+ I G P S LL YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIFNDYGELPRSDLLRRYGYV 289
Query: 382 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 431
DN VVE L N ED +Y +++ Q ++ + + ++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQLKLLDQL--EILEDGYCILRPSPEDT 346
Query: 432 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 487
+ D+LP L D+ E+Q ++S P+ E + L D ++++ Y
Sbjct: 347 LLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGAREARI---LLDAAQSKMGQYG 403
Query: 488 ATLSEDEAMLTDY-------NLHPKKRVATQLVRMEKKMLNACLQVTADMI 531
T+ ED+ +L + ++ +A Q+ EK++L A L + D +
Sbjct: 404 TTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQALLMMLQDFL 454
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 58/124 (46%), Gaps = 22/124 (17%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838
Query: 225 KREY 228
KRE+
Sbjct: 839 KREF 842
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 116/303 (38%), Gaps = 59/303 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P L+++ E + +E L N ++ L L+ M E+
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L +P + T++ F L C E + S+ + +
Sbjct: 296 ------LCTNFPEMFRKDVCTWDDF-------------LWAC--------ELWYSNSMMI 328
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLV 351
S T L P + P +L Y VD+A ++
Sbjct: 329 VLSSGKLSTCLVPVAGLLNH----------SVSPHILNY--------GRVDEATKSLKFP 370
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDR--LVVEAALNTEDPQYQDKRM 408
+ RP AGE + G P S L+ YGF+ DNPYD L ++ +++ ED Q
Sbjct: 371 LSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYDVIPLDLDTSVDDEDIAAQSSAT 430
Query: 409 VAQ 411
+Q
Sbjct: 431 TSQ 433
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 134/344 (38%), Gaps = 69/344 (20%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
L + W+ N + P ++ + EK + + A + ++ + SVP +++
Sbjct: 35 LNEFNKWLINNKVYKNPKIEIKVLEKYGRS--------IVAKQSIKKNEKLISVPKLIIM 86
Query: 135 T----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+ L NE I E + +S A++LMY K KSFW PY+ L ++
Sbjct: 87 SNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKE---- 140
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ + +SE EL L S K + R +GI+R YN T ++ ++ P
Sbjct: 141 ---FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHYN--STFTRLSNRGIAEFS---P 192
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHN 310
T T LQ+ G T ++ T + SC +R S + +
Sbjct: 193 TSTQT-----------------LQQKGYTLELFTWAL---------SCVWSRA-FSLSDS 225
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG--- 367
V LA F + SK K D + + GE I G
Sbjct: 226 DGGMVPLADMFNAEEI--------SKSKVQPKVTDSTLDYYASDDIEIGEQIFTPYGVYK 277
Query: 368 PQPNSKLLINYGFV-DEDNPYDRLVVEA-ALNTEDPQYQDKRMV 409
P +S++L++YGFV D P D + + + ++P Q K+ +
Sbjct: 278 PLSSSQMLMDYGFVFDHGTPSDNVAISVPIFHPDEPNIQVKQSI 321
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D W+ K G + E + E R V A +++ G +VP L+++
Sbjct: 5 DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58
Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ AE LL TNK + LA +L++E +G++SFW PY+ L RQ
Sbjct: 59 KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 311 PKEKV-SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 369
PK K+ + + R +P+ L + CK + +A+ +VQ DR YK GE + V GP
Sbjct: 168 PKMKIYAHSDRLVCMPVAD-LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPH 224
Query: 370 PNSKLLINYGFVDEDNPYDRLVVEAAL 396
N LL YGF+ + N +D + ++ +
Sbjct: 225 SNDFLLTEYGFILDTNRWDEVYLDEVI 251
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 42/91 (46%)
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
R+ + P+ K A +L + + GE I + + N++ L+
Sbjct: 244 GRQRIMAPMADLFNHGDVKTSYTFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQ 303
Query: 378 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
YGFV E NP+D + + A++ + P Y+DK +
Sbjct: 304 YGFVIESNPHDYVGIAASIGNDQPFYRDKSL 334
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 52/225 (23%), Positives = 85/225 (37%), Gaps = 41/225 (18%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + + P W E ++ +L G+ I E
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQ 281
IKREY + V + F + + ++ ++K AF S
Sbjct: 169 ANIKREYKQARKV--LKEENFPNW------KEYSQMLYKWAFSIFTS------------- 207
Query: 282 MGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAML 341
SF L+ L S + L P+ AR + PL ++
Sbjct: 208 ---RSFRPSLI-LSQSVKDYVSTLLPS---------AREIDDFSILQPLFDIANHSMTAT 254
Query: 342 AAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
D + QL+ Y+ G+ + G + NS+LL+ YGF+
Sbjct: 255 YTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKTNSELLLAYGFI 299
>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
Length = 583
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 109/280 (38%), Gaps = 51/280 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 319
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+L + +P P E +T E F A E + S+ + +
Sbjct: 320 ---VPALCKDHPDIFPPEFYTQEQFLWA---------------------CELWYSNGMQV 355
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ RT L P F L P ++ Y + + ++++ V +
Sbjct: 356 MFTDGKLRTCLIPIAG----------FLNHSLYPHIMHYGK-----VDSKTNSLKFCVSK 400
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVE 393
P GE + G +S L+ YGF+ + DN YD + +E
Sbjct: 401 PCNMGEQCYLSYGNFSSSHLVTFYGFIPQGDNLYDTIPLE 440
>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
SB210]
Length = 726
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 103/246 (41%), Gaps = 39/246 (15%)
Query: 67 SREVVSK--KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDA 124
++E+ +K K E + +W+ NG+ V E P ++ I AA++D+ A
Sbjct: 22 NKELTAKLQKLEKYVNFNNWLKANGVVYDGV---EYPVAFGQYGLIG-AAATKDIAPLTA 77
Query: 125 AFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLMYEKKQGKKSFW 176
S+PN ++++ +R +E +E L + K ++ A L ++ MYE+ +GKKS W
Sbjct: 78 FISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFMYERLKGKKSLW 137
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LDTVW 235
Y L+ E+ L W+ E+ + + + REY E +D +W
Sbjct: 138 HEYFEILENN--------ETILTWTAEEINRIPDPYIQKQA--------REYKEQVDELW 181
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTES--FASHLVH 293
L P T E+F A+ V S RC Q GT FA L H
Sbjct: 182 DELKELLHSQPNFFQKATATKELFLWAYNIVMS------RCFGYTQKGTSIVPFADCLNH 235
Query: 294 LQTSCT 299
+ T
Sbjct: 236 NKYHAT 241
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 74/327 (22%), Positives = 125/327 (38%), Gaps = 48/327 (14%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G+ I E + + R V A++D+ + F +P + ++++E +L
Sbjct: 13 AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A T L L L ++YE G S W PY L + + + WS
Sbjct: 70 TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120
Query: 202 ETELAYLTGSPTKAEILERAEGIKREY--------NELDTVWFMAGSLFQQYPYDIPTEA 253
E ELA L S +I EG + E ++F +Q ++ E
Sbjct: 121 EDELAELQASAVLNKI--GKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDER 178
Query: 254 FTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE 313
++H +MG+ A T+R + +E
Sbjct: 179 -------------NVLLMH--------KMGSLIMA---YAFDVEPATSRKDVDEEGFAEE 214
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
+ A ++PL L A + A L + +++ +P +AGE + GP P S
Sbjct: 215 EEDEALPKGMIPLADMLNADADCNNARLFYEEKYLEMKALKPIRAGEEVFNDYGPLPRSD 274
Query: 374 LLINYGFV-DEDNPYDRLVVEAALNTE 399
LL YG+V D YD + + L TE
Sbjct: 275 LLRRYGYVTDNYAQYDVVEINMDLVTE 301
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 83/215 (38%), Gaps = 40/215 (18%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + +SE EL G+ + + IK +Y +L A LF Q+P P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
E V L R P + CT + T +
Sbjct: 161 IED------------VRLLRRATDP------------YKWALCTVWSRSMDFTLPDGSSI 196
Query: 316 SLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
L FA +L +SS +C A D + + + Y+ G+ + ++ GP PN+
Sbjct: 197 RLLAPFA------DMLNHSSEVKQCHAYDVKSGD-LSVFAGKDYEIGDQVYIYYGPIPNN 249
Query: 373 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
+LL YGFV DNP D + + P Y+ K+
Sbjct: 250 RLLRLYGFVIPDNPNDSYDLVLTTHPMAPFYEQKQ 284
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 87/399 (21%), Positives = 153/399 (38%), Gaps = 82/399 (20%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F YP + A + ++HL
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIF--YP-------------EGAAQPTEDELLHL 188
Query: 274 -QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLA 332
R G T ++ L+ + + + L +VP+ L A
Sbjct: 189 AHRMGST-------IMAYAFDLENDDENENEEDGWVEDREGRTML----GMVPMADTLNA 237
Query: 333 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLV 391
+++ A + + + +AG+ ++ + GP P S+LL YG+V E + YD +
Sbjct: 238 -NAEFNAHINHGESLEATAIRADIRAGDQVLNYYGPLPTSELLRRYGYVTPEHSRYDVVE 296
Query: 392 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEM 450
V L E V LS + + V + + E I D Y + S
Sbjct: 297 VPWTLVKE---------VIVSCLSLSAEAWKQVESQIDDEEIED---YFVIERDSGEPGP 344
Query: 451 QSVISSLGPICPVSPCMERAVLDQLADYFKA----------------------------- 481
++ + VSP + ++QL ++ KA
Sbjct: 345 DGRFTAPAVLREVSPEL----VEQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKV 400
Query: 482 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
RLA YP ++ DE +L + +L ++R+A + EKK+L
Sbjct: 401 RLAQYPTSIETDEKLLAEADLPARRRMAVVVRLGEKKLL 439
>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
Length = 495
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 71/329 (21%), Positives = 125/329 (37%), Gaps = 45/329 (13%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ WM +G+ C L+ + E YV A L+ GD ++P +T R
Sbjct: 16 FRRWMRDHGVV-CSDALRLDAA--EDGGGGVYVRALAALREGDLVATIPRGACLT-PRTS 71
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G +L LA+ +MYE+ +G +S W Y+R + + PL+W
Sbjct: 72 GAAEAI---EAAELGGPLALAVAVMYERARGAESPWDAYLRLIPERE-------PVPLVW 121
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
E L G+ + + + I ++ E ++G L ++ + F+ E +
Sbjct: 122 PADEAERLLAGTELDKIVKQDRQFICEDWKECIEPLILSGEL------EVDPDDFSLENY 175
Query: 260 KQAFVAVQSCVVHLQR---CGCTP--QMGTESFASHLVH----LQTSCTTTRTPLSPTHN 310
A + S + G P + VH L+ S + + P +
Sbjct: 176 FSAKSLLSSRSFRIDSYHGSGMVPLADLFNHKTGGEHVHFTSVLEASDSDSEDGEDPNNA 235
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 370
++ S A +P G D+ ++++V R GE + G
Sbjct: 236 SADEQSTIENSADIPSGDD---------------DEDLEMIVVRDVNEGEEVFNTYGTMG 280
Query: 371 NSKLLINYGFVDEDNPYDRLVVEAALNTE 399
N+ LL YGF + DN YD + ++ AL T+
Sbjct: 281 NAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
Length = 495
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 71/329 (21%), Positives = 125/329 (37%), Gaps = 45/329 (13%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ WM +G+ C L+ + E YV A L+ GD ++P +T R
Sbjct: 16 FRRWMRDHGVV-CSNALRLDAA--EDGGGGVYVRALAALREGDLVATIPRGACLT-PRTS 71
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G +L LA+ +MYE+ +G +S W Y+R + + PL+W
Sbjct: 72 GAAEAI---EAAELGGPLALAVAVMYERARGAESPWDAYLRLIPERE-------PVPLVW 121
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
E L G+ + + + I ++ E ++G L ++ + F+ E +
Sbjct: 122 PADEAERLLAGTELDKIVKQDRQFICEDWKECIEPLILSGEL------EVDPDDFSLENY 175
Query: 260 KQAFVAVQSCVVHLQR---CGCTP--QMGTESFASHLVH----LQTSCTTTRTPLSPTHN 310
A + S + G P + VH L+ S + + P +
Sbjct: 176 FSAKSLLSSRSFRIDSYHGSGMVPLADLFNHKTGGEHVHFTSVLEASDSDSEDGEDPNNA 235
Query: 311 PKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 370
++ S A +P G D+ ++++V R GE + G
Sbjct: 236 SADEQSTIENSADIPSGDD---------------DEDLEMIVVRDVNEGEEVFNTYGTMG 280
Query: 371 NSKLLINYGFVDEDNPYDRLVVEAALNTE 399
N+ LL YGF + DN YD + ++ AL T+
Sbjct: 281 NAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
Length = 566
Score = 47.0 bits (110), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 109/280 (38%), Gaps = 51/280 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 302
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+L + +P P E +T E F A E + S+ + +
Sbjct: 303 ---VPALCKDHPDIFPPEFYTQEQFLWA---------------------CELWYSNGMQV 338
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ RT L P F L P ++ Y + + ++++ V +
Sbjct: 339 MFTDGKLRTCLIPIAG----------FLNHSLYPHIMHYGK-----VDSKTNSLKFCVSK 383
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVE 393
P GE + G +S L+ YGF+ + DN YD + +E
Sbjct: 384 PCNMGEQCYLSYGNFSSSHLVTFYGFIPQGDNLYDTIPLE 423
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 47.0 bits (110), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 99/438 (22%), Positives = 171/438 (39%), Gaps = 83/438 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + N + + +L+ E LA+ L+YEK
Sbjct: 47 VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G KS W +I L R + L + EL L GS + E + +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159
Query: 229 NELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
L + LF+ P D+ E F+ E +K A + S R G P +
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS------RFGDVP-VAK 210
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+SF + P+ N + ++ F +
Sbjct: 211 QSFKA------------MVPVFDMLNHDPEAEMSHFFDMST------------------- 239
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-- 402
+LV + + AG + + GP N KLL YGFV NP+D VE L ++
Sbjct: 240 -QRFKLVSHQHWNAGAQMFINYGPLSNHKLLALYGFVIIGNPFD--AVEMWLPMDEASTK 296
Query: 403 -YQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-----MLPYLRLGYVS--DTSEMQSVI 454
+Q+K + NG HA E ++D +L R+ + E + +
Sbjct: 297 FFQEKEQLLLTNGL-------DHATNPFELVADESNDLLLMAARIQEIDCETVEEFEELA 349
Query: 455 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML------TDYNLHPKKRV 508
+ +S E+ L +L + L +P ++ ED+ +L TD NL+ +R+
Sbjct: 350 NKALEGEMISLENEQEALTRLIYTLEKMLESFPTSIEEDDILLEQDDKKTD-NLN-HERM 407
Query: 509 ATQLVRMEKKMLNACLQV 526
A + R +K +L+ + +
Sbjct: 408 AVAVRRSDKYILSENINM 425
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 36/74 (48%)
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
R AL+PL + YS M D A+ L V + G+ GP N LL YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274
Query: 380 FVDEDNPYDRLVVE 393
FV++DNP D V+E
Sbjct: 275 FVEQDNPSDTYVLE 288
>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
Length = 484
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ G +P S + + N +IA LL +++ + L + +YE K
Sbjct: 40 VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W PY++ + G L V P WSETE L GS
Sbjct: 96 EKSHWFPYLKSIRIYNDDG-LLVLPPSHWSETEKLLLKGS 134
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 92/423 (21%), Positives = 150/423 (35%), Gaps = 90/423 (21%)
Query: 81 LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
LKS+M HK K LKE+ + RP + + ++ LQAGD S+
Sbjct: 17 LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76
Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
P ++T L LG + L L L C L + EK
Sbjct: 77 PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G+KS W PY+ L + A P E ++ L P + + E+ I+ +
Sbjct: 137 AGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQELFQ 188
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFAS 289
+ LF + +I F F + A+ V + ++++
Sbjct: 189 SSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYMK--------------- 229
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 349
H C + L P + +LA L+ P + + KA + +
Sbjct: 230 ---HPHRECFS----LEP-----DVYALAPYLDLLNHSP-----NVQVKAGFNEQTRSYE 272
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 409
+ D K + +++ GP N +LL+ YGFV DNP+ + V A + DK
Sbjct: 273 IWTDSQCKKYQEVLICYGPHDNQRLLLEYGFVATDNPHSSVYVSADTLLKYFSSLDK--- 329
Query: 410 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 469
QR K+S+ H D L L G+ + + + + L C R
Sbjct: 330 -QREAKVSILKDH-----------DFLENLTFGWEGPSWRLLTALKVLSLAADEFACWRR 377
Query: 470 AVL 472
+L
Sbjct: 378 ILL 380
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 77/162 (47%), Gaps = 20/162 (12%)
Query: 74 KEEDLG-DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
KE+ +G +L W+ NG K+ L+E + + R +H + + L G+ +P
Sbjct: 29 KEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVPGERILVIPKKC 82
Query: 133 VVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQR 187
++T+E +G +T I L + +A L ++L+ + + + SF+ Y L
Sbjct: 83 LITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFFRNYYSTLPSTL 140
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
P+ WSE EL++L GS +I ER I+++Y+
Sbjct: 141 S------NMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYD 176
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 54/234 (23%), Positives = 91/234 (38%), Gaps = 17/234 (7%)
Query: 322 ALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
ALVP L Y + DA + G + G + N + L+NYGF
Sbjct: 216 ALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHRFLLNYGF 275
Query: 381 VDEDNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 433
EDN + ++++ L+ D Q + DKR +G ++ +
Sbjct: 276 AVEDNTEEDGRNPNEVLIDFQLSPADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTR 334
Query: 434 DMLPYLRLGYVSDTSEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATL 490
+ + RL + T E S + P P+S E L+ L + +L+ Y T+
Sbjct: 335 EGFSFARL--IVATEEEFSTMKMKSPAHSSPPISFDNEIRALEYLRNLMTHQLSLYDTTI 392
Query: 491 SEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 542
ED +L Y L + A +R EK++ Q AD ++ L + ++ C
Sbjct: 393 EEDNELLASKQYPLFSNRIQALFFIRGEKQVCRY-FQELADKVIPLFSLPLAEC 445
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 46.6 bits (109), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 94/210 (44%), Gaps = 35/210 (16%)
Query: 75 EEDLGDLKSWMHKNG----LPPCKVI--LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
++ + K+W+ NG L K++ L E + A+ +++ GD V
Sbjct: 64 QQIVDQFKNWLASNGCQESLDKVKIVRTLAEGTG----------LIANTEIKEGDEFIKV 113
Query: 129 PNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
P L ++ E + +G++ E L K+ L ++L+ E ++ ++SFW PYIR L +
Sbjct: 114 PLKLFMSQETAFKSIGDKVSREPLF--KMLPNMLLVIHLIQETQKQQQSFWAPYIRMLPK 171
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
+ ++ L ++ E L GSP +LE E I N L F+ F +
Sbjct: 172 -------SYKTALYFTLAEFQLLIGSP----VLE--ESINTYRNTLRQYCFLY-DFFGKN 217
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
P + T FT+E + A +S V L++
Sbjct: 218 PGILSTSNFTWEFEQNELAAYKSIVSLLKK 247
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 46.6 bits (109), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 36/71 (50%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
++PL S+K + V+ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 381 VDEDNPYDRLV 391
++ DNP+D +
Sbjct: 231 IEFDNPHDHFI 241
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 46.6 bits (109), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 104/276 (37%), Gaps = 58/276 (21%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK-- 172
AS ++AG+ + L++ + G + T ++LA + L Y K Q K
Sbjct: 25 ASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERY-KGQAKLQ 83
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 84 SSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQ 138
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ LF + + E FK + V S + + G +S
Sbjct: 139 NALDVWPQLFGK---------VSLEDFKHVYATVFSRSLAI---------GEDS------ 174
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
T P+ N A FA + LL Y+ +
Sbjct: 175 ------TLVMIPMLDFFNHN-----ATSFAKLSFN-GLLNYAV--------------VTA 208
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
DR Y + I + G N++L ++YGF +NPYD
Sbjct: 209 DRDYAENDQIWINYGDLSNAELALDYGFAVPENPYD 244
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 63/303 (20%), Positives = 121/303 (39%), Gaps = 49/303 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWL 177
++ G +P ++++ + VL + + + +L+ + + +L+Y+K G+ SFW
Sbjct: 65 IKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQAITTFLIYQKHIGETSFWK 124
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + P+ + E + YL S +A I + + + Y EL +
Sbjct: 125 PYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQECIKSYEELKPFFPS 176
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTS 297
L + FTF+ ++ A+ V++ +++ G T L +L S
Sbjct: 177 LEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTV----------LRNLDKS 221
Query: 298 CTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 357
T L P + AR L+ CK D + + YK
Sbjct: 222 GLGV-TSLVPMVDLLNHSHSARTGLLI---------KKSCKNG----DYFYTVTAEDDYK 267
Query: 358 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQDK 406
G+ ++ N LL+NYGFV DN D + ++E EDP+++ +
Sbjct: 268 RGDQVLFCYRRADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRR 327
Query: 407 RMV 409
+++
Sbjct: 328 KVL 330
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 334 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
S CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251
Query: 394 AAL 396
+
Sbjct: 252 EVI 254
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 87/409 (21%), Positives = 153/409 (37%), Gaps = 62/409 (15%)
Query: 48 RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
RR+ R+ +S + D L W+ NG V + + +E
Sbjct: 24 RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83
Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
A+ L G A VP SL +T E + ++ + + L L+ LA
Sbjct: 84 GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E+++G++S + Y++ L + SPL W+ EL+ + G+ ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192
Query: 222 EG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
G ++ Y L T +F ++ +D+ AF+ + F+ AF ++S
Sbjct: 193 AGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS------- 238
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
C L P + SL++ V G +A +
Sbjct: 239 -----------------RTMAPCDGANIALVPGVDLVNHSSLSQARWRVSGG---VAGAV 278
Query: 336 KCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVV 392
DD V V DR E + V P+ ++ +++GFVD P +
Sbjct: 279 AGLFGGGKGDDGVSARVECDRALNVNEPLYVNYNPEGTDTSFALDFGFVDTITPSPGYAL 338
Query: 393 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
++ +DP DK V G F + A + + D+ +LRL
Sbjct: 339 SLSVPEDDPNVFDKLDVLDVCGLGETPTFTLRAYSDPD--PDLRTFLRL 385
>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
dendrobatidis JAM81]
Length = 607
Score = 46.2 bits (108), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L LK W +N + + ++ + N R V A + L+ GD ++P +++++
Sbjct: 4 LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N +A++L L L + LM+E+ G+KS W YI+ L L P
Sbjct: 60 ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108
Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
L W + + A L G+ A +LE + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141
>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
CIRAD86]
Length = 449
Score = 46.2 bits (108), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 109/270 (40%), Gaps = 37/270 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A+ DL + + F +P + ++T E + I + LT LS L L +++E G
Sbjct: 42 VVATSDLTSDEEIFRIPRTSILTTETTDLPQEILQQLTDPWLS----LILAMIFEYLLGT 97
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S + PY+ L + + + W++ EL YL GS ++I +
Sbjct: 98 NSRFKPYLDILPE-------SFNTLMFWTDNELQYLQGSAILSKIGKEE----------- 139
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
A + F + I T+ EIFK Q + R G S+A L
Sbjct: 140 -----ADNTFSEQLLPIITK--NPEIFKIGTCNNQDLLALCHRMGSIIM----SYAFDLD 188
Query: 293 HLQTSCTTTRTPLSPTHNP-KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
T+ T++ + EK+S AL+PL L A + L D+ +
Sbjct: 189 PPPTTTTSSSEEWESDSDSENEKISPK---ALIPLADMLNANGDLTNSKLFFSSDSFIMK 245
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+P AGE ++ GP P + LL YGFV
Sbjct: 246 TLQPVAAGEELLNDFGPLPPADLLRRYGFV 275
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 119/303 (39%), Gaps = 62/303 (20%)
Query: 84 WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
W+ K+G L V+ P E+ + A++ ++ G + ++P SL +T L
Sbjct: 18 WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71
Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ IA+ + E +AL +++E+ QG+ S P+I L ++ G+L E PL
Sbjct: 72 SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W E +L S T+ G + +E D W ++ + F ++P P + F F
Sbjct: 127 WGEADLTLADASSTRG-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDF 179
Query: 260 KQAF-VAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLA 318
+ A VA+ TP + FA+H ++ R PT
Sbjct: 180 RWAVGVALSRSFFVDGELRLTPLV---DFANH--------SSLRGVSEPT---------- 218
Query: 319 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 378
G + SK AV L + Y+ GE V GP+ + L
Sbjct: 219 --------GGTTGLFGSK----------AVVLRAGKNYEEGEEFFVSYGPKGAAGYLEEN 260
Query: 379 GFV 381
GFV
Sbjct: 261 GFV 263
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 54/112 (48%), Gaps = 14/112 (12%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
A LA+ LM + G + W Y L AV+S ++WS+ EL L GS +
Sbjct: 47 ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQS 268
+ R + KREY+ L F A L + P EA++F++F+ A+ V +
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATVMA 145
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 82/171 (47%), Gaps = 21/171 (12%)
Query: 68 REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
RE V+ E D+ +L W+ G K++L++ + + R +H +L G+
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158
Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
+P + ++T+E +G +T I + + + + +A L LYL+ + ++ +F+ Y
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
L P+ WS+ EL++L GS +I ER I+++Y+ +
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAI 261
Score = 38.5 bits (88), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 76/178 (42%), Gaps = 23/178 (12%)
Query: 358 AGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQ 411
+G + G + N + L+NYGF EDN + ++V+ L+ D Q ++
Sbjct: 336 SGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQLDPGDGQ-----LLYD 390
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDTSEMQSVISSLGPIC---PV 463
+ L + R + SD P R G+ ++ T + S + P P+
Sbjct: 391 KTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAATEDEFSSMKMRSPAHASPPI 448
Query: 464 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNLHPKKRVATQLVRMEKKM 519
S E A L+ L +L YP +L E EA+L +Y L+ + A +R EK++
Sbjct: 449 SFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFFIRGEKQV 506
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 89/426 (20%), Positives = 151/426 (35%), Gaps = 90/426 (21%)
Query: 49 RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
R +R S+ V G + E L W+ + G +V+ + N+
Sbjct: 12 RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
R V A L+ G A +P + + R + + L + + +AL
Sbjct: 72 RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180
Query: 224 ----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQRCG 277
++ Y EL +F +Y +D+ AF F+ AF ++S
Sbjct: 181 YDAYVRAVYEEL------KNGVFVEYASTFDVDG-AFDEASFRWAFGILRS--------- 224
Query: 278 CTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKC 337
T PL + ALVP G L+ +SS
Sbjct: 225 ----------------------RTMAPLDGAN-----------VALVP-GLDLINHSSLS 250
Query: 338 KAMLAAVDD---------------AVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFV 381
A A + DR Y G I V P+ +SK ++YGF+
Sbjct: 251 GARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEIFVNYDPEGIDSKFALDYGFI 310
Query: 382 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
D NP + ++ +D DK V + G F + + + ++ +LRL
Sbjct: 311 DVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTFTLRPYSDPD--RELRTFLRL 368
Query: 442 GYVSDT 447
+ DT
Sbjct: 369 LHCKDT 374
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++ LQ G+ +P L ++ + ++ L N+L + +AL+LM E+ +
Sbjct: 39 VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
SF+ P+I+ L Q + P+ W++++ A L G+
Sbjct: 99 DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEGT 130
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 88/206 (42%), Gaps = 19/206 (9%)
Query: 358 AGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 415
A + + G ++KLL +YGFV +E N ++ + + + DP ++ K+ + + N
Sbjct: 238 ANDPFYISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAA 297
Query: 416 LSVQVFHVHAGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVL 472
Q + + + + L R+ +S + E + S VS E AV
Sbjct: 298 TRDQTYDFRGTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVRNELAVY 353
Query: 473 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACLQVTADM 530
+ D + RLA +P +L +D L + ++ R + VRME KK+L + ++ +
Sbjct: 354 AAIHDVCEKRLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEW 413
Query: 531 IMLL--------PDVTVSPCPAPYAP 548
LL PDVT P P
Sbjct: 414 RNLLENDSNIYPPDVTRQQQPQLSMP 439
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L +++++ +P GE + G PNS+LL YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272
Query: 382 D-EDNPYD 388
+ E + YD
Sbjct: 273 EWEGSKYD 280
>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
Length = 660
Score = 45.8 bits (107), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 57/120 (47%), Gaps = 17/120 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
+ A+ D+ A F++P + ++ +E R+L G AE L A L L
Sbjct: 41 IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++ E G++S+W PYI L + ++P+ W++ EL L G+ AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 45.8 bits (107), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 94/444 (21%), Positives = 160/444 (36%), Gaps = 89/444 (20%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+K+ E L + K W+ N P + L +K + + A + ++ D S+
Sbjct: 34 TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85
Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
P ++++ + G + I E + + LS A+++MY K +KSFW PY+ L
Sbjct: 86 PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
+ + L +S+ EL L S K + R +GI+R Y
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHY--------------- 180
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRT 303
+ F + V Q L SC +R
Sbjct: 181 ----------------ESTFSRLSKLVPEFSNLALYNQ--------ELFTWALSCVWSRA 216
Query: 304 PLSPTHNPKEKVSLARR-FALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGES 361
SLA +VPL A SK K + D + GE
Sbjct: 217 -----------FSLAENDGGMVPLADMFNAEDRSKSKVLPKVTDTTLDYYASDDIAEGEQ 265
Query: 362 IVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKL 416
I G P +S++L++YGF+ DE D + + + ++P K+ + + N +
Sbjct: 266 IFTPYGVYKPLSSSQMLMDYGFIFDEGTVSDNVAITVPVFHNDEPNLSTKQEILEENDII 325
Query: 417 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP---ICPVSPCMERAVLD 473
+ +VF + A D+L Y R+ + Q+ L P P++ E+ L
Sbjct: 326 N-EVFLLQKTDPLPA--DLLLYARVKNLIAKECDQAKKHFLSPNTRNTPLNTRNEKVSLR 382
Query: 474 QLADYFKARLAGYPATLSEDEAML 497
L + L Y L D+ +L
Sbjct: 383 FLENLIHRYLDSYGTNLESDKNLL 406
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 71/327 (21%), Positives = 132/327 (40%), Gaps = 75/327 (22%)
Query: 113 VAASEDLQAGDAA------FSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
+ A DL++ +A ++P+ LV++ E V + +LL + S +
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202
Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
LYL+ + Q + + W YIR L R + P +W+E E L G+ +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR 275
+ + + +E++ L V + +P+ E+ + +V V + + R
Sbjct: 256 ALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAW--YRSR 306
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
C P+ GT A+VP G ++ +SS
Sbjct: 307 CLELPRSGT-------------------------------------AMVP-GLDMVNHSS 328
Query: 336 KCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQPNSKLLINYGFVDEDNPYDRLV 391
K A D D V L++ P ++GE + + G +P S++L +YGF+D +N D+L
Sbjct: 329 KATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAKPASEMLFSYGFIDPNNIVDKLT 388
Query: 392 VEAALNTEDPQYQDKRMVAQRNGKLSV 418
+ +DP + K ++ L++
Sbjct: 389 LRLDPFPDDPLARAKLRISNSGPTLTI 415
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 28/203 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 5 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 58
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 59 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 114
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 115 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 162
Query: 248 -DIP-TEAFTFEIFKQAFVAVQS 268
+P ++FT+E ++ A +V +
Sbjct: 163 NKLPLKDSFTYEDYRWAVSSVMT 185
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 45.8 bits (107), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 79/197 (40%), Gaps = 18/197 (9%)
Query: 345 DDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-------PYDRLVVEA 394
DD Q V +PY GE + + N +LL Y + EDN D +
Sbjct: 188 DDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVEDNYNKYTEIRLDETTIAR 247
Query: 395 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 454
E + R + QR L+ Q + V + +E D++ LRL ++ T
Sbjct: 248 ICPDEVERLTKTRALTQRG--LAKQTYPV---KSEEFPLDLVQALRLYHLPLTDSHTE-- 300
Query: 455 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT-DYNLHPKKRVATQLV 513
S+ PVS E V D +A K L+ YP T ED AML D L R+A
Sbjct: 301 STCFETDPVSVQNELMVYDTIAGCVKELLSQYPITAQEDAAMLAHDPRLSATARLAVAYR 360
Query: 514 RMEKKMLNACLQVTADM 530
R +K L V A+M
Sbjct: 361 REDKLFLTEVGSVFAEM 377
Score = 38.5 bits (88), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 13/115 (11%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
A + VP SL++ L E + + K + A LAL++++E ++ SF
Sbjct: 4 ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
W PY L + V P+ W++ ++ L GSP A +L + + + + E
Sbjct: 63 WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTE 110
>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 701
Score = 45.8 bits (107), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + G+ V + P R A+ D+ GD SVP ++T E + +
Sbjct: 37 WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90
Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ + L +++S E L L ++ E +G +S + PY+ L R A
Sbjct: 91 ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145
Query: 196 SPLLWSETELAYLTGS 211
SPL WS ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 45.4 bits (106), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 108/280 (38%), Gaps = 57/280 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE + G+ A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ L +P EIF++ + L C E + S+ +
Sbjct: 294 PL------LCTNFP----------EIFRKDVCTWDN---FLWAC--------ELWYSNSM 326
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQ 349
+ S T L P + P +L Y VD+A ++
Sbjct: 327 MVVLSSGKLSTCLVPVAGLLNH----------SVSPHILNYGR--------VDEATKSLK 368
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
+ RP AGE + G P S L+ YGF+ DNPYD
Sbjct: 369 FPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYD 408
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 45.4 bits (106), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
++ L+R FA G L +S Q+V ++ +K G+S+ + G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241
Query: 374 LLINYGFVDEDNPYDRLVV 392
LL++YGF+ DNP D V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 45.4 bits (106), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 63/292 (21%), Positives = 107/292 (36%), Gaps = 49/292 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A+ L + SVP L+ + E R+ G T A L LA L+ EK
Sbjct: 59 ATRPLAKDELVLSVPRKLIFSEESNSDCRLFGKMTQATHLN---------LAYDLVIEKI 109
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G+ S W PYI L + + L ++ ++ L G+ + L + I ++Y
Sbjct: 110 RGEFSEWRPYIDVLPAK-------YSTVLYFTTKQMELLRGTAAASLALRQCRVIAKQY- 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-SFA 288
AF +++ A + + G T+
Sbjct: 162 -----------------------AF---LYRYAHTMTEPSTGNRSHPGERGLFFTQHGLC 195
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L S TR L P+ + + + AL+P K + AAV +
Sbjct: 196 YELYRWAVSTVMTRQNLVPSEKQESEDTPKLISALIPYWDMANHRPGKITSFYAAVPRQL 255
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
+ AGE ++ G + N+ LL++ GFVD++N D + + L+ D
Sbjct: 256 ECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFVDDNNLKDYVNIRVGLSLTD 307
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 45.4 bits (106), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 70/154 (45%), Gaps = 17/154 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + G+ VP L++ + L ++ LL +L + C+ L LM E S
Sbjct: 32 ASKRISPGETFLKVPRHLLLGPHQ-LRASSLDRLLEGWQLPD--CMLLLLMCESVN-SSS 87
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F+ PY+ L V++P+ WS+ E L GSP ++ + R + E
Sbjct: 88 FFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE---- 136
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
M +F +YP P F++E ++ A+ ++S
Sbjct: 137 --MKDKVFDKYPDRFPPLLFSYERYQWAYSILRS 168
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 45.4 bits (106), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
R AL+P+ S C + + + DR Y+A E + G N LL YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235
Query: 380 FVDEDNPYDRLVVEAAL 396
F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 45.4 bits (106), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 67/324 (20%), Positives = 125/324 (38%), Gaps = 52/324 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P SLV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N + +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
+ W++ EL L+GS +L + E N + + +P +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANIFRNILPLVSGNPSLFPPMSGVASFDSP 181
Query: 251 -TEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTH 309
+A + + V + +++ + G + + +
Sbjct: 182 EGKAALLSLAHRMGSLVMAYAFDIEKGENDGREGQDGYVT-------------------- 221
Query: 310 NPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 369
+ +E++S +VPL L A + + A L D + + +P + GE I G
Sbjct: 222 DDEEELSKG----MVPLADLLNADADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGEL 277
Query: 370 PNSKLLINYGFV-DEDNPYDRLVV 392
P + LL YG+V D YD + +
Sbjct: 278 PRADLLRRYGYVTDNYAQYDEVEI 301
>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
Length = 805
Score = 45.4 bits (106), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)
Query: 69 EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
E +K+ EDL + W+ +NG+ KV L+ H + + A + ++ + F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMG----NSLYARQMIKEDELLFRI 595
Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YE-------- 167
P + + V + T+ ++ ++ E L+L LM YE
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655
Query: 168 ---KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
K + SFWLPYI+ L + +P+ W+E E L GS + +
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708
Query: 225 KREYNELDTV 234
+RE+ + V
Sbjct: 709 RREWEMMKIV 718
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 75/334 (22%), Positives = 117/334 (35%), Gaps = 96/334 (28%)
Query: 70 VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
VV + D D SWM NG S +EK H +A L G
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562
Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
VV L+ + ET+ L + ++ A L+ EK +G S W
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
YI L + + P+L + EL + P E+++ + I+ ++ L +V +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670
Query: 238 AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
AG+ F++Y + + + AFT +F H P M +H H
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADD---------HYAPYVMMPYM---DMINHHYHY 718
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
Q + P K ++ +ARR
Sbjct: 719 QADWMS-----QPIWGGKVEI-VARR---------------------------------- 738
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
K GE + GP+ N L + YGFV +DNP+D
Sbjct: 739 DIKKGEELFASFGPRANDNLFLYYGFVLKDNPFD 772
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 28/203 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQS 268
+P ++FT+E ++ A +V +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMT 253
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 100/424 (23%), Positives = 170/424 (40%), Gaps = 63/424 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV--LGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
A+ D Q GD +P L++ R L N A + L + +AL++ ++K
Sbjct: 49 ATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIV----ALKQHPSIALFIAWQKIHPT 104
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
W PYI L R L ++ LL L Y +I E A K + ++LD
Sbjct: 105 PE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLD 149
Query: 233 TVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQS-CVVHLQRCGCTPQMGTESFASH 290
T + + Y+ IP + IFK A++ V + C+ PQ+ SH
Sbjct: 150 TDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQL------SH 198
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
+H Q T L+P + S AR A Y + KA + ++
Sbjct: 199 -IHQQPIIT-----LAPFLDCLNHTSTARISA---------GYDTVEKAYI------IRT 237
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
+V PYK G + + GP N+ LL YGF NP++ +V++ ++ + +
Sbjct: 238 LV--PYKKGSQVFINYGPHDNNFLLAEYGFAILKNPFNHVVLDREVDFMMQHFGTVSDLL 295
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 468
+ G L + + ++ M Y+ + SD S + S+ G + +S +E
Sbjct: 296 KSEG-LYGEFIIANDDLGYRLMNAMRLYVAVSQGSDLSSVLPAWRSVLGGTLQYISKDLE 354
Query: 469 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT-QLVRMEKKMLNACLQVT 527
+A L QL +L + +L +A +Y V T Q+V ++L +++
Sbjct: 355 KATLQQLIRICVDKLNWFQQSLVTMDA--AEYEEFKFATVFTRQIVNEAVEILQEAIELA 412
Query: 528 ADMI 531
D I
Sbjct: 413 TDNI 416
>gi|15228958|ref|NP_191216.1| SET domain-containing protein [Arabidopsis thaliana]
gi|7594535|emb|CAB88060.1| putative protein [Arabidopsis thaliana]
gi|332646015|gb|AEE79536.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 531
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 75/359 (20%), Positives = 138/359 (38%), Gaps = 64/359 (17%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM NG+ C L N+ V A DL+ GD ++ + +T++
Sbjct: 9 FKRWMQANGVD-CSEALNLVDDENDGVS----VRAFCDLKEGDVVANISKTACLTIK--- 60
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
E++ + L L++ LMYE+ G++S W Y++ L Q + PL+W
Sbjct: 61 -TSGAREMIESADLDGSLGLSVALMYERSLGEESPWAGYLQILPIQE-------DLPLVW 112
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
S +L + L+G+ + E I ++ E + + SL P ++ +++F + +
Sbjct: 113 SLEDLDSLLSGTELHKLVKEDHVLIYEDWKE--NILPLTSSL----PQNVDSDSFGIKEY 166
Query: 260 --KQAFVAVQSCVV-HLQRCGCTP-------QMGTESFASHLVHL------QTSCTTTRT 303
++ +A +S + G P + G E H H ++
Sbjct: 167 LAAKSLIASRSFEIDDYHGSGMVPLADLFNHKTGAEDV--HFTHESDSEADESDNDDAAN 224
Query: 304 PLSPTHNPKEKVSLA--RRFALVP---------------------LGPPLLAYSSKCKAM 340
+ P K+S + + F VP + +M
Sbjct: 225 ETTDEDEPSSKISSSPEQSFEEVPGENTDDEAKEEEEEEEEEEEGEEEEEGEEEEENSSM 284
Query: 341 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
L +++++ + AG + G N+ LL YGF + DNPYD + ++ L TE
Sbjct: 285 LQNDQSGLKMIMVKDVSAGAEVFNTYGLMGNAALLHRYGFTELDNPYDIVNIDLELVTE 343
>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
Length = 494
Score = 45.4 bits (106), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 62/276 (22%), Positives = 108/276 (39%), Gaps = 31/276 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--LTTNKLSELACLALYLMYE-KK 169
V AS+D+ + + F +P S V+ + +L + +L L L ++YE K
Sbjct: 42 VLASQDIGSDEVLFEIPRSSVLNVATSQLVRDFPQLKDVFWQELGHWEGLILCMVYEIKV 101
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++SFW Y++ L + + L + WS +LA L S + A+ + Y
Sbjct: 102 MGQQSFWWNYLQVLPKSQDLNTL-----VYWSADQLAALEPSLVVGRL--GADESQEMYR 154
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFAS 289
++ G FQ TFE FV V S ++ + SF
Sbjct: 155 QILKYIQNFGPEFQS-----KIGQLTFE----EFVHVASVIM------------SYSFDV 193
Query: 290 HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 349
L ++VPL L A + + A L ++++
Sbjct: 194 DLKGEDDEDDEDEDEGEEEEGESNVAHDKYMKSMVPLADTLNADTKQFNAHLVYDKESLK 253
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 385
+V +P K G+ + + G PN+++L YG+V+ D
Sbjct: 254 MVSVKPIKMGQQVYNFYGEHPNAEILRRYGYVEWDG 289
>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
Length = 543
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 105/275 (38%), Gaps = 52/275 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L + P E +T+E + L C E + S+ + +
Sbjct: 290 -----PLLSNHREVFPPELYTWEHY-------------LWAC--------ELYYSNSMQI 323
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ +T L P F + P ++ Y + +++ V R
Sbjct: 324 KFPDGKLKTCLIPVAG----------FLNHSIYPHIVKYGK-----VDIETSSLKFPVSR 368
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
P GE + G +S LL YGF+ + DNPYD
Sbjct: 369 PCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
Length = 766
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 66/162 (40%), Gaps = 23/162 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
V A ED++ +A VP L++T+E + I + NK E L +++
Sbjct: 27 VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE ++G +SFW PY+ L VE WS++++ + E I
Sbjct: 87 LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
+ E+ E++ L YP + +F + F V
Sbjct: 139 EIEWKEIEL-------LLLHYPQHFSSALIDKHLFMRIFAQV 173
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 45.1 bits (105), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 65/302 (21%), Positives = 121/302 (40%), Gaps = 40/302 (13%)
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE------LLTTNKLSELACLALYLM 165
++ A+ED++ + F +P ++ VL + ++E +L + L + ++
Sbjct: 42 FILATEDIKTDELLFEIPRESILN---VLTSSLVSEYPAWENILLDGDVGHWEGLIICML 98
Query: 166 YEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+E K K S W PY L + S + W+ EL L S +L+R G
Sbjct: 99 FEIKVKKNMSKWAPYFDVLPESTD-----LNSLMYWTAEELEALKPSL----VLDRI-GN 148
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGT 284
+ + V + + + + D+ T+E F+ V S ++ P
Sbjct: 149 DGAHQMHEKVMELIRTFEKDHSVDLSFGTITWE----DFLYVASIIMSYSFDVELPPTSA 204
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+ + T R S +++PL L + ++KC A L
Sbjct: 205 DENEEDDEVEEDVEQTVRNEGSLK-------------SMIPLADTLNSDTNKCNAHLIYD 251
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYD--RLVVEAALNTEDP 401
+D++++ KAGE + G PN+++L YG+V+ E + YD L +E + T
Sbjct: 252 EDSLKMRAISNIKAGEQVYNIYGNHPNAEILRRYGYVEWEGSKYDFGELPLEVIIETLHE 311
Query: 402 QY 403
QY
Sbjct: 312 QY 313
>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
Length = 479
Score = 45.1 bits (105), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 14/126 (11%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +PN L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W PY R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEI 217
+I
Sbjct: 135 AVVGKI 140
>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
UAMH 10762]
Length = 633
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 61/252 (24%), Positives = 104/252 (41%), Gaps = 32/252 (12%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILER 220
YLM + ++SFW PY+ L +PL + + +LA+L G+ +L R
Sbjct: 87 FYLMTQYLNKEQSFWKPYLDVLPSPS-----EFSTPLWFDAPADLAWLDGTDVLHTMLAR 141
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTP 280
E + Y V +G I +T+++F+ A S R +
Sbjct: 142 REVYAQYYQSGLKVLSESG---------IDVTLYTWDLFRWAITTFTS------RSFTSR 186
Query: 281 QMGTESFASHLVHLQTSCTTTRTP--LSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCK 338
+ ++ VH +TS R L +H+P E + + F P L + +
Sbjct: 187 VLLPQNRKYWPVH-RTSTNGRRQTVLLDMSHSPAEDLDFSVLF------PGLDSGNHDPN 239
Query: 339 AMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
A + DA Q + +P +AG + GP+ N +LL+ YGF +NP D +++
Sbjct: 240 AQVDWSFDANQFSIALVQPIEAGAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLLTLKA 299
Query: 397 NTEDPQYQDKRM 408
E Q + KR+
Sbjct: 300 PPEALQVELKRI 311
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 45.1 bits (105), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 89/194 (45%), Gaps = 7/194 (3%)
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 390
AY K A L + D L KAG+ + + + + N++L ++YGF++ + +
Sbjct: 251 AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNADRNAY 310
Query: 391 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDTSE 449
+ ++ DP + DK +A+ NG F + ++ + L + LG +D
Sbjct: 311 TLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLVALG-GTDAFL 369
Query: 450 MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 506
++S+ S G + PVS E + + + K LAGY T+ ED+ L + L +
Sbjct: 370 LESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDSRH 428
Query: 507 RVATQLVRMEKKML 520
+A + EK++L
Sbjct: 429 AIAVGIREGEKQLL 442
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 45.1 bits (105), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 28/203 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQS 268
+P ++FT+E ++ A +V +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMT 253
>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
Length = 491
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 112/286 (39%), Gaps = 32/286 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + F +P L++++E E + L +L L + ++YE QG+
Sbjct: 61 LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W PY L + ++ + W+E +L L G +I + A +
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAAD--------E 163
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ L Q P+ P + + + C+ H +MG+ A +
Sbjct: 164 AILQKVVPLIQANPHHFPARSGMPPLDSNDALL---CLAH--------RMGSLIMA-YAF 211
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
++ + + ++ A+ +VPL A + + A L + + +
Sbjct: 212 DIEKTEGADDDAAEDGYMTDDEDEPAK--GMVPLADIFNADAQRNNARLFQEEGSFVMKA 269
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 398
R +AGE I G P + LL YG+V DN VVE +L++
Sbjct: 270 IRNIQAGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEFSLDS 314
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 28/203 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQS 268
+P ++FT+E ++ A +V +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMT 253
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 33/135 (24%), Positives = 60/135 (44%), Gaps = 11/135 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++AE L +L L + +M+E+ G+ S W Y L RG+ + P+ W+
Sbjct: 38 SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
+L +L G+ + E AE ++ ++NE + L +P P T E + +A
Sbjct: 93 QLEHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEAA 147
Query: 264 VAVQSCVVHL-QRCG 277
S ++ + CG
Sbjct: 148 SLAASRAFYIGEECG 162
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 45.1 bits (105), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 116/281 (41%), Gaps = 59/281 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ED++AG+ SVP + ++T NE++ +L T+ LS LAL+L+ + K
Sbjct: 1 MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S+W PY L + S L L++L S K E +++ + I +Y
Sbjct: 54 QSWWKPYTDLLPMHFNTMPVNYPSEL------LSHLPNS-LKQETMQQKDNIHTDY---- 102
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
V + +Q P DI T E FK A++ V + +H+ + + A L
Sbjct: 103 -VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLD 156
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
L TT + N + +RF + L
Sbjct: 157 FLN---HTTEAKIESGFNIR-----TQRFEIKTL-------------------------- 182
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
YK GE + + GP N +L YGFV +N Y+ ++++
Sbjct: 183 -TAYKKGEQVYINYGPHDNLAMLKEYGFVLNENIYNFVLLD 222
>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 536
Score = 45.1 bits (105), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 105/275 (38%), Gaps = 52/275 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L + P E +T+E + L C E + S+ + +
Sbjct: 290 -----PLLSNHREVFPPELYTWEHY-------------LWAC--------ELYYSNSMQI 323
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ +T L P F + P ++ Y + +++ V R
Sbjct: 324 KFPDGKLKTCLIPVAG----------FLNHSIYPHIVKYGK-----VDIETSSLKFPVSR 368
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
P GE + G +S LL YGF+ + DNPYD
Sbjct: 369 PCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 113/341 (33%), Gaps = 90/341 (26%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G I E + + R V AS+D+ + F +P + ++++E +L
Sbjct: 13 AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S ++ ERA+ +E L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
M GSL Y +D+ TP +
Sbjct: 181 ELMHKMGSLIMAYAFDVEP--------------------------ATPTKEVDEEGFAEE 214
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ PL+ N AR F Y C M A
Sbjct: 215 EEDAALPKGMVPLADMLNADADRCNARLF-----------YEKDCLEMKAL--------- 254
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
+P +AGE I GP P S LL YG+V DN VVE
Sbjct: 255 -KPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDVVE 293
>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 543
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 105/275 (38%), Gaps = 52/275 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
L + P E +T+E + L C E + S+ + +
Sbjct: 290 -----PLLSNHREVFPPELYTWEHY-------------LWAC--------ELYYSNSMQI 323
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ +T L P F + P ++ Y + +++ V R
Sbjct: 324 KFPDGKLKTCLIPVAG----------FLNHSIYPHIVKYGK-----VDIETSSLKFPVSR 368
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
P GE + G +S LL YGF+ + DNPYD
Sbjct: 369 PCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 109/249 (43%), Gaps = 56/249 (22%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
+S LAL+L+ ++ G KS W ++ L DR+ G ++ PL WS+ + LT P
Sbjct: 79 MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132
Query: 213 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+L++ + + +Y++ T F+A +Y D P +A+ + A++ V S
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNS--- 176
Query: 272 HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFAL----VPLG 327
RC ++ + TT + + ++LA L V G
Sbjct: 177 ---RC---------------LYFDLTLTTGK---KDAQEVPDNITLAPYVDLINHSVESG 215
Query: 328 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 387
P + C+ +++ + L R Y A E I + GP+ NS LL YGF +NP+
Sbjct: 216 P------THCQLKTSSIGFEI-LCGQRGYTADEEIFLCYGPRSNSVLLCEYGFTVPENPW 268
Query: 388 DRLVVEAAL 396
D + + AL
Sbjct: 269 DDVDISDAL 277
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
+D + LV + + +G + GP PNS+LL+ +GF DNP++ + + A + P +
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288
Query: 405 DKRMVAQRNG 414
+K + NG
Sbjct: 289 EKNKIMLDNG 298
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 100/455 (21%), Positives = 164/455 (36%), Gaps = 93/455 (20%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ F++P ++ T+E + + L + + LS LA+Y+++ +
Sbjct: 52 FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + +++ EL GS A I+ +Y L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
L Q+ P + FT E +K A V S + G FA L
Sbjct: 161 VV------RLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFADML 214
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
H + + H A PL L +LA D
Sbjct: 215 NH--------SSEVRQCH------------AYDPLSGNL--------TILAGKD------ 240
Query: 352 VDRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 406
Y+AG+ + ++ G PN++LL YGFV NP D + + P ++ K
Sbjct: 241 ----YEAGDQGVFFQVFIYYGSIPNNRLLRLYGFVMPGNPNDSYDLVLETHPMAPFFEQK 296
Query: 407 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV----SDTSEMQSVISSLGPICP 462
R + G S + ++D LP LGY+ SD S++ S+ I P
Sbjct: 297 RKLWDLAGFDSTSTISI-------TLTDPLPKNVLGYLRIQRSDESDLASIARQ--RIDP 347
Query: 463 ----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---VATQLVRM 515
+S E VL L + F L + L E L + ++P + A +
Sbjct: 348 KYEKISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE-GVYPSRGNAWAAAHVSLG 406
Query: 516 EKKMLNACLQVTADMIMLLPDVTVS-----PCPAP 545
E+++L + DM+ + + + P PAP
Sbjct: 407 EQQVLRLARKRAEDMLAAVESGSGNEKGSLPAPAP 441
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 89/413 (21%), Positives = 165/413 (39%), Gaps = 57/413 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ + L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCSG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ E+ L+
Sbjct: 138 DSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLE 190
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
+ + +P I T + F AF ++S R +LV
Sbjct: 191 EEIIIPNK--KLFPSSI-----TLDDFFWAFGILRSRAFSRLR------------NENLV 231
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ PL+ N +V+ AY K A L + D L
Sbjct: 232 VI---------PLADLINHSARVTTDDH-----------AYEIKGAAGLFSWDYLFSLRS 271
Query: 353 DRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
KAG+ + + + + N++L ++YGF++ + + + ++ DP + DK +A+
Sbjct: 272 PLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTDRNAYTLTLQISESDPFFGDKLDIAE 331
Query: 412 RNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCM 467
NG F + + + L + LG +D ++S+ S G + PVS
Sbjct: 332 SNGFGETAYFDIFYNRPLPPGLLPYLRLVALG-GTDAFLLESIFRNSIWGHLELPVSRDN 390
Query: 468 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 520
E + + + K LAGY T+ ED+ L + L + +A + EK +L
Sbjct: 391 EELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDSRHAIAVGIREGEKNLL 442
>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
Length = 748
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP +L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G++SF+ PYI+ + + + W++ EL+ + E +E +
Sbjct: 106 FNEKKKGEQSFFYPYIQAIQTNN--------TLIDWTKEELSQIEDPIVLDEFAIVSEDL 157
Query: 225 KREYNELDTVWFMAGSLFQQY 245
K +W A +F ++
Sbjct: 158 K-------VLWNYAQDIFNEF 171
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
A+VP+ L A A L +++V +P +GE I G PNS LL YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314
Query: 382 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 425
D NP D + V A LN + + Q + + G V VF
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVFT--- 371
Query: 426 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 485
R+ E S+++ +RL + T ++ P V +L + RL
Sbjct: 372 -RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427
Query: 486 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
YP ++ +DEA+L L KR A + EK +L L
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 60/121 (49%), Gaps = 14/121 (11%)
Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+T+ +++ A +AL+L++E Q +KS W P++ L R VE+PLLW+ ELA
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L GS I R +K + D ++ L Q++P P E F E + A V
Sbjct: 242 LEGSNL---IGFRDAVLKGWTTQRDALF---PKLTQKFPSLFPEEHFRTERWAWAMAIVW 295
Query: 268 S 268
S
Sbjct: 296 S 296
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 60/275 (21%), Positives = 109/275 (39%), Gaps = 51/275 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S++++ E V ++ L + +S L L+ M E+
Sbjct: 190 ATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERHNCNSK 249
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
+ Y L ++ G ++ S+ L + EI++ E ++ +Y+EL
Sbjct: 250 SKI-YFDTLPKEFNTGLSFGVDAIMASDGTLLF-------DEIMQAKEHLRVQYDEL--- 298
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHL 294
+L YP P E +T+E F A E + S+ + +
Sbjct: 299 ---VPALCNNYPDVFPPELYTWEQFLWA---------------------CELWYSNSMKI 334
Query: 295 QTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 354
+ RT L P F L P ++ Y + ++ + ++ + R
Sbjct: 335 KFLDGKLRTCLIPIAG----------FLNHSLHPHIIHYGK-----VDSITNTLKFPLSR 379
Query: 355 PYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 388
P + GE + G + L+ YGF+ + DN YD
Sbjct: 380 PCRVGEQCCLSYGNFSGAHLITFYGFLPQGDNRYD 414
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 110/308 (35%), Gaps = 72/308 (23%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +NG + E S++++ R +H A+ DL+ + VP ++T+E +G
Sbjct: 37 WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89
Query: 144 TIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
T EL L L+++ +++ +F+ PY L P+
Sbjct: 90 TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDIL------PSTLSNMPI 142
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W EL +L GS +I ER IK +Y + +W P I + T E
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTLEE 190
Query: 259 FKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLA 318
FK A + C C+ G V+ A
Sbjct: 191 FKWA-----------RMCVCSRNFGV-----------------------------VVNGA 210
Query: 319 RRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
R A+VP L + + K A + + G I G + N + L+N
Sbjct: 211 RTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKISVGSQIYDSYGQKCNHRFLLN 270
Query: 378 YGFVDEDN 385
YGF EDN
Sbjct: 271 YGFAIEDN 278
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 108/482 (22%), Positives = 191/482 (39%), Gaps = 95/482 (19%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L L +W+ + GLP K+ ++ + + + S+ + G +VP+S +T
Sbjct: 50 ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
++V+ + +I L+ L +AL+L++E+ + + W ++ + A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
PL WSE EL+ L G+ + + + + +Y EL+ LF + P ++
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQ 207
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
+ F A V+S V H+P +
Sbjct: 208 LDDFLWAVATVRSRV--------------------------------------HSPLD-- 227
Query: 316 SLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAVQLVVD--RPYKAGESIVVWCG 367
ALVPL L+ + ++ + LA A LVV+ R Y GE + + G
Sbjct: 228 --GEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQALVVEAQRDYAEGEVVTMDFG 284
Query: 368 P--------QPNSKLLINYGFVDEDNPY-------DRLVVEAALNTEDPQYQDKRMVAQR 412
+ +S++L++YG +D D P ++ AL +D Y DK + +
Sbjct: 285 APLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFILSLALPEDDKYYDDKADILEL 344
Query: 413 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG---PICPVSPCMER 469
NG F + A E L D ++ + + + PVS ER
Sbjct: 345 NGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEPLFRNEAWGHMLAPVSEANER 404
Query: 470 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ-LVRM-EKKMLNACLQVT 527
AV + + + +A L GY ++ +D L D P R+ LVR+ EK+ L+A L
Sbjct: 405 AVYESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRLEKAILVRLGEKETLDATLAFF 462
Query: 528 AD 529
D
Sbjct: 463 ED 464
>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
Length = 482
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 95/244 (38%), Gaps = 46/244 (18%)
Query: 158 ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ L L L+YE Q G W PY+ L A +P+ WS EL L S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
P A+ + RE A ++F+ +P +F + +V
Sbjct: 155 PAVAK-------VGRES---------ADNMFRGIL--LPAVRAHAHVFAGSERLSDEQIV 196
Query: 272 HL-QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL 330
L R G T ++A L + + K + + VP+ +
Sbjct: 197 ALAHRMGST----IMAYAFDLDKEEDEDEDGEDGWVEDRDGKALMGM------VPMAD-I 245
Query: 331 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDR 389
L ++ + DD + + RP +AGE I+ + GP PNS+LL YG+V E + YD
Sbjct: 246 LNADAEFNVHVNHGDDDLTVTALRPIRAGEEILNYYGPHPNSELLRRYGYVTERHARYD- 304
Query: 390 LVVE 393
VVE
Sbjct: 305 -VVE 307
>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 485
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 68/326 (20%), Positives = 126/326 (38%), Gaps = 54/326 (16%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP-TEA 253
+ EL L+GS +L + E N L + + +P +D P +A
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185
Query: 254 FTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE 313
+ + + + +++ + G + + + + +E
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVT--------------------DDEE 225
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
++S +VPL L A + + A L D + + +P + GE I G P +
Sbjct: 226 ELSK----GMVPLADLLNADADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRAD 281
Query: 374 LLINYGFV-DEDNPYDRLVVEAALNT 398
LL YG+V D YD VE ++ T
Sbjct: 282 LLRRYGYVTDNYAQYDE--VEISMRT 305
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
F LVPL + +SS C D A +L + RP K GE IV G + L YG
Sbjct: 154 FTLVPL-LDMANHSSTCANATVKYDHAHFELKLTRPVKRGEEIVFEYGGHDQATLWAEYG 212
Query: 380 FVDEDNPYDRLVVEA 394
F++ NP++R+ + A
Sbjct: 213 FIESSNPHERIDLTA 227
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 97/259 (37%), Gaps = 57/259 (22%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
A +LM + + FW PYIR L G+ + +PL + E E L +L G + A
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161
Query: 219 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVH 272
ER I +R Y L + F + Y +D+ A T I +AF A V + V+
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVI- 216
Query: 273 LQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLA 332
P++ V L T PLS
Sbjct: 217 -------PELKNAEVDRVSVLLPLIDATNHKPLS-------------------------- 243
Query: 333 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
K A D++ LVV AGE + GP+ N +L++NYGF DNP + VV
Sbjct: 244 -----KVEWRAGTDSIGLVVMSDVAAGEEVGNNYGPRNNEQLMMNYGFCIPDNPCEYRVV 298
Query: 393 EAALNTEDPQYQDKRMVAQ 411
+ P Q K Q
Sbjct: 299 SLRAPLDSPLAQIKAQYEQ 317
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 63/281 (22%), Positives = 103/281 (36%), Gaps = 49/281 (17%)
Query: 126 FSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
SVP L+ + E R+ G T A L LA L+ EK +G+ S W PYI
Sbjct: 164 LSVPRKLIFSEENNSDCRLFGKMTQATHLN---------LAYDLLIEKIRGEFSEWRPYI 214
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L + + L ++ ++ L G+ + L + I ++Y
Sbjct: 215 DVLPAK-------YSTVLYFTTKQMERLRGTAACSLALRQCRVIAKQY------------ 255
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-SFASHLVHLQTSCT 299
AF +++ A +S + G T+ L S
Sbjct: 256 ------------AF---LYRYAHTLAESSTGNRSHPGERGLFFTQRGLCYKLYRWAVSTV 300
Query: 300 TTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 359
TR L P+ + + S AL+P K + AAV ++ AG
Sbjct: 301 MTRQNLVPSEKQEAQDSPKFISALIPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAG 360
Query: 360 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
E ++ G + N+ LL++ GFVD +N D + + L+ D
Sbjct: 361 EQFFIYYGDRSNTDLLVHNGFVDVNNLKDYVNIRVGLSPTD 401
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 82/204 (40%), Gaps = 45/204 (22%)
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E E++ L G+P +E + I+ +Y V +L YP DI + T + F
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPV---LQALTAAYPDDITPDLVTEDKF- 120
Query: 261 QAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSP--THNPKEKVSLA 318
+ +C + S+A + ++ + T P++ H+P
Sbjct: 121 -----IWACELWY------------SYAIEVEYVDGAVRQTLVPIAHLLNHSPW------ 157
Query: 319 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 378
P ++ Y L A D+++L R AGE + GP PN KLL+ Y
Sbjct: 158 ---------PHIVRYGR-----LDAATDSLRLRAFRHCAAGEQCFLSYGPLPNLKLLLFY 203
Query: 379 GFVDEDNPYDR--LVVEAALNTED 400
GF DNP+D + EA N D
Sbjct: 204 GFALPDNPHDTVPITFEAEKNEGD 227
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 74/324 (22%), Positives = 117/324 (36%), Gaps = 73/324 (22%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV- 139
+ W+ L K+ +K P A +D+ G+ ++P+ +++ ERV
Sbjct: 49 FEKWIQSQKLAVNKLEVKSIPGFRMG------TTAKDDIADGELYIAIPDHMLMGPERVE 102
Query: 140 ------------LGNETIAELLTTNKLSELACLALYL---MYEKKQGKKSFWLPYIRELD 184
+ +++I+ LSE + +Y MY K K+SFW PY +
Sbjct: 103 PGSRLDKKLMKIVKSQSISMQEQRRLLSEKNKVLMYFLLQMYNPK--KESFWKPYFDIMP 160
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ SP+ WSE EL L GS + ++ Y+EL +F+
Sbjct: 161 TN-------LTSPIFWSEDELQELAGSEVSNMARIEKKRLRAMYDELRE------RIFKH 207
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTP 304
+AFT + + A S V+ L R Q G + T P
Sbjct: 208 DRKTFLKQAFTLKNWFWANGLYDSRVIQLNR-----QTGHGN------------VPTFIP 250
Query: 305 LSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 364
L N + + Y K +A + D AV R + ES
Sbjct: 251 LIDMVN-----------CIESQDKTFIQYDKKLRAAVMYADRAVS----RGVQVFESY-- 293
Query: 365 WCGPQPNSKLLINYGFVDEDNPYD 388
G + N + L+ GFV EDNP D
Sbjct: 294 --GNKSNYEYLLYNGFVMEDNPND 315
>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
Length = 728
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
+WM K G+ V + P H + A+ D++ GD VP + ++T + V G
Sbjct: 51 AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104
Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+ +A L L E L L +M E +G++S + PY+ L R A
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159
Query: 195 ESPLLWSETELAYLTGS 211
SPL WS EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 59/139 (42%), Gaps = 18/139 (12%)
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+ F H + + T TP + E+++ + AL P+ L C+ +
Sbjct: 172 QEFLYHWLLVNTRTFYHETPAT------ERLTKDDKMALQPVADLLNHSDEGCEVVFDT- 224
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----VVEAALNTED 400
+ DR YK GE + + G N L++ YGF E+N +D + VV ++T
Sbjct: 225 -GCYTISADREYKQGEEVYICYGTHSNDFLMVEYGFCPEENKWDEVCIDEVVLEEMSTAR 283
Query: 401 PQYQDKR------MVAQRN 413
++ D R ++ +RN
Sbjct: 284 KKWLDGRDFLGKYLIDERN 302
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 14/108 (12%)
Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
S+P S ++T+E +G T I + T+ L A L +Y+++++K G+ SF+ PY
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ L + P+ W+ EL L GS +I +RAE IK +Y
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDY 240
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
R AL+P+ CK +A D +V DR YK GE + + N +L+ YG
Sbjct: 172 RLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILLEYG 229
Query: 380 FVDEDNPYDRLVVEAAL 396
F+ +++ D + ++ A+
Sbjct: 230 FIPDESLDDDVYIDDAV 246
>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 485
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 68/326 (20%), Positives = 126/326 (38%), Gaps = 54/326 (16%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP-TEA 253
+ EL L+GS +L + E N L + + +P +D P +A
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185
Query: 254 FTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKE 313
+ + + + +++ + G + + + + +E
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVT--------------------DDEE 225
Query: 314 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 373
++S +VPL L A + + A L D + + +P + GE I G P +
Sbjct: 226 ELSK----GMVPLADLLNADADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRAD 281
Query: 374 LLINYGFV-DEDNPYDRLVVEAALNT 398
LL YG+V D YD VE ++ T
Sbjct: 282 LLRRYGYVTDNYAQYDE--VEISMRT 305
>gi|402862437|ref|XP_003895567.1| PREDICTED: SET domain-containing protein 4 [Papio anubis]
Length = 456
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 117/310 (37%), Gaps = 36/310 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTADTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQT 296
LF + I F++ A+ V + V+L+ Q S L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLR----PRQRECLSAEPDTCALAP 230
Query: 297 SCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 356
+C +PL S + L PL S + KA + ++ +
Sbjct: 231 ACLCPFSPLCS--------SGLQNLMLHPLS----GLSQQVKAAFNEETHSYEIRTTSRW 278
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQ 411
+ E + + GP N +L + YGFV NP+ R ++ L + D Q K + +
Sbjct: 279 RKHEEVFICYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILK 338
Query: 412 RNGKLSVQVF 421
+G + F
Sbjct: 339 DHGYIENLTF 348
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 78/178 (43%), Gaps = 12/178 (6%)
Query: 358 AGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTEDPQYQDKRMVAQ 411
AG I+ + GP P+S+LL YG+V E + YD +V AL E +D +
Sbjct: 275 AGSEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEELKLSEDTIADIE 334
Query: 412 RNGKLSVQVFHV---HAGREKEAISDML--PYLRLGYVSDTSEMQSVISSLGPICPVSPC 466
R + ++ F V AG E + + P LR + ++ + +L P
Sbjct: 335 RKLESELEEFFVIERDAG-EPSSYGTLTQPPVLREISTELEEQTKAFLKALKKRDPKRKR 393
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 524
E L + RL YP + +DE++L+ L + R+A ++ EK++L L
Sbjct: 394 SETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRLGEKRLLQEAL 451
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 75/343 (21%), Positives = 135/343 (39%), Gaps = 64/343 (18%)
Query: 158 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
AC L L+ E+ Q FW PYIR L + ++PL +++ E+ L G+
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159
Query: 215 A-EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
A ++L R + E+ AG F + +T+++F +++ ++
Sbjct: 160 AGDVLLRKQLWMEEWE--------AGKQFLEGVGAERAREYTWDLF------LRAATIYT 205
Query: 274 QRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAY 333
R SF S LV + + + N L+PL +L +
Sbjct: 206 SR----------SFPSKLVGITMDSSIEENTMLSDDNGFP--------VLIPL-VDILNH 246
Query: 334 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
K + + L+ G + GP+ N +LL+ YGFV +NP D L ++
Sbjct: 247 KPNTKIIWEPTQTSFSLITPETISEGSQVFNNYGPKGNEELLMGYGFVIPENPGDSLAMK 306
Query: 394 AALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGY--------- 443
++ P+ Q ++ QR K + +VFH+ + + +P L +
Sbjct: 307 FTIS---PRGQAAQIWEQRALKQTWREVFHLTKSADSGQKTSTVPALESDWPEAFVDLFR 363
Query: 444 --VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 484
V++ +E+ + + P+S E AV L K +LA
Sbjct: 364 ILVANENEIDDLENGDINATPISIRNELAVALGLKAAIKQKLA 406
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 76/347 (21%), Positives = 134/347 (38%), Gaps = 67/347 (19%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +G + E PS +E + A D+ GD +P++L+++ +
Sbjct: 26 WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81
Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ L T L LALY+M E + ++SF+ PY+R L ES LL
Sbjct: 82 KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ +L T + ++ ++L + RE T+ F++ S YP + +TFE+F
Sbjct: 142 DHKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDF 187
Query: 262 AFVAVQSCVV--HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
A+ +Q+ L+ P FA L H T+ N
Sbjct: 188 AWRTIQARAFGKRLKSSALVP------FADCLNHGNVQ---TKYDFDVGGNGT------- 231
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
F L P G + Y ++ G + N LL++YG
Sbjct: 232 -FRLFPSG-------------------------NNRYPRNSEVLNSYGRRANDNLLLDYG 265
Query: 380 FVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHV 423
F DN +D V +L + + P + ++ + +G+ +V++ V
Sbjct: 266 FAMLDNEWDAAEVICSLPPSHDQSPLDRRRKACLRASGQHTVRILRV 312
>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 392
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
K+E+ DL W NG + E +++ + A+ +++A +
Sbjct: 74 KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121
Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
VP L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++PL + E E+ YL + ++ + + R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216
>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
[Brachypodium distachyon]
Length = 495
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 70/320 (21%), Positives = 121/320 (37%), Gaps = 28/320 (8%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM K+G V+ + + YV A L+ GD ++P +T R
Sbjct: 13 FKRWMSKHG-----VVCSDALCLDASEAGGVYVRALSALREGDLVATIPRRACLT-PRTS 66
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G +L LA+ +MYE+ +G +S W Y+R + PL+W
Sbjct: 67 GAAAAI---EAAELGGTLALAVAVMYERARGAESPWNAYLRLIPD-------CEPVPLVW 116
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ E L+G+ + + E + ++ E +G L + E F+ E +
Sbjct: 117 PDEEAERLLSGTELDKIVKQDREFLCEDWKECIEPLISSGDL------GVNPEDFSLEKY 170
Query: 260 KQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLAR 319
A + S H+ + G A H T +S + E
Sbjct: 171 FAAKSLLSSRSFHIDSYHGS---GMVPLADLFNHKTDGEHVHFTKVSDASDSDEGEDDDD 227
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
+ P + S+ + D+ +++++ R AGE + G N+ LL YG
Sbjct: 228 QSNAGSDEEPTVENSATNPS--GYNDEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYG 285
Query: 380 FVDEDNPYDRLVVEAALNTE 399
F + DNPYD + ++ L T+
Sbjct: 286 FTELDNPYDIVNIDLTLVTK 305
>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
Length = 254
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 66/158 (41%), Gaps = 25/158 (15%)
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
+ K+ L V+ VQL+ P KAGE I ++ G + L +GF D DNP D + E
Sbjct: 72 RYKSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELD 127
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
L +E + Q K M +V+ ++ D LP RL +
Sbjct: 128 L-SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAML 166
Query: 456 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 493
+ + +S E+ V + + A AGYP L ED
Sbjct: 167 RILHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEED 204
>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
gi|194701488|gb|ACF84828.1| unknown [Zea mays]
gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
Length = 495
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 121/324 (37%), Gaps = 34/324 (10%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM +G V+ + S + +V A+ L+ GD ++P +T R
Sbjct: 15 FKRWMRAHG-----VVCSDALSLDVSDPLGVHVRAATPLRDGDLVATIPRGACLT-PRTT 68
Query: 141 GNETIAELLTTNKLSELACLALYL--MYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-P 197
G E CLAL + MYE+ QG S W Y++ L ES P
Sbjct: 69 GAAAAIEAAELG-----GCLALTVAVMYERAQGADSPWDAYLQLLPD--------CESVP 115
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
L+W E L G+ + + E + ++ E ++G L D+ + F+
Sbjct: 116 LVWPAGEAECLLAGTELDKIVKQDKEFLCEDWKECIEPLMLSGEL------DVDPDDFSL 169
Query: 257 EIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTR-TPLSPTHNPKEKV 315
E + A V S + + + +H +T C T S + +
Sbjct: 170 EKYLSAKTLVSSRSFQIDSYHGSGMVPLADLFNH----KTDCEHVHFTSASDASDSDGEE 225
Query: 316 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 375
R P + A D+ +++++ R GE + G N+ LL
Sbjct: 226 EEDDRSDASADDKPTTKNPTSSPPGSRANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALL 285
Query: 376 INYGFVDEDNPYDRLVVEAALNTE 399
YGF + DN YD + ++ AL T+
Sbjct: 286 HRYGFTELDNQYDIVNIDLALVTK 309
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 359 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 411
G I + GP N +LL YGFV E N +DRL EAA+ E + +R +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
L+ + H GR A ++ LR+ ++++ E ++ + GP + E V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473
Query: 472 LDQLADYFKARLAGYPATLSEDEAMLTD 499
+AD + L + +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 129/337 (38%), Gaps = 69/337 (20%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 255
P+ W+E E+ L GS ++ ER + I+ +Y G + YP P + T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258
Query: 256 FEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
E FK A + C C+ +F + L+TS
Sbjct: 259 LEEFKWA-----------RMCVCS-----RNFGLDINGLRTS------------------ 284
Query: 316 SLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 374
ALVP L Y + K + G + G + N +
Sbjct: 285 ------ALVPYADMLNHYRPRETKWTYDNNRGGFTITTLHRILGGAQVYDSYGQKCNHRF 338
Query: 375 LINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 428
L+NYGF E+N + + + L+ DP Q K + +G +V + G +
Sbjct: 339 LLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKARFWRMDGPEQRRV-RLCVG-D 396
Query: 429 KEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVSPCMERAVLDQLADYFKARLAG 485
+A+ L LR+ V+D +EM + + ++ + P+S E A +++L L
Sbjct: 397 TDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLSVRNEVAAMERLLLLTTGALDA 455
Query: 486 YPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 520
YP TL ED A L + L P +R A V EK +L
Sbjct: 456 YPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 9/144 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL AG +P S + + N TI+ LL ++ + L L +YE +
Sbjct: 40 VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNE 230
KS W PY++ + +G ++V P WSE L G T + L A ++E Y
Sbjct: 96 EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRG--TTLDTLYDALSPQQEVYEG 152
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF 254
+ +A Q++ +P E F
Sbjct: 153 FEISLHVAKKWNQEFSLPLPEEYF 176
>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
Length = 770
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
A ED+Q +A +PN ++T+ER +E + +++ + L +++M E
Sbjct: 67 AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126
Query: 168 KKQGKKSFWLPYIRELD 184
++G +SFW PY +D
Sbjct: 127 FQKGDQSFWYPYFNAVD 143
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 71/173 (41%), Gaps = 12/173 (6%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 66 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122
Query: 141 G---NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDI 249
L WS E L S T+ + + +Y + + G S Q +D+
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYGWIQSALARGGISGLQADSFDL 230
>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 518
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 91/211 (43%), Gaps = 25/211 (11%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
D+ + +P K G+ I G + N LL+ YGF N YD + +N Q
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309
Query: 405 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 445
+ +++V Q + K+++ + +K I+ D++ YLRL +
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369
Query: 446 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 503
+++ V+ SL PVSP E VL L+ + T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425
Query: 504 PKKRVATQLVRMEKKMLNACLQVTADMIMLL 534
K R A +K++L + + ++LL
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILL 456
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 140 EWDGSKYD 147
>gi|255078794|ref|XP_002502977.1| set domain protein [Micromonas sp. RCC299]
gi|226518243|gb|ACO64235.1| set domain protein [Micromonas sp. RCC299]
Length = 536
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 58/231 (25%), Positives = 87/231 (37%), Gaps = 29/231 (12%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA L+ E+ +G+ S + PYI + + SPL+W++ E+A L P AE
Sbjct: 154 LACRLLQERAKGEDSDYAPYIALIP------ESVPGSPLMWTDDEVASLQYPPAVAE--- 204
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
RE + WF L + P + + FK A VVH + G
Sbjct: 205 -----AREMRDAVATWFR--KLSAEAP--VALAGADLDAFKSAV-----SVVHSRTYGVA 250
Query: 280 PQMGTESFASHLVHLQT--SCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKC 337
E + L+ L + P S + +P + A + P PP A S
Sbjct: 251 SSASGEGYFRALLPLADLLNHGGDEYPESAS-SPANRGGKANKSPASPKWPP--AGCSDN 307
Query: 338 KAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 387
A D+ V + R E + G + N L+ YGFV NP+
Sbjct: 308 IAWSELSDEGVIEFAATRAIAPHEEAAMSYGERSNDHFLVYYGFVPPRNPH 358
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 11/137 (8%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 60 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171
Query: 198 LLWSETELAYLTGSPTK 214
L WS E L S T+
Sbjct: 172 LFWSTAEQGVLAKSSTR 188
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 333 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
+S A + ++ V+LV R KAG+ I + G N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 48/111 (43%), Gaps = 2/111 (1%)
Query: 286 SFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 345
S SH +L + T E++ R A++P+ C+A A+
Sbjct: 145 SNVSHEQYLHSWLLVNTRSFYCTTPSMERLPHDDRLAILPVADLFNHADVGCEAQFAS-- 202
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
+ + DR Y+AGE + + G LL YGFV +N +D + ++ A+
Sbjct: 203 ENYSFIADRTYRAGEELYISYGTHSTDFLLAEYGFVPAENRWDVVCLDEAI 253
>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
Length = 607
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 97/256 (37%), Gaps = 49/256 (19%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+A + +L+ A LAL++++E +S Y+ L G+ +V PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+A L G+PT +L RA+ + + L +P E F + A
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAAL---------FGSGGGGGVPLEKFAW-----ALS 239
Query: 265 AVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALV 324
+V S R G + V +C + L+P N +E V+
Sbjct: 240 SVLSRAASGDRMPYAFLPGVDLLNHGGV--DANCELSAVKLAPGGN-EENVTWG------ 290
Query: 325 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDE 383
D V V D P AGE + + G + N +LL YGF
Sbjct: 291 --------------------DVEVTCVKDTP--AGEQLTISYGDESDNCRLLRLYGFATR 328
Query: 384 DNPYDRLVVEAALNTE 399
N +DR +E L +
Sbjct: 329 GNVHDRRTIELRLTGD 344
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 104/281 (37%), Gaps = 53/281 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A D G+ +P TL L ++ + + + + +AL++ E+ +G+K+
Sbjct: 26 ALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKA 84
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
++ L R + ++ WSE EL LTG+ E + E K +Y L
Sbjct: 85 TRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKK 137
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMG-TESFASHLVH 293
G +++ E + + ++ + + C + + T M T +H
Sbjct: 138 MEAIGEGGWMREHEVDYERYAW---ARSNLWSRQCDLLMPDGKRTRAMVPTFDIFNH--- 191
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
+ + PL TH L C + AA D
Sbjct: 192 ------SAKAPLGKTHK--------------------LNAEKNCVTVYAADD-------- 217
Query: 354 RPYKAGES--IVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
YKAGE I G NSKLL YGF +DNPY+ L V
Sbjct: 218 --YKAGEQAFISYGSGEAANSKLLTWYGFCIDDNPYEELDV 256
>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
Length = 439
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 132/353 (37%), Gaps = 66/353 (18%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFR----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDATLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 RFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR-- 275
E+ ++ ++ + LF + I F++ + A+ V + V+++R
Sbjct: 160 WEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMKRPP 215
Query: 276 -CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYS 334
+P+ T + A +L L H+P +V A
Sbjct: 216 QLCLSPEPDTCALAPYLDLLN-------------HSPDVQVKAA------------FNEE 250
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 387
++C + A R K E + + GP N +LL+ YGFV NP+
Sbjct: 251 TRCYEIRTAT---------RCGKHKEVFICY-GPHDNHRLLLEYGFVSVSNPH 293
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 47/110 (42%)
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
L S TR L P+ + + S AL+P K + AAV ++
Sbjct: 288 LYRWAVSTVMTRQNLVPSEKQESQDSPKFISALIPYWDMANHKPGKITSFYAAVSRQLEC 347
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
+AGE ++ G + N+ LL++ GFVD +N D + + L+ D
Sbjct: 348 TAQEAVEAGEQFFIYYGDRSNTDLLVHNGFVDVNNLKDYVNIRVGLSPTD 397
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
Length = 512
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A++D+ AG VP L++++E + + L ++ L+ LA++L+ E +
Sbjct: 37 VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96
Query: 171 GKKSFWLPYIREL 183
G SFW PY+R L
Sbjct: 97 GAASFWQPYLRSL 109
>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
Length = 440
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 78/353 (22%), Positives = 133/353 (37%), Gaps = 66/353 (18%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F+ SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFT----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDTTLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 HFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQR-- 275
E+ + ++ + LF + I F++ + A+ AV + V+++R
Sbjct: 160 WEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMKRPP 215
Query: 276 -CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYS 334
+P+ T + A +L L H+P +V A
Sbjct: 216 LLCLSPEPDTCALAPYLDLLN-------------HSPDVQVKAA------------FNEE 250
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 387
++C + A R K E + + GP N +LL+ YGFV NP+
Sbjct: 251 TRCYEIRTAT---------RCGKHKEVFICY-GPHDNHRLLLEYGFVCVSNPH 293
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 84/200 (42%), Gaps = 8/200 (4%)
Query: 336 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
+C A D + ++ + Y+ G+ I ++ G PN++LL YGFV DNP D +
Sbjct: 1088 QCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQ 1146
Query: 396 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 455
+ P Y+ K + G S + A + ++L YLR + D +++ +
Sbjct: 1147 TSPMAPLYEQKERLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTL 1203
Query: 456 SL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQ 511
L G V+ E VL L D + L G+ L + EA L Y A Q
Sbjct: 1204 QLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQ 1263
Query: 512 LVRMEKKMLNACLQVTADMI 531
+ E+ +L + DM+
Sbjct: 1264 VSAGEQGILTRAKKTAEDML 1283
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 7/188 (3%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
+ ++ + Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298
Query: 408 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSLGPICPVSP 465
+ G S + A + ++L YLR + +D ++M + + G V+
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRLDAADVADMTLQLLN-GTDGKVND 355
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 523
E VL L D + L G+ L + EA L Y A Q+ E+ +L
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415
Query: 524 LQVTADMI 531
+ DM+
Sbjct: 416 KKTAEDML 423
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+VPL L A + +C A L D +++ +P +AGE I GP P S LL YG+V
Sbjct: 223 GMVPLADMLNADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYV 282
Query: 382 DEDNPYDRLVVE 393
DN VVE
Sbjct: 283 T-DNYAQYDVVE 293
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 13/136 (9%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ K+G I E + + R V AS+++ + F +P + ++++E +L
Sbjct: 13 AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEI 217
E ELA L S +I
Sbjct: 121 EDELAELQASAVVGKI 136
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 104/265 (39%), Gaps = 67/265 (25%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
A++L+ + G++SFW PYI+ L + A+ PLLW E++L +L G+ + + +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-----VQSCVVHLQ 274
+ + +KR W A Q+Y +D P++ FT E+ A+ S ++
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPD 218
Query: 275 RCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPT-------------HNPKEK-VSLARR 320
P+ HLV + T L P +N +K +S+++
Sbjct: 219 VANIKPEF------QHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKN 272
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
L P P +AY + + N+ LL ++GF
Sbjct: 273 IELKPGDPFYIAYDKETQRF-----------------------------NNTVLLKDFGF 303
Query: 381 VDEDNPYDRLVVEAALNTEDPQYQD 405
+ DN LV+ + + P + D
Sbjct: 304 ILPDNEAAELVLSSPFDLTRPMHLD 328
>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
Length = 452
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 32/59 (54%), Gaps = 5/59 (8%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL---NTEDPQYQDKR 407
R Y+ GE + + G N +LL NYGF +NPYD L + AAL N DP + KR
Sbjct: 257 RSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQPNEADPAFAQKR 315
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 9/120 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EY + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEYISVKNV 124
>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 421
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312
>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 889
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 15/135 (11%)
Query: 79 GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
G L+ W+ GLPP KV + + V E L +VP L++T +
Sbjct: 32 GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
L + LL + + + LA +L ++Q G K+ W Y+ L Q G
Sbjct: 86 ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138
Query: 197 PLLWSETELAYLTGS 211
L W+ E+ L G+
Sbjct: 139 VLEWASEEVDLLRGT 153
>gi|402581480|gb|EJW75428.1| hypothetical protein WUBG_13665, partial [Wuchereria bancrofti]
Length = 118
Score = 43.1 bits (100), Expect = 0.37, Method: Composition-based stats.
Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 9/101 (8%)
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYS--SKCKAMLAAV 344
+A H+V+ T C L P + E SLA +VPL +L +S S+C A+
Sbjct: 16 WAWHIVN--TRCIYRNNKLHPLIDNTEDDSLA----IVPL-IDMLNHSNDSQCCAIWDGK 68
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 385
+ +++V RP + GE I + G N L I YGF +DN
Sbjct: 69 LNLCKVIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKDN 109
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 63/296 (21%), Positives = 110/296 (37%), Gaps = 62/296 (20%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
LF Q+P P + FT E +K A V S + FA L
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDFVLADGNSIRLLAPFADML 196
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
H T+ ++P SS ++LA D
Sbjct: 197 NH------TSEVKQCHVYDP----------------------SSGNLSVLAGKD------ 222
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 223 ----YEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 43.1 bits (100), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 65/317 (20%), Positives = 124/317 (39%), Gaps = 57/317 (17%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LK++ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 30 HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T +S L L +L+ EK G KS W PY+ L + +
Sbjct: 90 IRS-YLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E E+ L P + + E+ +++ + + SLF + D+ F +
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYH 196
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
F A+ + + V+++ H Q C + +
Sbjct: 197 AFLWAWCTINTRTVYMK------------------HAQKQCLSAEPDV------------ 226
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAV--QLVVDRPYKAGESIVVWCGPQPNSKLL 375
+AL P LL +S + A ++ ++ K E + + GP N +LL
Sbjct: 227 ---YALAPY-LDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFICYGPHDNHRLL 282
Query: 376 INYGFVDEDNPYDRLVV 392
+ YGFV +NP+ + +
Sbjct: 283 LEYGFVASNNPHSAVYI 299
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 43.1 bits (100), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 222 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 281
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 282 EWDGSKYD 289
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 43.1 bits (100), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
R AL+P+ + C+ + + +V DR YK GE + + N L+ YG
Sbjct: 179 RLALIPVADLFNHADAGCRVYYSP--EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYG 236
Query: 380 FVDEDNPYDRLVVEAAL 396
FV ++NP D + ++ +
Sbjct: 237 FVPDENPSDDVYIDDVI 253
>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 165
Score = 43.1 bits (100), Expect = 0.39, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
V A E++ A ++PN+L+++ V +E L L + ++ LALYL
Sbjct: 51 VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
M EK +G+ SFW PY+ Q+A ES L W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 85/230 (36%), Gaps = 44/230 (19%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L L +++E +G S W PY+ L + E+P+ WS ELA L SP A +
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155
Query: 219 ERAEG-------IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
RAEG I E + ++F AG A + + V
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGG---------------------AAMGDEELVE 194
Query: 272 HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLL 331
R G T ++ L+ + E +VP+ +L
Sbjct: 195 LAHRMGST-------IMAYAFDLERDDDAMDEDDAEGDGWVEDRDGRTVMGMVPMA-DIL 246
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
++ A + ++A+ + R AGE I+ + GP PN +L YG+
Sbjct: 247 NADAEFNAHINHSEEALVAISLRKIPAGEEILNYYGPLPNGQLCRRYGYT 296
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 53/112 (47%), Gaps = 17/112 (15%)
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTEDP 401
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 315 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEEDS 374
Query: 402 ----------QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 443
+ +RM+ R + +Q + ++ + YLRLG+
Sbjct: 375 SSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVHCYFVYLRLGH 421
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 52/214 (24%), Positives = 92/214 (42%), Gaps = 15/214 (7%)
Query: 341 LAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
+ D+ Q++ V + AG ++ + G N+KLL +YGFV ++N + +
Sbjct: 215 FVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKLLFSYGFVAKENSRRAVDFWMKIP 274
Query: 398 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISS 456
DP + K+ V N Q + E + +L LR+ +++ E++ +
Sbjct: 275 PNDPYLKLKQTVLDSNELTRDQTYDFCGTLFENDVDERLLATLRVILMNE-QEIR-LYKK 332
Query: 457 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRM 515
+S E AV + L + + +LA + TL EDEA+L + R++ + VR+
Sbjct: 333 AFETSIISIRNELAVYENLQNTCRRKLANFATTLEEDEAILAEMATESSPRLSFAVRVRV 392
Query: 516 EKKM--------LNACLQVTADMIMLLPDVTVSP 541
E K L QV A + + P T P
Sbjct: 393 EDKQVLTGVIDTLEKWKQVLASNLEMYPPSTTRP 426
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 7/101 (6%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A + L +G +P L + +E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
SF+ P+I L + P+ WSE+EL L G+
Sbjct: 95 RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGT 128
>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
Length = 209
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 382 DED-NPYD 388
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
Length = 491
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 68/324 (20%), Positives = 122/324 (37%), Gaps = 36/324 (11%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM +G V+ + S + +V A L+ GD ++P +T R
Sbjct: 13 FKRWMRAHG-----VVCSDALSLDVSDPLGVHVRAVTPLRDGDLVATIPRGACLT-PRTT 66
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLL 199
G +L LA+ +MYE+ +G S W Y++ L DR+ PL+
Sbjct: 67 GAAAAI---EAAELGGCLALAVAVMYERARGTDSPWDAYLQLLPDRE--------SVPLV 115
Query: 200 WSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W E L G+ + + E + ++ E ++G L D+ + F+ E
Sbjct: 116 WPADEAECLLAGTELDKIVKQDREFLCEDWKECIEPLLLSGEL------DVDPDDFSLEK 169
Query: 259 FKQAFVAVQSCVVHLQR---CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKV 315
+ A V S + G P + + H+ + + + ++
Sbjct: 170 YFSAKTLVSSRSFQIDSYHGFGMVPLADLFNHKTDCEHVHFTSASDASDSDGEDADDDQS 229
Query: 316 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 375
+ P + SK D+ +++++ R GE + G N+ LL
Sbjct: 230 DASADDESTIENPTSSSPGSK--------DEDLEMIIVRDVNEGEEVYNTYGTMGNAALL 281
Query: 376 INYGFVDEDNPYDRLVVEAALNTE 399
YGF + DN YD + ++ AL T+
Sbjct: 282 HRYGFTELDNQYDIVNIDLALVTK 305
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 18/79 (22%), Positives = 43/79 (54%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
R YK E + ++ G + N++ +++ GFV ++N +D L ++ L+ D ++ KR + ++
Sbjct: 96 REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155
Query: 414 GKLSVQVFHVHAGREKEAI 432
+ VF + + + +
Sbjct: 156 KIPTSDVFELKKAPDGDGV 174
>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
Length = 777
Score = 43.1 bits (100), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 74/165 (44%), Gaps = 31/165 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLM 165
VAA + + +A +PN L++ +++ +E E T K S+ L ++
Sbjct: 112 VAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPNEFKNTEK-SDQTSLIFFVA 170
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVES--PLLWSETELAYLTGSPTKAEILERAEG 223
E +G++S+W PY + A +S P W + + L + KAE+
Sbjct: 171 LELLKGEESYWHPYF----------ETAQDSDLPQFWEDQNIDELEDALIKAEL------ 214
Query: 224 IKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
+ +++D + + +A + YP + E FT EI+K+A+ V
Sbjct: 215 ---QMHQVDFIGDYEIAHGIANHYPDLVHAEKFTIEIYKRAYNIV 256
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 43.1 bits (100), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 29/46 (63%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
++QLV R + G+++ + G N +LL++YGF DNP+DR V
Sbjct: 398 SIQLVAGRDLQPGDALTISYGNLTNPQLLLDYGFTLSDNPHDRFEV 443
>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
Length = 504
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|24640264|ref|NP_727144.1| CG32732 [Drosophila melanogaster]
gi|22831862|gb|AAF46222.2| CG32732 [Drosophila melanogaster]
gi|28316927|gb|AAO39485.1| RE55639p [Drosophila melanogaster]
gi|220957744|gb|ACL91415.1| CG32732-PA [synthetic construct]
Length = 537
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 62/292 (21%), Positives = 104/292 (35%), Gaps = 49/292 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A+ L + SVP L+++ E R+ G T A L LA L+ EK
Sbjct: 149 ATRPLAKDELVLSVPRKLILSEENNSDCRLFGKMTQATHLN---------LAYDLVIEKI 199
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G+ S W PYI L + + L ++ ++ L G+ A + + I ++Y
Sbjct: 200 RGEFSEWRPYIDVLPAK-------YNTVLYFTTKQMELLRGTAAAALAMRQCRVIAKQY- 251
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTE-SFA 288
AF ++K A + + G T+
Sbjct: 252 -----------------------AF---LYKYAHTMTEPSTGNRSHPGERGLFFTQHGLC 285
Query: 289 SHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 348
L S TR L P+ + + AL+P K + A V +
Sbjct: 286 YKLYRWAVSTVMTRQNLVPSEKQESEDGPKLISALIPYWDMANHRPGKITSFYATVSRQL 345
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
+ GE ++ G + N+ LL++ GFVD +N D + + L+ D
Sbjct: 346 ECTAQEAVNTGEQFFIYYGDRSNTDLLVHNGFVDPNNTKDYVNIRVGLSLTD 397
>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
Length = 494
Score = 42.7 bits (99), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 99/243 (40%), Gaps = 43/243 (17%)
Query: 152 NKLSELACLALYLMYEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
N++ L + ++YE + K+ S W PY + ++ L + W E EL L
Sbjct: 82 NEIGSWEGLIICMLYEMEVLKENSQWAPYFKVWNKPTDMNVL-----IFWDEGELELLQP 136
Query: 211 SPTKAEILERAEGIKREYNEL--DTVWF--MAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S +LER K+E E+ T+ F + F D + FT+ +
Sbjct: 137 SL----VLERIG--KKEAKEMYERTIEFIKLIDGEFAAAAMDFGFDDFTY---------I 181
Query: 267 QSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPL 326
S ++ SF + + TS S + +E S +++PL
Sbjct: 182 ASIILSY------------SFDVEIRDMNTSENK-----SDDSDEEESKSACYLKSMIPL 224
Query: 327 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-N 385
L A +SKC A L +++++ R + E + G PNS++L YG+V+ D +
Sbjct: 225 ADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYVEWDGS 284
Query: 386 PYD 388
YD
Sbjct: 285 KYD 287
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 42.7 bits (99), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 85/193 (44%), Gaps = 20/193 (10%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+VPL L A + + A L D A+ + +P K G+ I G P S LL YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSDLLRRYGYV 289
Query: 382 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 431
DN VVE L N E +Y +++ + ++ + + +++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILEDGYCILRPSAEDS 346
Query: 432 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 487
++D+LP L + E+Q + S P P+ E + L D K++L+ Y
Sbjct: 347 LTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI---LLDSVKSKLSQYG 403
Query: 488 ATLSEDEAMLTDY 500
T+ +D+A+L +
Sbjct: 404 TTVEQDKAILQQF 416
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 9/68 (13%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALN 397
++ +R Y AGE ++ G PN KLL++YGF+ D+D D +++ N
Sbjct: 216 GYEVTANRDYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSN 275
Query: 398 TEDPQYQD 405
T Q QD
Sbjct: 276 TTRDQLQD 283
>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3-like [Glycine max]
Length = 449
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 396
D +++V + K + +++ G N L++YGFV NPYD L + AL
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291
Query: 397 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 449
N P + +++Q N V G ++ +L LR+ ++
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351
Query: 450 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
MQ S++ SL P+ E AV L L +P + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 510
Score = 42.7 bits (99), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DL+ G+ VP S ++T + L + + + + LS L + L+YE +G
Sbjct: 57 LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116
Query: 172 KKSFWLPYIRELDR 185
+ SFW PY+ L R
Sbjct: 117 QSSFWYPYLMHLPR 130
>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
gallopavo]
Length = 241
Score = 42.7 bits (99), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 9/108 (8%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + + LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
L N + E + K +S L L +L+ EK G+KS W PY+ L +
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK 134
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 37/173 (21%), Positives = 82/173 (47%), Gaps = 15/173 (8%)
Query: 362 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 421
I++ G + +++ L+ GFV NP++ + + ++ D + + + S +
Sbjct: 58 ILMDYGKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLIL 117
Query: 422 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVL 472
H+ + ++SD + + R+ +V D+ ++ + +S P+CP P ++A+
Sbjct: 118 HITG--DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA 174
Query: 473 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 525
L F+ ++ Y +SEDE NL P +R +L E ++L +C++
Sbjct: 175 -FLIMRFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 37/67 (55%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
+ ++ + Y+A + + ++ GP PNS+LL YGFV DNP D + + + P Y+ K+
Sbjct: 216 LSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHPLAPFYEQKQ 275
Query: 408 MVAQRNG 414
+ G
Sbjct: 276 KLWASAG 282
>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
Length = 762
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 44/197 (22%), Positives = 75/197 (38%), Gaps = 29/197 (14%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
GS VV+ + +W+ G K+ +K H R + AA E G+
Sbjct: 11 GSSAVVTS------EFVAWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETL 57
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPY 179
SVP +L++T+++ +A L + + LAL+L ++ + W PY
Sbjct: 58 LSVPEALLLTVDKASRRADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPY 113
Query: 180 IRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG 239
+ R P W + A L GSP +++ R + I+R+ L
Sbjct: 114 RNVISRS------VSHLPCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDR 167
Query: 240 SLFQQYPYDIPTEAFTF 256
F + + AF F
Sbjct: 168 QAFAFAEAQVLSRAFAF 184
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
Length = 195
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 400
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 14 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72
>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
Length = 504
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 42.7 bits (99), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 105/262 (40%), Gaps = 54/262 (20%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
+LM + + ++ FW PYI+ L G + +PLL+ E + LA+L + A
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVVHLQR 275
R + K Y + A S+ Q D+ E +T++++ A + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQ----DLGVENARLYTWDLYLWASTIISSRA----- 236
Query: 276 CGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSS 335
+ AS + LQT+ R + L+PL + A +
Sbjct: 237 ------FTAKVLASVIPKLQTAEEGDRISV-----------------LLPL---IDATNH 270
Query: 336 K--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
K K A D++ LVV +AG+ + GP+ N +L++NYGF DNP + VV
Sbjct: 271 KPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNNYGPRNNEQLMMNYGFCIPDNPCEYRVVS 330
Query: 394 AALNTEDPQYQDKRMVAQRNGK 415
+ P Q K Q K
Sbjct: 331 LRAPPDSPLAQIKAQYEQHCSK 352
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 71/163 (43%), Gaps = 13/163 (7%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
+ ++ + Y G+ + ++ GP PN++LL YGFV +NP+D + + P Y+ K
Sbjct: 221 LSILASKDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKE 280
Query: 408 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPIC 461
+ + G + + A +D LP L Y+ +S++ ++ G
Sbjct: 281 RLWKLAGLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADE 333
Query: 462 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 504
+S E +L L D A L G+ L A L +++P
Sbjct: 334 KISDDSETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 7/100 (7%)
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T ER L + +L N + +L+ E+ +GK+SFW PYI L +L++
Sbjct: 90 TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143
Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDT 233
PLLW +ET L GSP I R + + ++ L T
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLT 183
>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
rotundata]
Length = 277
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---DPQYQDKRMVA 410
R +K G+ I + GP+PNS ++ GFV D+ +D L E DP ++R +
Sbjct: 80 RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 441
++ F V++GRE S +L Y+R+
Sbjct: 140 KKLHLQPWSEFVVNSGREPIPGS-VLAYMRV 169
>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
Length = 485
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 113/287 (39%), Gaps = 31/287 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ AS D+ + F +P L+++++ + L +L L + ++YE QG+
Sbjct: 51 ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W PY L + ++ + W++ +L+ L GS +I + A D
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKIGKAAAD--------D 153
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
T+ L Q P + ++ QS ++ L + M ++A +
Sbjct: 154 TILQKVVPLIQANSLHFPPRS---DMPPLNSPDSQSALLSLAHRMASLIM---AYAFDIE 207
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
+ + T T + E +VPL A + + A L + + +
Sbjct: 208 KAEEADEDTAEDGYMTDDEDEPAK-----GMVPLADIFNADAQRNNARLFQEEGSFVMKA 262
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNT 398
R +GE I G P + LL YG+V D YD VVE +L++
Sbjct: 263 VRNIHSGEEIFNDYGELPRADLLRRYGYVTDNYTQYD--VVEFSLDS 307
>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
Length = 429
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 54/133 (40%), Gaps = 6/133 (4%)
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
P E F+ H + + T L P +P E + A A+VP ++ +
Sbjct: 196 PDTDWEKFSYHWLIVNTRSFYY---LMPGQDPPEDTNDA--MAMVPFAD-YFNHTDDAEC 249
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
+ + R YK GE I + GP PN L + YGF E N D + ++ + +
Sbjct: 250 EVHFDGKSYTFRATRLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKD 309
Query: 400 DPQYQDKRMVAQR 412
+ + ++ QR
Sbjct: 310 FTVAEKEELIRQR 322
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 65/315 (20%), Positives = 120/315 (38%), Gaps = 53/315 (16%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LKE+ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 29 HKLEFIELRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T T +S L L +L+ E G KS W PY+ L + +
Sbjct: 89 IKS-YLGDYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E ++ L P K + E+ ++ + + SLF + D+ F +
Sbjct: 140 LVCLEPQVVRLLPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTE---DVK-HIFHYH 195
Query: 258 IFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSL 317
F A+ + + V+++ H Q C + + +L
Sbjct: 196 AFLWAWCTINTRTVYMK------------------HAQKKCLSAEPDV---------YAL 228
Query: 318 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 377
A L+ P + + A ++ K E + + GP N +LL+
Sbjct: 229 APYLDLLNHSPGV-----QVNAAFNEKTRCYEIRTTSSCKKYEELFICYGPHDNHRLLLE 283
Query: 378 YGFVDEDNPYDRLVV 392
YGFV +NP+ + V
Sbjct: 284 YGFVAINNPHSAVYV 298
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 28/46 (60%)
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 395
LV D P AG+ + GP+ N + L+ YGF+ +NP D +V++ A
Sbjct: 225 LVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 416
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 187 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 246
Query: 417 SVQVF 421
S F
Sbjct: 247 SFDFF 251
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++A+ L +L L + +M E+ G +S W Y L RG L P+ W+E
Sbjct: 85 SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L L G+ + E AE ++ +Y+E + L +P E TFE + +A
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 62/296 (20%), Positives = 109/296 (36%), Gaps = 62/296 (20%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
LF Q+P P + FT E +K A V S + FA L
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDFVLADGNSIRLLAPFADML 196
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
H T+ ++P SS ++ A D
Sbjct: 197 NH------TSEVKQCHVYDP----------------------SSGTLSVFAGKD------ 222
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 223 ----YEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 131/355 (36%), Gaps = 63/355 (17%)
Query: 71 VSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP- 129
+ K E + W ++G+ + L E + +AA++++ G+ S+P
Sbjct: 67 IQKSGEGPLGFQEWALQSGITSPSLRLAEFAG-------LRGMAAADNIAKGEVLVSLPV 119
Query: 130 -NSLVVT-LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
+LVV+ ER T +K +AL L+YE++ G S PY+ L
Sbjct: 120 AAALVVSPKERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALP--- 176
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ +PL WSE +L L E+ + EG+KR + EL
Sbjct: 177 ----VDFSTPLSWSEAQLQALCYPQLIREVATQREGLKRLHAELAV-------------- 218
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSP 307
P T + A AV+S R P G ++ S L ++
Sbjct: 219 STPGTPITEQDLIWALQAVRS------RAFSGPYAG-PTWRSRLKTFGALGALAAASITV 271
Query: 308 THNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKA---- 358
H ++ A F L L +L+ K AM VD VQ V+ Y A
Sbjct: 272 AHVLNGAIA-AALFNL--LYDVVLSQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFS 328
Query: 359 ---------GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV----EAALNTED 400
GE + + G Q N LL YGFV+ P+D + AAL D
Sbjct: 329 VRCQSYFSKGEQVFISYGKQSNDSLLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383
>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
Length = 553
Score = 42.4 bits (98), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 1/64 (1%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 404
D+ V +++ KAGE I G NS LL YGF EDNP+D + + L E Q
Sbjct: 320 DECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHDVVDLSEEL-VEYGQKN 378
Query: 405 DKRM 408
KR+
Sbjct: 379 SKRI 382
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
+VP+ +L ++ A + DDA+ RP +AG+ I+ + GP PN +LL YG+
Sbjct: 240 LGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRAGDEILNYYGPLPNGELLRRYGY 298
Query: 381 V 381
V
Sbjct: 299 V 299
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 28/110 (25%), Positives = 46/110 (41%)
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
L S TR L P+ + + + AL+P K + AAV ++
Sbjct: 288 LYRWAVSTVMTRQNLVPSEKQESEDTPKLISALIPYWDMANHRQGKITSFYAAVPRQLEC 347
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 400
AGE ++ G + N+ LL++ GFVD+ N D + + L+ D
Sbjct: 348 TAQEAVDAGEQFFIYYGDRSNTDLLVHNGFVDDYNLKDYVNIRVGLSLTD 397
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 64/305 (20%), Positives = 117/305 (38%), Gaps = 66/305 (21%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L T+ LS LA+Y+++ + + K +
Sbjct: 34 FKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFVRSR-KSGYD 92
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
P R +A S + + E +L G+ + + I+ +Y L
Sbjct: 93 GP----------RNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRIEDDYRGL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
V M G QYP P + FT E +K A
Sbjct: 143 --VVRMLG----QYPDLFPLDKFTVEDYKWAL---------------------------- 168
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA--MLAAVDDAVQ 349
CT + + + L FA +L +SS+ K + A +
Sbjct: 169 ------CTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHSSEAKQCHVYDASSGNLS 216
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 409
++ + Y+AG+ + + GP PN++LL YGFV NP D + A + P ++ K+ +
Sbjct: 217 VLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVPGNPNDSYDLVLATHPMAPFFKQKQKL 276
Query: 410 AQRNG 414
G
Sbjct: 277 WASAG 281
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 416
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 229 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 288
Query: 417 SVQVF 421
S F
Sbjct: 289 SFDFF 293
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 42.4 bits (98), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 65/308 (21%), Positives = 119/308 (38%), Gaps = 45/308 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A D+ A + SVP L+ + E L E EL N + L + L+ EK +G S
Sbjct: 158 AKRDIAAEELVLSVPRKLIFSEE--LLPEWKRELFR-NFPTHLN-VTYTLIIEKVRGAAS 213
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 214 AWQPFIDTLPTR-------YSTVLYFTVDQMQRLRGTSACSAAMRHCLVIARLYASMYKC 266
Query: 232 ------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTP-QMGT 284
D V +LF +Y +E+++ A V + R P ++ T
Sbjct: 267 AYIQPGDNVMAAKANLFTEY-------GLCYELYRWAVSTVTT------RQNLVPRELST 313
Query: 285 ESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+ L T ++ + AR + L P + +C + +
Sbjct: 314 VGEVDQVCQL--------GGFEGTEIKRDAETGARNAPISALIPYWDMTNHRCGKITSYY 365
Query: 345 DDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 401
D A Q + +KAGE ++ G + N+ L+++GF+D N D + + L+ DP
Sbjct: 366 DRAAQQMECTAQEAFKAGEQFFIYYGDRSNADRLVHHGFLDMHNLKDYVQIRLGLSPTDP 425
Query: 402 QYQDKRMV 409
+ + ++
Sbjct: 426 LVEQRSLL 433
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 42.4 bits (98), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 416
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 229 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 288
Query: 417 SVQVF 421
S F
Sbjct: 289 SFDFF 293
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 97/230 (42%), Gaps = 27/230 (11%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+VPL L A +C A L D +++ +P +AG+ I GP P S LL YG++
Sbjct: 223 GMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEIFNDYGPLPRSDLLRRYGYI 282
Query: 382 -DEDNPYD-----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGRE 428
D YD L + + E +Y D++ + ++ + F +
Sbjct: 283 TDNYAQYDVVEIPVDLVSQTLAHDGLWHEERIEYLDEQEIVDTGYDIAASIPFSL----- 337
Query: 429 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 488
+E++S L L + E + + S G + P + M L +AR+A YP
Sbjct: 338 EESLSPELVILVETMLLPREEFER-LQSKGRL-PKAEKMTGKAAKFLYKIVQARIAQYPT 395
Query: 489 TLSED-----EAMLTDYNLHPKKRVA-TQLVRM-EKKMLNACLQVTADMI 531
TL +D E ++RVA + VR+ EKK+L + AD I
Sbjct: 396 TLEQDLQISSETQPVQTMSRKERRVAMARAVRIGEKKLLVQTEERLADKI 445
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 357 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 416
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 229 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 288
Query: 417 SVQVF 421
S F
Sbjct: 289 SFDFF 293
>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
Length = 712
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
RP +AG+ + + GP PN KLL YGFV NP+D +V L + + +++ A
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501
Query: 414 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 454
L ++ H+ ++ +L LRL V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537
>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
Length = 447
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 403
+A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272
Query: 404 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 463
+ ++GKLS ++ LRL + + +SV+ + +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317
Query: 464 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 522
S E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 42.0 bits (97), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 80/184 (43%), Gaps = 31/184 (16%)
Query: 359 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 411
G I + GP N +LL YGFV E N +DRL E + E + +R +VA
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 471
L+ + + H GR A ++ LR+ ++++ E + + GP + E V
Sbjct: 419 YGLHLAGRCWICHDGRPPPA---LVALLRV-HLAEAEEFDT-MERKGPFASLGAGTEARV 473
Query: 472 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 516
+ +AD + L + +L EDE +L D N P KR+A L+RM
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532
Query: 517 KKML 520
K +
Sbjct: 533 MKRI 536
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 72/338 (21%), Positives = 132/338 (39%), Gaps = 68/338 (20%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY--------VAASEDLQAGD 123
S + E++ + KSW+ + G + H +H+ VAA D+ +
Sbjct: 3 SIEPENVANFKSWIAQQG--------------GQIHAGVHFEPVEFGFNVAARSDIPSDA 48
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELAC--LALYLMYEKKQGKKSFWL 177
S+P SL +T + I +LL T + +L C + L+ + E
Sbjct: 49 TVVSIPFSLAITPN--VARHAIKQLLNTEPQNWSERQLECTYIVLHSIVEPIDPSILRHR 106
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + +PL ++E EL+ GS L+R + E+ +
Sbjct: 107 PYLDTLPSPE-----QLRTPLHFTEAELSSFRGSNLFGATLDRKHEWETEWQQCKNTVSA 161
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVHLQTS 297
A + + Q +FT+E + A + S +F S ++ S
Sbjct: 162 AIAGWGQ--------SFTWEKYLTAATYLSS----------------RAFPSTILSDTPS 197
Query: 298 CTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRP 355
TT T + ++ AR G P+ + + + + ++ LV+ P
Sbjct: 198 LVTTETSYPVLLPGIDALNHAR-------GHPVSWVVSAPSQTSSSQRSESSISLVIHTP 250
Query: 356 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
G ++ GP+PNS+L++ YGF +NP D +V++
Sbjct: 251 TPRGSELLNNYGPKPNSELILGYGFSLPNNPDDTIVLK 288
>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 356
Score = 42.0 bits (97), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 11/100 (11%)
Query: 300 TTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 359
T +T L P H+ R A++P+ S CK A ++ +V DR Y G
Sbjct: 165 TQQTLLYPWHD---------RLAMLPVADLFNHASVGCKVSYCA--ESYDIVADREYGTG 213
Query: 360 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
+ + G N LL YGF+ ++N DR + +++E
Sbjct: 214 DEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-----------------KLS 155
+ A+ DLQ G+A VP L++T + +A L + L
Sbjct: 32 IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
LA +L+ E +G +SFW PY+++L R
Sbjct: 92 PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)
Query: 345 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 396
D+A Q +VV RPY+ GE +++ G N +LL YGFV E N +D ++ AL
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371
Query: 397 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 423
P NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 41.6 bits (96), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)
Query: 367 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 422
GP N +LL YGFV E+N +DRL ++ +R +VA L+ + +
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429
Query: 423 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 482
H GR A ++ LR+ ++++ E ++ + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484
Query: 483 LAGYPATLSEDEAMLTD 499
L + +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501
>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
Length = 437
Score = 41.6 bits (96), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 400
+++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355
>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
Length = 479
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 97/461 (21%), Positives = 173/461 (37%), Gaps = 67/461 (14%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +P+ L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W Y R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+I + A DT+ L Q P + C+
Sbjct: 135 SVVGKIGKAAAD--------DTILQKVVPLIQANSRHFPPRPNMPPLNSPDSQNALLCLA 186
Query: 272 HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLL 331
H R G ++A + + T T + E +VPL
Sbjct: 187 H--RMGSI----IMAYAFDIEKTDEADEHTADDGYMTDDEDEPAK-----GMVPLADIFN 235
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 391
A + + A L + + + + +GE I G P + LL YG+V DN V
Sbjct: 236 ADAQRNNARLFQEEGSFVMKAIKNIYSGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDV 294
Query: 392 VE---------AALNTEDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI-SDMLP 437
VE A L +P + R+ N + + +++ G ++AI D L
Sbjct: 295 VEFSLDAICKVAGLPDSEPSPSNPRLELLDNLDMLEEGYNISRIPRNGTLEDAIPEDFLV 354
Query: 438 YLRLGY--VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 495
LR + D + + + + P S E ++L L R + YP ++ EDE+
Sbjct: 355 LLRALTLPIEDLNRLGARNKAPKPEFSAS---EASLLRSLV---TLRQSEYPTSVQEDES 408
Query: 496 ML----------TDYNLHPKKRVATQLVRMEKKMLNACLQV 526
+L D L+ +K++A Q+ + EK++L L +
Sbjct: 409 ILNCLEQQNGYINDSGLN-RKKMAVQVRKGEKEILTQILSL 448
>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
Length = 177
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 330 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 389
L S CK + AV +++++ R KAG + G PN LL YGFV E+NP+D
Sbjct: 54 FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111
Query: 390 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 427
+ VE E P+ K + +R +S F + R
Sbjct: 112 IPVE----LEVPESPAKVALMERYNLVSHISFELRGFR 145
>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
Length = 573
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%)
Query: 335 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
SK + L DD V +V R GE I + GP PN+ LL GF DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388
>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 71/168 (42%), Gaps = 31/168 (18%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 408
+ Y+ GE + + G N +LL NYGF +NPYD + + + N DP + KR
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306
Query: 409 VAQR-NGKLSVQV-------FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI------ 454
+ Q G S + F+ H G+ L L ++ E+ +I
Sbjct: 307 LLQSATGSHSTDIPALRSLRFN-HDGQLAPNAEHWLEIL----LATPEELSEIITQAASQ 361
Query: 455 -----SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
S++ P+S ++ V ++ ARL + +TL ED+A L
Sbjct: 362 SGAADSTISLALPMS--LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 334 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
S +C A+ + + ++ + Y+AG+ + + G PNS+LL YGFV DNP D +
Sbjct: 202 SKQCHVYDASTGN-LSILAGKDYEAGDQVYIHYGSIPNSRLLRLYGFVIPDNPNDSYDLV 260
Query: 394 AALNTEDPQYQDKR 407
A + P ++ K+
Sbjct: 261 LATHPMAPFFEQKQ 274
>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
Length = 319
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 344 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 400
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 179 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 237
>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
Length = 1243
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 66 ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PYI+ L + + + +LW EL + S E +E E K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171
>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
Y34]
gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
P131]
Length = 390
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 313 EKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 372
E++ R L P+ + C +A D+ + DR Y AGE +++ G N
Sbjct: 173 ERLGKEDRMVLQPVADLFNHADAGC--AVAFNDEDFTIRADRDYDAGEEVLICYGNHSND 230
Query: 373 KLLINYGFVDEDNPYDRLVVEAAL 396
LL YGFV N +D + ++ A+
Sbjct: 231 FLLAEYGFVLAANRWDEVCIDDAI 254
>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 377
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 45/114 (39%), Gaps = 8/114 (7%)
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
P + E + + + + T TP + P A R ALVP+ + C A
Sbjct: 145 PDLQQEDYLHNWLAVSTRSFYYWTPQMELYPP------ADRLALVPIADLFNHADTGCGA 198
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
D + DR Y G+ I + G N LL YGFV N +D+ ++
Sbjct: 199 SFTP--DGFVVSTDRKYHVGQEIYISYGTHTNDLLLAEYGFVPMANRWDKTCLD 250
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 124/308 (40%), Gaps = 70/308 (22%)
Query: 100 KPSHNEKH-RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK----- 153
KP E+ R I +A E++ FS+P ++++ + + + +IA + +K
Sbjct: 35 KPHDGERGVRVISDIAPCEEM------FSIPEKILMSRKSCMAS-SIAHVFRKHKDVLFS 87
Query: 154 -LSELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
ELA L L ++YEK QG SFW P I L G + WSE EL L
Sbjct: 88 SRDELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDE 139
Query: 212 PTKAEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
KAE + +++ Y L + G +F + +T+E F+ A + V+S
Sbjct: 140 SLKAEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVES-- 190
Query: 271 VHLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPL 330
+F L H + P + V++ + +P
Sbjct: 191 --------------RTFGRFLPH---------PSIVPFADLLNHVNVQTSYRWLP----- 222
Query: 331 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
+ M A + V ++ GE + GP+ N++LL++YGF + N Y+ +
Sbjct: 223 --EERRAAYMCDASGEHV-------HRRGEEAFMSYGPRSNAELLLHYGFALQSNRYEAV 273
Query: 391 VVEAALNT 398
+ +NT
Sbjct: 274 ELNFRINT 281
>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
+ A +L+ G+ VP + ++T E ++ + ++ LS L++ L+YE +
Sbjct: 54 LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113
Query: 171 GKKSFWLPYIRELDR 185
GK+SFW PY+ L R
Sbjct: 114 GKRSFWYPYLVHLPR 128
>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
Length = 486
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 345 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298
>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
Length = 471
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)
Query: 342 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 391
A + V++V + K + +++ G N L++YGFV E NPYD +
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311
Query: 392 VEAALNTEDPQYQ-----DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 446
A P++ ++++Q N + V G + +L +R+ +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371
Query: 447 TSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 497
E++ + SL I P+ E AV + L+ +P + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426
>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
Length = 490
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 61/157 (38%), Gaps = 14/157 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL AGD VP S ++ R G IA LL + L +A L + +YE+ G
Sbjct: 36 VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNEL 231
+S W + L E P WS E L EI G E Y EL
Sbjct: 92 QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGTEVEEIGGLETGEDEEVYQEL 148
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+F I E +F+ FK+ V + S
Sbjct: 149 IVPFFEDNGKL------INLECPSFDEFKKLVVVIAS 179
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 4/91 (4%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
+ +L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 97 VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTV 234
L GSP I R + + ++ L T
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTA 184
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 312 KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 371
+ K++ A +A+ P S C+ A ++ DR Y+AGE + V GP N
Sbjct: 188 RAKLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTN 245
Query: 372 SKLLINYGFVDEDNPYD 388
LL+ YGF+ + N D
Sbjct: 246 DFLLVEYGFLLDANSND 262
>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
Length = 263
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 33/63 (52%)
Query: 337 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
C + D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL
Sbjct: 15 CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74
Query: 397 NTE 399
T+
Sbjct: 75 VTK 77
>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 661
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517
Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL 241
S + L+RA I E L VW+ G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 9/150 (6%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 413
R Y+ G+ + ++ G NS+LL YGFV DNP D + ++ P Y+ K QR
Sbjct: 223 RDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMAPLYEQK----QRL 278
Query: 414 GKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG--PICPVSPCMER 469
KL+ ++ + + +L YLR+ + D S++ ++ + +S E
Sbjct: 279 WKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIATESYTKISDENES 337
Query: 470 AVLDQLADYFKARLAGYPATLSEDEAMLTD 499
+L L+ +A L G+ +L + E L +
Sbjct: 338 QILLFLSQSIEALLEGFEISLEKLETQLAE 367
>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
Length = 475
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIREL 183
K S W PY+ L
Sbjct: 115 KTSRWHPYLMHL 126
>gi|58261130|ref|XP_567975.1| nucleus protein [Cryptococcus neoformans var. neoformans JEC21]
gi|134115865|ref|XP_773415.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256040|gb|EAL18768.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57230057|gb|AAW46458.1| nucleus protein, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 495
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 111/283 (39%), Gaps = 53/283 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALYLMYEKKQ 170
A +D++ G F VP+ L+++ + + L ++ +L A L L +M+E +
Sbjct: 47 AVKDIEEGTPLFHVPDDLILSAY----TSDLKDHLDASEWDQLNKGWAQLILVMMWETIK 102
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G KS W Y+ + + E+P+ W+E + L+G+ A+ I RE E
Sbjct: 103 GSKSRWAGYLANM-------PVLFETPMFWTERQREQLSGTDI-------ADRIGREDAE 148
Query: 231 LDTVWFMAGSLFQQYPYDIPTEA--FTFEIF--KQAFVAVQSCVVHLQRCGCTPQMGTES 286
+ +A + +P P ++ T + F + + + +S V L R G +
Sbjct: 149 AEYTSVLA-PFIKAHPDLFPVDSPHITMDAFHIQGSRILSRSFTVPLHRFGRS------- 200
Query: 287 FASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAML----- 341
H Q+ S + +E V ++P L A K A L
Sbjct: 201 ------HSQSRSDGNSEKESDDEDEEEMV------VMIPFADMLNAAWGKDNAHLYVDED 248
Query: 342 --AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
D+ V + + K E I PNS+LL YG VD
Sbjct: 249 TIEGFDEGVVMKSTQLVKQSEQIYNTYDSPPNSELLRKYGHVD 291
>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
Length = 699
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 345 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 88/199 (44%), Gaps = 21/199 (10%)
Query: 354 RPYKAGESIVV-WCGPQ--PNSKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQ 404
RP G+ ++ + G + N +L+++YG +NP D + + E A+ +
Sbjct: 282 RPVAEGQELLTPYGGAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRL 341
Query: 405 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPIC 461
M R +L + V KE +L + R+ YVS D S+++ V+ +
Sbjct: 342 LMAMSLDRFDRLQLPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHR 396
Query: 462 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EK 517
++P ER L+ L + Y T+ EDE ML + + P +V R+ EK
Sbjct: 397 AINPSNERRALELLLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEK 456
Query: 518 KMLNACLQVTADMIMLLPD 536
++L++ Q+ I LP+
Sbjct: 457 RILSSLWQLLDSAIEALPE 475
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
DDA+ + V + KAGE + G N+KLL +YGF DNP D++ +
Sbjct: 232 DDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNPADKVTI 279
>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
Length = 441
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 15/152 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL--- 281
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
L + P E +T+E + A+ V
Sbjct: 282 ----IPLLSNHREVFPPELYTWEHYLWAYFDV 309
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
+Q P K G + + GP N++LL+ YG+ ++DNPY +E L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 57/278 (20%), Positives = 102/278 (36%), Gaps = 52/278 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ + +A VP ++ ++ + + + L++ L L+++YEK
Sbjct: 787 VVTTKKVDENEAVVVVPKKYLINVDVAKAHPILGPIFEELHLNDDTILFLFVIYEKGNA- 845
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW P+ L + + +S TEL L G+ + E K++ N
Sbjct: 846 NSFWRPFYDTLPS-------YFTTSIHYSATELLELEGT----NLFEETLHTKQQLNSFR 894
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLV 292
F L +QYP P F++E F A + S + L+ G
Sbjct: 895 DYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQLKIDGSIK------------ 940
Query: 293 HLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 352
SC P++ N ++ RF + S+ M+++ +
Sbjct: 941 ----SCL---VPMADMINHHTNAQISERF---------FDHDSQSFKMISSCN------- 977
Query: 353 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
A I + G N +L + YGF+ +N YD L
Sbjct: 978 ---IPANNQIFLHYGALQNWELALYYGFIIPNNIYDSL 1012
>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
Length = 1916
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 391
DA + R Y AG+ + G + N++L+ NYGF++ NP+D V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 2/60 (3%)
Query: 337 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
CK +A+ +VQ DR Y GE + V GP N LL YGF+ + N +D + ++ +
Sbjct: 195 CKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYGFILDTNRWDEVYLDEVI 252
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 50/117 (42%), Gaps = 14/117 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
V A++D+ G+ VP+ V+ E +E + + TN E L L LM EK
Sbjct: 32 VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
K GK S W Y+ L + PL W +L L G+ ++E+ G K
Sbjct: 92 KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGCK 138
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 57/271 (21%), Positives = 104/271 (38%), Gaps = 33/271 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
+ A ED+ G+ F +P ++ + T++L E L L ++YE G
Sbjct: 43 MVAVEDVAEGETLFEIPRGSILNVNTSALTRDYPSF-GTSQLGEWEELILCMLYEMFVLG 101
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S W PY L + S + WS+ EL L S ++ER K +
Sbjct: 102 ENSRWYPYFNVLP-----SSAELNSLIYWSDRELGLLKPSF----VIERIGRGKSQEMFS 152
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
+ ++ Q + + T+E F + S ++ PQ
Sbjct: 153 KVLSYIEN---QDSDLSLIAKYLTWENFVYVASIIMSYSFDVE--DLNPQ--------SD 199
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
+ + +SP + K +++PL L + + C A L + +++
Sbjct: 200 EDDEIEDDDNDSEMSPDKSIK---------SMIPLADTLNSDTHLCNANLMYDKETLKMT 250
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVD 382
+P +AGE + G PNS++L YG+V+
Sbjct: 251 AIKPIRAGEEVFNIYGEHPNSEILRRYGYVE 281
>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
Length = 685
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)
Query: 321 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 380
++ P+ L A A L +++ +P AGE I PNS LL YG
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427
Query: 381 VDEDNPYD------RLVVEAALN 397
VDE N D +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
V+L+ R +GE I + G N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317
>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
Length = 689
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 49/230 (21%), Positives = 85/230 (36%), Gaps = 43/230 (18%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + + +L +L G+
Sbjct: 104 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDNDLEWLQGTNLYQTHQA 158
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCT 279
+K EY+ ++ G L E++ ++IF A+ + S T
Sbjct: 159 YRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIASRAF-------T 203
Query: 280 PQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKA 339
++ F++H Q PL + N K + R +G
Sbjct: 204 SRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIG------------ 251
Query: 340 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK-LLINYGFVDEDNPYD 388
L V P +GE + G N + ++ YGF DNP D
Sbjct: 252 ----------LKVIEPTFSGEEVHNNYGSLNNQQSVMTTYGFCIVDNPCD 291
>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
Length = 321
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)
Query: 57 VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
V++ D A + ++ +ED W NG + K+ +K + V
Sbjct: 50 VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
E ++ G+ S P +L + + + + + + L +K + L++++E K GK
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162
Query: 174 SFWLPYIRELDRQ 186
SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175
>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
V ASEDL + +P SL+++ ++ + I E+ TN+ +E L Y
Sbjct: 46 VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
+ EKK+G++SF+ PYI+ + Q +A +E PL+ E +L G +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162
Query: 214 KAE-ILERAEGI 224
KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)
Query: 349 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 402
L D Y+AGE I + G N+KLL+ YGF NP D VE + T DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333
Query: 403 YQDKRMVAQR-----NGKLSVQVFHV 423
+ R+ ++ + +L Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+++PL L + + +C A L D +++ +P K GE + G PN+++L YG+V
Sbjct: 215 SMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNAEILRRYGYV 274
Query: 382 D 382
+
Sbjct: 275 E 275
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 407
+ + + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 227 LSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 286
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 57/139 (41%), Gaps = 20/139 (14%)
Query: 322 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 381
+VP L A + K A L D + + +P AGE I GP P S LL YG+V
Sbjct: 233 GMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAGEQIFNDYGPLPRSDLLRMYGYV 292
Query: 382 -DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 440
D YD + L E VA ++ K QV+ RE+E D L L
Sbjct: 293 TDNYAQYDVVEFSHDLLLE---------VAGKHSKSKDQVW-----REREQQLDELGVLD 338
Query: 441 LGYV-----SDTSEMQSVI 454
GY DT +Q V+
Sbjct: 339 DGYAITRPEYDTQGLQDVL 357
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 88/428 (20%), Positives = 155/428 (36%), Gaps = 78/428 (18%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 43 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 95
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 96 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 151
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-RCGCTPQMGTESFASH 290
LF Q+ P + F+ E +K A V S + Q R G + ++ FA
Sbjct: 152 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRL-LAPFADM 204
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
L H + P H + SS ++LA D
Sbjct: 205 LNH--------SSEAKPCH--------------------VYDVSSGNLSVLAGKD----- 231
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K +
Sbjct: 232 -----YEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLW 286
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSP 465
G S + ++D LP YLR+ + + + VS
Sbjct: 287 VSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSD 339
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 523
E +L L + F L G+ L + E L Y+ A + E+++L
Sbjct: 340 SNEVEILQFLVESFGHLLGGFGTPLEKLEEQLAQGVYSPGGNAWAAAHVSLGEQRVLRLA 399
Query: 524 LQVTADMI 531
+ D++
Sbjct: 400 KKRAEDLL 407
>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
Length = 163
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 36/73 (49%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ + A E ++ G VP L+++ + +E + L+E L L+L+ E+
Sbjct: 77 YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136
Query: 171 GKKSFWLPYIREL 183
G +SFW PY+ L
Sbjct: 137 GSRSFWAPYLDTL 149
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 347 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
++ L+ G+ + GP+PNS+L+++YGF +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255
>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
CCMP526]
Length = 265
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 63/163 (38%), Gaps = 29/163 (17%)
Query: 356 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNG 414
YK GE + G + N++LL+ YGF DN ++ + + + + +Q G
Sbjct: 18 YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77
Query: 415 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 464
+ GR + D+L L + T SE+ + +L C ++
Sbjct: 78 SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129
Query: 465 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 497
PC ER L + A LA +P ++ EDE L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172
>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
Length = 253
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
Length = 546
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437
>gi|154290554|ref|XP_001545870.1| hypothetical protein BC1G_15621 [Botryotinia fuckeliana B05.10]
Length = 336
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 2/90 (2%)
Query: 305 LSPTHNP-KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 363
LSPT P K + S R AL P + +SS+ L +P K G +
Sbjct: 126 LSPTIKPSKPQPSKENRLALNPFAD-YINHSSEPTVDATLSRAGYTLTASQPIKQGSEVH 184
Query: 364 VWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
+ G N LL+ YGF+ EDN +D + ++
Sbjct: 185 ISYGSHNNDFLLVEYGFILEDNRWDEVTLD 214
>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
Length = 1017
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 30/52 (57%)
Query: 358 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 409
AGE ++ G + N++ L+N GFVD DN D + + L+ DP + + ++
Sbjct: 838 AGEQFFIYYGDRTNTEFLVNNGFVDPDNRNDYVNIRLGLSPTDPLAEKRAII 889
>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 705
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
+AA+ED+ A +PN ++++L ++ E + +++ N +E +A+Y
Sbjct: 21 IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+++EK +G+KSF+ PY + R +E L E+ Y
Sbjct: 80 VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123
>gi|347841961|emb|CCD56533.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 377
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 2/90 (2%)
Query: 305 LSPTHNP-KEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 363
LSPT P K + S R AL P + +SS+ L +P K G +
Sbjct: 167 LSPTIKPSKPQPSKENRLALNPFAD-YINHSSEPTVDATLSRAGYTLTASQPIKQGSEVH 225
Query: 364 VWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
+ G N LL+ YGF+ EDN +D + ++
Sbjct: 226 ISYGSHNNDFLLVEYGFILEDNRWDEVTLD 255
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 60/305 (19%), Positives = 109/305 (35%), Gaps = 66/305 (21%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + L + + LS LA Y+++ +
Sbjct: 34 FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + ++E EL TG+ A + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
L Q+ P FT E +K A
Sbjct: 143 VV------RLLIQHRDLFPLSKFTIEDYKWAL---------------------------- 168
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQ 349
CT + + + L FA +L +SS + A + +
Sbjct: 169 ------CTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHSSDVRQCHAYDPLSGNLS 216
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 409
++ + YKAG+ + ++ G PN++LL YGF+ NP D + + P ++ K +
Sbjct: 217 ILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPNDNYELVLETHPMAPFFEQKHKL 276
Query: 410 AQRNG 414
+ G
Sbjct: 277 WESAG 281
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 82/428 (19%), Positives = 159/428 (37%), Gaps = 67/428 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P+ L T++ + + L + + LS LA+++++ +
Sbjct: 28 VKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
++S + ++R + S + +++ EL G+ + + I+ +Y +
Sbjct: 87 -RESGYDGLRSHVERLPA----SYSSSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRD 141
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASH 290
L + QYP P + FT +K A AV S + Q + FA
Sbjct: 142 LVV------RVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFADM 195
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
L H S +C A+ D + +
Sbjct: 196 LNHSSES-------------------------------------KQCHVYDASSGD-LSV 217
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
+ + Y+AG+ + + G PN +LL YGF+ NP D + A + P ++ K+ +
Sbjct: 218 LAGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLW 277
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSP 465
G S + ++D LP YLR+ + D S++ S+ +S
Sbjct: 278 ALAGLDSTCTISL-------TLTDPLPKNVIRYLRIQRL-DESDLASIALGQAADEKISN 329
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 523
E VL L + + L + L + E L Y + A + E+++L
Sbjct: 330 SNEVQVLQSLVESIASLLGSFGTRLEKLEEQLATGVYPVGGNAWAAAHVSLGEQRVLKLA 389
Query: 524 LQVTADMI 531
+ D++
Sbjct: 390 KKKAEDLL 397
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 25/119 (21%), Positives = 55/119 (46%), Gaps = 8/119 (6%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+++ E+ L + +MYEK +G++S W PY+ + PL W E L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232
Query: 212 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQS 268
+++ + + ++ +W + Q++P ++P +++++ A AV S
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVAS 291
>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 544
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 31/48 (64%)
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
++D +V+++ ++ K E+I + G PN L++YGFV NPYD++
Sbjct: 136 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 183
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 31/48 (64%)
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
++D +V+++ ++ K E+I + G PN L++YGFV NPYD++
Sbjct: 275 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 322
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 104/239 (43%), Gaps = 20/239 (8%)
Query: 320 RFALVPLGPPLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 378
+ A+VPL +L + + C A L + + P AG + G NS+LL Y
Sbjct: 235 KIAMVPLAD-MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGHELYNTYGDLSNSELLRKY 293
Query: 379 GFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----REKEA 431
GF+D+ N ++ + + + E + ++ M A + FH+ A +E EA
Sbjct: 294 GFIDDVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQELEA 353
Query: 432 ISDML-----PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 486
+L L + D E++S +++L V+ C R V + L + + R Y
Sbjct: 354 SIALLFQSPKQVRALRALDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRAEEY 407
Query: 487 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 545
T EDE L + +L ++++A ++ E+ +L+ + + + P + PAP
Sbjct: 408 TTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEPAP 466
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
RP G + + GP PNSKLL+ YGF DNP D +
Sbjct: 273 RPCARGRQLFLSYGPYPNSKLLLFYGFALPDNPVDEV 309
>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
prasinos]
Length = 866
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 111/293 (37%), Gaps = 62/293 (21%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC---------LALYL 164
A +ED++ GD +P S +LE +E + + + + +A+++
Sbjct: 28 AVTEDVRRGDVLLEIPLSRCFSLESAQKSEMLTKAMAKAAAAAAGTRFTPTHDQYMAMFI 87
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEILERAE 222
+ E+ GK+S +I + + A + PL WSE E + L G+ T AE L E
Sbjct: 88 LLEQNLGKQSSHYEHILSIPK-------AYDLPLFWSEEERQRSLLFGTTTYAETLALDE 140
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQM 282
+ ++Y L + F++ + T + FK + S L R P
Sbjct: 141 EVIQDYELLKH--HLGEDFFRE-------QNITMDRFKWVRATLWSRQCDLLR----PAP 187
Query: 283 GTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLA 342
T + +++ PL +H L FA + AY S
Sbjct: 188 ETTRLRVLIPEFDMFNHSSKVPLGSSHKLNYSRGLVTAFATANVPKGEQAYIS------- 240
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV---DEDNPYDRLVV 392
Y +GE+ +SKLL+ YGF + +NP+++L V
Sbjct: 241 -------------YGSGEA--------SSSKLLLWYGFAPLNEGENPFEQLDV 272
>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
Length = 854
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
V A E +Q G+ + N V+ L L + + + N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571
Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++ S + P++ +L P+L S+ +L +L GS E+ + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626
Query: 228 Y 228
+
Sbjct: 627 F 627
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 27/47 (57%)
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 396
+ DR Y GE +++ G N LL+ YGFV E N +D + ++ A+
Sbjct: 211 ITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 112/295 (37%), Gaps = 53/295 (17%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + LS L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ----RCGCTPQMGTESFASHLVH 293
LF + I F++ F A+ V + V+++ RC + + T + A +L
Sbjct: 180 LRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYMKPRRRRCF-SAEPDTYALAPYLDL 234
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
L H+P +V +A ++
Sbjct: 235 LN-------------HSPDVQV----------------------RAGFNEETRCYEIRTV 259
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 408
+ E + + GP N +LL+ YGFV NP+ + V + + DK+M
Sbjct: 260 SSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPHACVYVSKDILVKYLPSTDKQM 314
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 348 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
+ LV+ P G ++ GP+PN++L++ YGF +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295
>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 225
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 96/490 (19%), Positives = 186/490 (37%), Gaps = 94/490 (19%)
Query: 71 VSKKEEDLGDLKSWM-HKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+S EDL + W+ +KN L P I+ P + A+ +++ + +
Sbjct: 202 ISTTPEDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRS------MVANTNIKKDEILVEI 255
Query: 129 PNSLVVTLERVLGN--ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
P +++T + ++ N I + + K+S A+ ++Y + S+W Y+ L +Q
Sbjct: 256 PKGIMMTPKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ 314
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN-------------ELDT 233
+ + ++ E+ L SP R G+ R Y+ E D+
Sbjct: 315 -------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDS 367
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHLVH 293
S + + +T + FK A V S L + + G
Sbjct: 368 TKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSL-----SEEDGG--------- 412
Query: 294 LQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 353
PL+ N +S ++ + P L Y++
Sbjct: 413 --------MVPLADMFNADTVISRSKVHPKISASSPSLVYTAS----------------- 447
Query: 354 RPYKAGESIVVWCGPQP---NSKLLINYGFVDED-NPYDRLVVEAA-LNTEDPQYQDKRM 408
+ +AGE I G + ++L++YGF+ ED + D +V A + +P Y KR
Sbjct: 448 QDIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRH 507
Query: 409 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC 466
+ Q NG + + F + + + ++ + R+ ++ ++ + + S ++P
Sbjct: 508 LMQSNG-IESEEFTI---TKNKLAKELFLFARIKSINKKESDQASAHFMSTQRHSMLNPR 563
Query: 467 MERAVLDQLADYFKARLAGYPATLSEDEAML-------TDYNLHPKKRVAT----QLVRM 515
E+A L L++ L Y T+ +D +L T+ N H T +L M
Sbjct: 564 NEKAALRLLSNLISRHLDAYQTTIDQDNQILKEIEKDKTNTN-HSSVTFNTINAIKLRLM 622
Query: 516 EKKMLNACLQ 525
EK +LN+ L+
Sbjct: 623 EKNILNSFLK 632
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 6/72 (8%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ + W PYI L + +++ W +TEL+YL SP + E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220
Query: 220 RAEGIKREYNEL 231
R E I E+ ++
Sbjct: 221 RLEMITTEFGQV 232
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 88/428 (20%), Positives = 155/428 (36%), Gaps = 78/428 (18%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 34 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-RCGCTPQMGTESFASH 290
LF Q+ P + F+ E +K A V S + Q R G + ++ FA
Sbjct: 143 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRL-LAPFADM 195
Query: 291 LVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 350
L H + P H + SS ++LA D
Sbjct: 196 LNH--------SSEAKPCH--------------------VYDVSSGNLSVLAGKD----- 222
Query: 351 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 410
Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K +
Sbjct: 223 -----YEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLW 277
Query: 411 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSP 465
G S + ++D LP YLR+ + + + VS
Sbjct: 278 VSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSD 330
Query: 466 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 523
E +L L + F L G+ L + E L Y+ A + E+++L
Sbjct: 331 SNEVEILQFLVESFGHLLGGFGTPLEKLEEQLAQGVYSPGGNAWAAAHVSLGEQRVLRLA 390
Query: 524 LQVTADMI 531
+ D++
Sbjct: 391 KKRAEDLL 398
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 76/381 (19%), Positives = 143/381 (37%), Gaps = 76/381 (19%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
Q G+ ++P + T+E + + +L + + LS LA+YL++ +
Sbjct: 34 FQQGERILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDTLAVYLLFVR-------- 85
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+RE + R +A S + ++E EL G+ + E I+ +Y
Sbjct: 86 ---LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR-- 140
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
+ +F Q+P +P + + +K A V S + P FA +
Sbjct: 141 ----VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDFVLPNGKPLRVLAPFADMI 196
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 351
H+P+ K +C A + + + ++
Sbjct: 197 ----------------NHSPEVK---------------------QCHAYDPSSGN-LSVL 218
Query: 352 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 411
+ Y+ G+ I + G PN++LL YGFV +NP D + + + P Y+ K+ +
Sbjct: 219 AGKDYEIGDQIYISYGSIPNNRLLRLYGFVIPENPNDSYDLVLSTHPMAPFYEQKQKLWA 278
Query: 412 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSLGPICPVSP 465
G S + + D LP YLR+ + D S++ ++ + L +S
Sbjct: 279 SAGLDSASTIPL-------TLIDPLPKSVLRYLRIQRL-DASDLAAIALQKLDTNEKISN 330
Query: 466 CMERAVLDQLADYFKARLAGY 486
E +L L + A L G+
Sbjct: 331 SKEVEILQFLVESISALLDGF 351
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 359 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
GE +V+ G + N +LL +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233
>gi|213408453|ref|XP_002174997.1| SET domain-containing protein [Schizosaccharomyces japonicus
yFS275]
gi|212003044|gb|EEB08704.1| SET domain-containing protein [Schizosaccharomyces japonicus
yFS275]
Length = 441
Score = 39.3 bits (90), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 59/280 (21%), Positives = 101/280 (36%), Gaps = 37/280 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A ++++A + P V+ +V G+ + L +L A L L +M EK
Sbjct: 40 IVAVDNIKADETVVFFPKDSVM---KVSGS-YLQHLEGIEELPNWAALLLLMMNEK-NNP 94
Query: 173 KSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+SFW PYI ++R + S W + L S + +R+E +
Sbjct: 95 ESFWKPYISVFPTKER------ITSLFYWDAEKQKRLLKSTVLENMQDRSE--------V 140
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQRCGCTPQMGTESFASHL 291
TVW F E T E F+ + S ++R + TE+ S
Sbjct: 141 KTVWKETVLPFIDKNKSKLREGLTLEDFEHMAAVMSSYSFDVKR------IKTENNDSQK 194
Query: 292 VHLQTSCTTTRTPLSPTHNPKEKVSLARRF-------ALVPLGPPLLAYSSKCKAMLAAV 344
Q + + N ++ L + A+ P+ C + +
Sbjct: 195 ASKQMDVDNS----EHSENNEDDSDLESEYDPEVFEKAMCPIADMFNGDDELCNVRMYDL 250
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 384
+D ++V R + GE + G N +LL YGF D
Sbjct: 251 EDGYHMMVTRDIEKGEQLWNTYGDIDNGELLRKYGFTKPD 290
>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
Length = 339
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 99 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 39.3 bits (90), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 393
D AV L++ P G ++ GP+PN++L++ YGF NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323
>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
Length = 305
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 9/100 (9%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D+Q G+ F++P S V++ + + +L L A L + ++YE +G S W
Sbjct: 3 DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
PY L ++ + WS+ ELA L S +I
Sbjct: 61 PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKI 93
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 390
RP +AG + + GP N+KLL+ YGF DNP D +
Sbjct: 390 RPCEAGNQLFLSYGPYSNAKLLLFYGFAVRDNPADEV 426
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 8/72 (11%)
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 227 EYNELDT---VW 235
E+ ++ VW
Sbjct: 558 EFGQVQNALDVW 569
>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
Length = 789
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 42/194 (21%), Positives = 84/194 (43%), Gaps = 26/194 (13%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T + + + K++E + + W+ NG+ V + P + + +AA +D+
Sbjct: 28 TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84
Query: 122 GDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLMYEKKQGKKS 174
A VP L+++ E + N IA E+ ++ +E + ++ +E +G+ S
Sbjct: 85 QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-NELDT 233
FW PY + ++ + P+LWS+ E+ + +I + K EY NE
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEYENEWKL 191
Query: 234 VW--FMAGSLFQQY 245
V+ F + +Y
Sbjct: 192 VYEAFSKDETYDEY 205
>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 429
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)
Query: 354 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 412
+ Y+ GE +++ G N +L+ YGFVD DN D E ++ Y KR +
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305
Query: 413 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 461
+LS + H A E ISD +L LR V T E + +
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362
Query: 462 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 521
ER D + ++RLA YP T+ EDE L N +R A Q +K++L
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420
Query: 522 ACLQV 526
++
Sbjct: 421 DASRI 425
>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
Length = 289
Score = 38.9 bits (89), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 343 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 399
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 47 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 38.9 bits (89), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 346 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 392
+A L D+ +G+ + + GP+ N +LL YGFV+ +NP D V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 38.9 bits (89), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 65/149 (43%), Gaps = 36/149 (24%)
Query: 363 VVWCGPQ--------PNSKLLINYGFVDEDNPYDRLVV-----EAALNTEDPQYQDKRM- 408
V+ CG Q PN K+L+ YGFV +NPYD +++ E + + ++ +M
Sbjct: 230 VLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDEVLLRINFKERHFEKQVEESEESKME 289
Query: 409 -------------------VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DT 447
+ Q + V F + REKE +D+L LR+ +S +
Sbjct: 290 VENKENERMEVEEEDNEDEITQILKREGVDRFDYYLTREKELPTDLLRVLRIVNLSLVEA 349
Query: 448 SEMQSVISSLGPICPVSPC-MERAVLDQL 475
++ + L + P++ R++++Q+
Sbjct: 350 NQYSQALLDLSYVSPINEIKATRSLMEQI 378
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 350 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 392
+ DR YKAGE + V G N LL+ YGF+ + N P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250
>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 241
Score = 38.9 bits (89), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ +P+ + +L L GSP L+ I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226
>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 297
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++AG+ VP LV+T+E N + L + +++ + LA +L+ ++
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170
Query: 172 KKSFWLPYIREL 183
SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182
>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
Length = 216
Score = 38.5 bits (88), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ ++ + +VP S+ +T E+V N T S +L+L+ +K GK
Sbjct: 22 VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KSF+ PYI L ++ + + L + E ++ Y+ + + + ER ++ ++++L
Sbjct: 77 KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 38.5 bits (88), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 320 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 379
R AL+P+ + C+ + + +V DR YK GE + + N L+ YG
Sbjct: 179 RLALIPVADLFNHAAGGCRVYYSP-EGCYHVVADRAYKKGEELFISYSSHSNDYNLLEYG 237
Query: 380 FVDEDNPYDRLVVE 393
F+ ++N D + ++
Sbjct: 238 FIPDENSLDDVYID 251
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 49/101 (48%), Gaps = 13/101 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
+ A ++ + F++P +LV++ + N + +LL + L CL L ++YE Q
Sbjct: 50 IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
G S W Y + L + ++ + W++ EL L+GS
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGS 139
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 95/237 (40%), Gaps = 45/237 (18%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSETELAYLTGS 211
+ E LA +LM + G +SFW PYI+ L D Q R + ++ +L +L G+
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEY-------YTGDDLKWLEGT 172
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+ E +K +Y L +++P + T +T+E F A + S
Sbjct: 173 NLLKLREKLLERLKAKYE-------TGLRLLKEFP-NKNTPKYTWERFLWASSIILS--- 221
Query: 272 HLQRCGCTPQMGTESFASHLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLL 331
+F+S ++ T TR P E S+ LVPL +
Sbjct: 222 -------------RAFSSEVLKDYIKGTPTRV------KPLEDFSV-----LVPLVD-IS 256
Query: 332 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 388
+ + A + + L+V + GE + GP+ N +L++NYGF N D
Sbjct: 257 NHQPLAQVEWATSLEKIGLIVHKTLLPGEEVPNNYGPRSNERLMMNYGFCIRGNVCD 313
>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 147
Score = 38.5 bits (88), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIREL 183
+G S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124
>gi|311031127|ref|ZP_07709217.1| D-alanyl-alanine synthetase A [Bacillus sp. m3-13]
Length = 356
Score = 38.5 bits (88), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 75/174 (43%), Gaps = 19/174 (10%)
Query: 35 IRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCK 94
++ N GSS+ + + KNR + + + + +V + E +++ + N P C
Sbjct: 176 VKPANLGSSVGISKAKNREELETAFREAFDYDRKIIVEQGLEGAREIEIGVLGNDEPECS 235
Query: 95 VILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNK 153
VI + P K Y A ED GD A +P + + E + E+ +T K
Sbjct: 236 VIGEIAP----KTEFYDYKAKYED---GDTAMIIPAEISEDI-----YEQVKEMAITAFK 283
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+ A L + K+GK YI E++ G ++ PLLW T+++Y
Sbjct: 284 AIDGAGLVRADFFLTKEGK-----LYINEVNTMPGFTPFSM-FPLLWQHTDVSY 331
>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 38.5 bits (88), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 10/143 (6%)
Query: 90 LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL 149
L P +L+ P R Y+ +E+++ G S+P S V+++E + LL
Sbjct: 16 LAPMSTVLQ--PEGFNFGRGTAYIT-TENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLL 71
Query: 150 TTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L
Sbjct: 72 EVNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMKGL 126
Query: 209 TGSPTKAEILERAEGIKREYNEL 231
GS + L RA+ + Y+ L
Sbjct: 127 EGSQLQRFTLGRAQAVDAFYDAL 149
>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 510
Score = 38.5 bits (88), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 92 PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
P +L+ P R Y+ A E+++ G S+P S V+++E + LL
Sbjct: 88 PMSTVLQ--PEGFNFGRGTAYITA-ENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEV 143
Query: 152 N-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L G
Sbjct: 144 NPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMNELEG 198
Query: 211 SPTKAEILERAEGIKREYNEL 231
S + L RA+ ++ Y+ L
Sbjct: 199 SQLQRFTLGRAQAVEAFYDAL 219
>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
distachyon]
Length = 480
Score = 38.5 bits (88), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 119/293 (40%), Gaps = 39/293 (13%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTN--KLSELACLALYLMYEKKQ 170
AA+ DL+ G+ VP + ++T +RV+ ++ IA + +LS + L + L+ E +
Sbjct: 45 AAARDLRRGELVLRVPRAALLTSDRVMADDPEIASCIAARHPRLSSVQRLIVCLLAEVGK 104
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI-KREYN 229
GK S W Y+ +L T LA +A ++ A I ++ +
Sbjct: 105 GKSSSWYLYLSQLPSYY---------------TVLATFNDFEIEALQVDDAIWIAQKSLS 149
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL--QRCGCTPQMGT-ES 286
+ + W A L Q + + F+ + AF V S +H+ GC +G +
Sbjct: 150 AIRSEWEDATPLMQGLKF--KPKLLIFKTWLWAFATVSSRTLHVAWDDAGCLCPVGDLFN 207
Query: 287 FAS--HLVHLQTSCTTTRTPLSPTHNPKEKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 344
+A+ + + T + E+V R + G Y
Sbjct: 208 YAAPDDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDG----GYEDS-------- 255
Query: 345 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 397
+A L + Y GE +++ G N +LL +YGF+ +NP ++ ++ L+
Sbjct: 256 -EAYCLYARKCYTKGEQVLLGYGTYTNLELLEHYGFLLAENPNEKTYIQLDLD 307
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,356,355,841
Number of Sequences: 23463169
Number of extensions: 336894931
Number of successful extensions: 898175
Number of sequences better than 100.0: 746
Number of HSP's better than 100.0 without gapping: 332
Number of HSP's successfully gapped in prelim test: 414
Number of HSP's that attempted gapping in prelim test: 896853
Number of HSP's gapped (non-prelim): 1326
length of query: 551
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 403
effective length of database: 8,886,646,355
effective search space: 3581318481065
effective search space used: 3581318481065
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)