BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 010377
(512 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 828 bits (2139), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/499 (80%), Positives = 442/499 (88%), Gaps = 10/499 (2%)
Query: 21 HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
H PL + S IS+S R +F +S +RR N S SSS+TLVAGSR E V+K
Sbjct: 24 HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81
Query: 74 KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
KE+D GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82 KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141
Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTE
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
AF+FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELV 321
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 372
VDRPYKAGESI VWCGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTEDPQYQDKRMVAQ
Sbjct: 322 VDRPYKAGESIAVWCGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQ 381
Query: 373 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 432
RNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGYV+ SEMQSVISS GP+CPVSPCMERA+
Sbjct: 382 RNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPVCPVSPCMERAM 441
Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
L+Q+ADYFK RLAGYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L++CL+VT D I
Sbjct: 442 LEQVADYFKRRLAGYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEVTIDFI 501
Query: 493 MLLPDVTVSPCPAPYAPLL 511
LPD TVSPCPAPYAPLL
Sbjct: 502 NQLPDHTVSPCPAPYAPLL 520
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 828 bits (2138), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/462 (85%), Positives = 423/462 (91%), Gaps = 7/462 (1%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
LDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPP
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPP 287
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
LLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDEDN YDR
Sbjct: 288 LLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDEDNSYDR 347
Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGYVSD SE
Sbjct: 348 IVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSE 407
Query: 411 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 470
MQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+PKK VA
Sbjct: 408 MQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVA 467
Query: 471 TQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
TQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 468 TQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 821 bits (2121), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/468 (84%), Positives = 423/468 (90%), Gaps = 13/468 (2%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
HY+AASEDLQ AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 347
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 404
DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGY
Sbjct: 348 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 407
Query: 405 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 408 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 467
Query: 465 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 468 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/517 (78%), Positives = 439/517 (84%), Gaps = 22/517 (4%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME +C +K ISP L++ S +SIS P S RR F+
Sbjct: 1 MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45
Query: 61 DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
TLV G R EVVSK+ E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46 -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
WFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPPLLAY
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAY 284
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
SS CKAML AVD AV+LVVDRPYKAGE IVVWCGPQPNSKLL+NYGFVDEDNPYDR+ VE
Sbjct: 285 SSNCKAMLTAVDGAVELVVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDEDNPYDRIAVE 344
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 414
AALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLGYVSD SEMQSV
Sbjct: 345 AALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLGYVSDPSEMQSV 404
Query: 415 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
ISS GP+CPVSPCME+AVLDQL YF+ RLAGY ++SEDE ML D NL+PKKRVATQLV
Sbjct: 405 ISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNLNPKKRVATQLV 464
Query: 475 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
R+EKKML ACLQ T D+I LPD T+ PCPAPYAPLL
Sbjct: 465 RLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLL 501
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 793 bits (2047), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/427 (88%), Positives = 401/427 (93%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1 MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61 IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
A+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIV
Sbjct: 181 AIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIV 240
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 384
VWCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV
Sbjct: 241 VWCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHV 300
Query: 385 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 444
G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RL
Sbjct: 301 SVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRL 360
Query: 445 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 504
AGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCP
Sbjct: 361 AGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCP 420
Query: 505 APYAPLL 511
APY PLL
Sbjct: 421 APYTPLL 427
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 782 bits (2019), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/476 (78%), Positives = 418/476 (87%), Gaps = 2/476 (0%)
Query: 36 RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
R P+F S RR+ R S+SDTLVA + + K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27 RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LK+KPS ++ +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85 VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KL
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKL 324
Query: 336 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 395
L NYGFVDEDN DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+ G+E+EA+SD
Sbjct: 325 LTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSD 384
Query: 396 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
M+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF RLA YP TL+EDE
Sbjct: 385 MIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDE 444
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
+MLTD +L+PK+RVATQLVR+EKKML+ACLQ D+I LPD +VSPCPAPYAP L
Sbjct: 445 SMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPAPYAPSL 500
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/520 (74%), Positives = 433/520 (83%), Gaps = 16/520 (3%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353
Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
+VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 354 IVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 413
Query: 412 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 471
QSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT
Sbjct: 414 QSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 473
Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 474 RLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSL 513
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/520 (74%), Positives = 432/520 (83%), Gaps = 16/520 (3%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353
Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
+VEAALNTE PQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 354 IVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 413
Query: 412 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 471
QSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT
Sbjct: 414 QSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 473
Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 474 RLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSL 513
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/521 (73%), Positives = 431/521 (82%), Gaps = 14/521 (2%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V ++ + SSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVAG +V SKKE +D DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56 DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 296 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRI 355
Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
+VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 356 IVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 415
Query: 412 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 471
QSVISS GP+C +SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT
Sbjct: 416 QSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 475
Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 476 RLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 769 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/432 (83%), Positives = 397/432 (91%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
G+ IVVWCGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSV
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSV 369
Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 439
QVFHV+AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCMERA LDQLADY
Sbjct: 370 QVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCMERAALDQLADY 429
Query: 440 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 499
FKARLAGYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD T
Sbjct: 430 FKARLAGYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQATTDFINQLPDHT 489
Query: 500 VSPCPAPYAPLL 511
+SPCPAPYAPLL
Sbjct: 490 ISPCPAPYAPLL 501
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 765 bits (1975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/433 (83%), Positives = 396/433 (91%), Gaps = 1/433 (0%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
G+ IVVWCGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSV
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSV 369
Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 439
QVFHV+AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCMERA LDQLADY
Sbjct: 370 QVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAALDQLADY 429
Query: 440 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 498
FKARLAGYP L+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD
Sbjct: 430 FKARLAGYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQATIDFINQLPDH 489
Query: 499 TVSPCPAPYAPLL 511
T+SPCPAPYAPLL
Sbjct: 490 TISPCPAPYAPLL 502
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/476 (73%), Positives = 403/476 (84%), Gaps = 7/476 (1%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91 MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL Y TGSP K
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+LER GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQN 270
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++L
Sbjct: 271 VNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARL 330
Query: 336 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 395
L+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++ GREKEA+ D
Sbjct: 331 LLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLD 390
Query: 396 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
MLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLAGYP T SEDE
Sbjct: 391 MLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLAGYPTTASEDE 450
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
+L D +PK++VATQLV +EKKMLN+CL ++I LPD+ V+PCP+PY+P+L
Sbjct: 451 IVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPSPYSPIL 506
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/489 (73%), Positives = 392/489 (80%), Gaps = 34/489 (6%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL------------------ 273
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
Q V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNA 353
Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 393
KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+
Sbjct: 354 KLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAV 413
Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T E
Sbjct: 414 QDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKE 473
Query: 454 DEAMLTDYN 462
D+A+ N
Sbjct: 474 DDALEASCN 482
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/434 (73%), Positives = 369/434 (85%), Gaps = 3/434 (0%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 251 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 310
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL
Sbjct: 311 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKL 370
Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 371 AIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLV 430
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 497
Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD
Sbjct: 431 GYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPD 490
Query: 498 VTVSPCPAPYAPLL 511
TVSPCPAP+AP L
Sbjct: 491 HTVSPCPAPFAPEL 504
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/434 (73%), Positives = 369/434 (85%), Gaps = 3/434 (0%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 255 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 314
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL
Sbjct: 315 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKL 374
Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 375 AIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLV 434
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 497
Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD
Sbjct: 435 GYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPD 494
Query: 498 VTVSPCPAPYAPLL 511
TVSPCPAP+AP L
Sbjct: 495 HTVSPCPAPFAPEL 508
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/432 (74%), Positives = 366/432 (84%), Gaps = 3/432 (0%)
Query: 84 WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ +GLPP KV + E+P S K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85 WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFK
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAG
Sbjct: 265 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAG 324
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 380
E I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q
Sbjct: 325 EPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQ 384
Query: 381 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 440
F V G+EKE IS+MLPYLRLGY+SD EMQ ++SS G CPVSPC ERAVLDQL Y
Sbjct: 385 KFQVCVGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYL 444
Query: 441 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
K+RLAGYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TV
Sbjct: 445 KSRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTV 504
Query: 501 SPCPAPYAPLLN 512
SPCPA YAP L
Sbjct: 505 SPCPALYAPNLK 516
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/519 (64%), Positives = 395/519 (76%), Gaps = 23/519 (4%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 14 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 65 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 304
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 305 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 364
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRLGY+SD EMQ
Sbjct: 365 EASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQC 424
Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
++SS G CPVSPC ERAVLDQL Y K+RLAGYP L EDEAML D +L PKK VAT+L
Sbjct: 425 ILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGSLEPKKEVATRL 484
Query: 474 VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
VR+EKKML+ CL+ + I LPD TVSPCPA YAP L
Sbjct: 485 VRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/431 (73%), Positives = 367/431 (85%), Gaps = 3/431 (0%)
Query: 84 WMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ GLPP KV ++E+P N K P+ YVAA DLQAGD AF VP SLVVTLERVL
Sbjct: 81 WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFK
Sbjct: 201 TESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFK 260
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAG
Sbjct: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAG 320
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 380
E I++WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q
Sbjct: 321 EPIIIWCGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQ 380
Query: 381 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 440
F+V+ G+EK+ +++MLPYLRLGY+SD EMQS++SS G CP+SPC ERAVLDQL Y
Sbjct: 381 NFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYL 440
Query: 441 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
++RLAGYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T + I LPD TV
Sbjct: 441 ESRLAGYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTV 500
Query: 501 SPCPAPYAPLL 511
SPCPAPYAP L
Sbjct: 501 SPCPAPYAPEL 511
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/457 (70%), Positives = 374/457 (81%), Gaps = 3/457 (0%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
SSS+ A V E W+ GLPP KV ++E+P + K +P+ YV+
Sbjct: 55 SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A DLQAGD AF V SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTY 294
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDEDNP+DR+ +E
Sbjct: 295 RSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDEDNPFDRVAIE 354
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 414
A+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY+S+ EMQS+
Sbjct: 355 ASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISNPDEMQSI 414
Query: 415 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
+SS G CPVSPC ERAVLDQL Y ++RLAGYP TL EDEAML D NL PKK VAT+LV
Sbjct: 415 LSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLV 474
Query: 475 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
R+EKKML+ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 475 RLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/393 (77%), Positives = 345/393 (87%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
GSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S C
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNC 322
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
KAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN
Sbjct: 323 KAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLN 382
Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 418
EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS
Sbjct: 383 IEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSE 442
Query: 419 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 478
G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EK
Sbjct: 443 GDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEK 502
Query: 479 KMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
K+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 503 KLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 535
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 607 bits (1566), Expect = e-171, Method: Compositional matrix adjust.
Identities = 305/481 (63%), Positives = 361/481 (75%), Gaps = 10/481 (2%)
Query: 38 PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
P FG+ V + R S ++ T V E SKK+E DLK WM + GLP
Sbjct: 45 PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104
Query: 93 CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
CKV L E +PS +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
S K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCV
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
VHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VWCGPQ
Sbjct: 285 VHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQ 344
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 390
PNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F ++ G+E
Sbjct: 345 PNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQIYKGKEM 404
Query: 391 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 450
EA+ DMLPY+RL +++D EM++V + GP+CPVS C ERAVL+QL YF+ RLAGY ++
Sbjct: 405 EAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRLAGYKSS 464
Query: 451 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 510
+ + D + KKRVA +L+ +EK +L L ++I LPD +SPC PY P
Sbjct: 465 HATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCIGPYLPN 521
Query: 511 L 511
L
Sbjct: 522 L 522
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 293/432 (67%), Positives = 344/432 (79%), Gaps = 7/432 (1%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3 WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 60 TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
I+VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F
Sbjct: 240 ILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNF 299
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
+ GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+
Sbjct: 300 QIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQE 359
Query: 443 RLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
R+A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V LPD +V
Sbjct: 360 RIAKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSV 419
Query: 501 S-PCPAPYAPLL 511
+ PC + P L
Sbjct: 420 APPCCGDFVPKL 431
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 593 bits (1529), Expect = e-167, Method: Compositional matrix adjust.
Identities = 293/432 (67%), Positives = 344/432 (79%), Gaps = 7/432 (1%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3 WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 60 TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
I+VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F
Sbjct: 240 ILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNF 299
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
+ GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+
Sbjct: 300 QIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQE 359
Query: 443 RLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
R+A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V LPD +V
Sbjct: 360 RIAKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSV 419
Query: 501 S-PCPAPYAPLL 511
+ PC + P L
Sbjct: 420 APPCCGDFVPKL 431
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 281/429 (65%), Positives = 330/429 (76%), Gaps = 5/429 (1%)
Query: 85 MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
M + GLP C V L E + + +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1 MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS
Sbjct: 61 TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQA
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD YK G+
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDP 240
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
I VWCGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F
Sbjct: 241 IAVWCGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTF 300
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
++ G+E EA+ DMLPY+RLG+++D E+++V + P+C VS C ERAVL+Q+ +F+
Sbjct: 301 QIYKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFER 360
Query: 443 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSP 502
RLAGY S D D K+ VA +L+ +EK +L L ++I LPD +SP
Sbjct: 361 RLAGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISP 417
Query: 503 CPAPYAPLL 511
C PY P L
Sbjct: 418 CIGPYLPNL 426
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 274/426 (64%), Positives = 326/426 (76%), Gaps = 23/426 (5%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 9 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 60 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 299
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 300 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 359
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRLGY+SD EMQ
Sbjct: 360 EASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQC 419
Query: 414 VISSLG 419
++SS G
Sbjct: 420 ILSSEG 425
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 197/372 (52%), Positives = 264/372 (70%), Gaps = 9/372 (2%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ DLQAG+ A +P+ LV+TL+RV +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49 AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
W +I+ELDR +GRGQ+ +SPLLW E ++ YL GSP AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
VWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFALVPLGPPLL+
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFALVPLGPPLLS 228
Query: 294 YSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
YSS KAML +A VQL VDR Y GE I WCGPQPN +LL+NYG V ++NP+D++
Sbjct: 229 YSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAWCGPQPNRRLLLNYGIVTDNNPHDKM 288
Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
+ L DP +Q KR V Q+N + Q F + R+K +LPYLRL + +D + +
Sbjct: 289 ALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYLRLAHCTDAASL 346
Query: 412 QSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
+ +++L C P+SP ER VL QLA + + RL Y T EDE ++ P+++V
Sbjct: 347 K--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVIIRSTTAGPRQKV 404
Query: 470 ATQLVRMEKKML 481
A +L+R+EK +L
Sbjct: 405 AARLLRIEKAIL 416
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/242 (71%), Positives = 206/242 (85%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
++ QKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
Q NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKE 129
Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
K+ +++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPT 189
Query: 450 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 509
TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP
Sbjct: 190 TLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAP 249
Query: 510 LL 511
L
Sbjct: 250 EL 251
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/190 (91%), Positives = 182/190 (95%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1 MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61 VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQ 274
A+QSCVVHLQ
Sbjct: 181 AIQSCVVHLQ 190
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 130/159 (81%), Positives = 148/159 (93%)
Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 412
++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248
Query: 413 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308
Query: 473 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
LVR+EKK+LNACLQ TAD+I LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPIL 347
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 365 bits (936), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 191/404 (47%), Positives = 262/404 (64%), Gaps = 9/404 (2%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ W+ ++G P KV L+ + + A+E LQ GD A +P L+VTL+RVL
Sbjct: 9 MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ T+AEL+TT KLSELACL LYL YEKK+GK+ W +I+ELDR +GRG +SPLLW
Sbjct: 66 EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
E + A L GSP EI R +GI++EY ELDTVW++AGSLF + P+ PTE F+F +F
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 317
+QAF AVQS VVHLQ V+L +RFALVP+GPPLL YSS KAML ++ V+L VDR Y
Sbjct: 186 RQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAY 245
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
+ GE+++ WCGPQPNS+LLINYG VDE NPYD+L + + ++DP Y+ KR G
Sbjct: 246 QPGEAVLAWCGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLS 305
Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
+ Q F + A A +LPYLRL + + ++++ V PV+P E VL+QL
Sbjct: 306 TQQTFQLQAAASLPA--QLLPYLRLVHSTREADVEGVKWE-EEAGPVAPENELTVLNQLI 362
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
+ + R + Y T+ EDEA++ D P+ VA +L+++EK +L
Sbjct: 363 THLRLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 363 bits (933), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 166/240 (69%), Positives = 203/240 (84%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129
Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
K+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189
Query: 450 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 509
TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 190 TLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 347 bits (890), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 192/408 (47%), Positives = 255/408 (62%), Gaps = 14/408 (3%)
Query: 80 DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+L W+ +NG + +V + PS RP+ V A L AG+ A SVP L +TL+
Sbjct: 43 ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
R+ +E +AELLTT+KLSELACLALYLMYEKK KKSFW PYI+ELD+Q+ RG A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
LLW + EL + L GSP + +R GI++EY LDTVWFMAGSLF +YP+D+PTE F+F
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVD 314
E+F+QAF VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS K M+ +D +V+LVV
Sbjct: 220 ELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVS 279
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQ 372
P +AG + WCGPQPNS+LL+NYG VDE NP+D+L L T DP + KR V
Sbjct: 280 GPVEAGRPVAAWCGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLS 339
Query: 373 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 432
G + Q F V R +LPY+ L + ++ SV S +E A
Sbjct: 340 EAGLATQQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDTAGHDRELEAAA 395
Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
L L Y + R A Y L D ++ D + P+++VA +L ++EK +
Sbjct: 396 LAALMAYVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 166/269 (61%), Positives = 203/269 (75%), Gaps = 29/269 (10%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129
Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
K+ +++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189
Query: 450 TLSEDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKM 480
TL+EDEAM L D +L PK+ VAT+LVR+EKKM
Sbjct: 190 TLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKM 249
Query: 481 LNACLQVTADMIMLLPDVTVSPCPAPYAP 509
L+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 250 LHACLQATNEFITDLPDHTVSPCPAQYAP 278
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/128 (84%), Positives = 117/128 (91%)
Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 1 ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61 YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120
Query: 267 QSCVVHLQ 274
QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 127/445 (28%), Positives = 201/445 (45%), Gaps = 40/445 (8%)
Query: 45 RLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHN 104
R V + R +R + ++ S + ++ L L W+ K G P VIL
Sbjct: 54 RTVWNEGRRGLRGVARCSMSGNSMQSMA-----LHQLSEWLSKQGFPTQDVILT---GFG 105
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
E+ + AA D + G+ A +P + VT V+ + +A ++ L L+L
Sbjct: 106 EEGVGL---AAGRDFKEGEVALKIPENYTVTGVDVVNHPVVAAPAAGR--GDVIGLTLWL 160
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPTKAEILERAEG 223
MYE+ G+KS W PY++ SP+LW+ E + L GSP E+ +R+
Sbjct: 161 MYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPALEEVQQRSAA 213
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
++ EY +L S F + P P E F+ E FK AF + S V+L L FA
Sbjct: 214 LEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADL---FA 263
Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQPNSKLLINYG 340
LVP L + + +A L + D AV VDR YK GE + G + N+ LLI YG
Sbjct: 264 LVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNADLLITYG 322
Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 400
FVDE+N D L +E L D K+ + Q+ S Q F ++ R + +L Y+
Sbjct: 323 FVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP---TQLLTYM 379
Query: 401 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
RL + D + ++ + + E L L + +L Y + ++ +L +
Sbjct: 380 RLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGGVDDEIRLLKN 437
Query: 461 YNLHPKKRVATQLVRMEKKMLNACL 485
+ ++RVA QL EKK+L + +
Sbjct: 438 KKISQRERVAAQLRLCEKKILTSTM 462
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 122/433 (28%), Positives = 197/433 (45%), Gaps = 39/433 (9%)
Query: 81 LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
LK WM +NG+ P + + NE + A+ +++ GD F++P
Sbjct: 92 LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
L++T + E A+++T + LSE +AL ++EK +GK+SFW YI L
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
V LW+E +LA L GSP A ++ EY ++ L ++P +
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEIL 258
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------PPLLAY-SSKCKAML 302
P E T+E F+ AF + S + L +S ALVP P +Y ++ + +
Sbjct: 259 PREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLF 318
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
+ D V + DR YK E + + GP+ NS LL+ YGF + NPY+ + V +L+ D
Sbjct: 319 FSKTDEVVVYADRSYKKMEQVYISYGPKGNSDLLLLYGFSLDRNPYNSVDVTVSLDENDE 378
Query: 363 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 422
Y+ K+ G + F ++ R + ++L YLRL ++ + L
Sbjct: 379 LYERKKAFLSEAGLPPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLRGRTLEDLSFEK 435
Query: 423 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YNLHPKKRVATQLVRMEK 478
+ E VLD L + KA +AGYP T +D ++ D L +R+A + R EK
Sbjct: 436 KQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQRMAVKHRRQEK 495
Query: 479 KMLNACL-QVTAD 490
+L + VT D
Sbjct: 496 VILRRTIAAVTKD 508
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 119/451 (26%), Positives = 208/451 (46%), Gaps = 52/451 (11%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
R S + + +G V ++ W+ +NG+ P ++++ E+ + +E
Sbjct: 58 RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116
Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
+ +S + G+ +P L+ T R L ET + + E + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
YEK +G SF+ PY+ L L +PL LWS +L L GSPT + + + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
REY ++ ++ Q P + ++ F+ F+ AF + S + ++R AL
Sbjct: 229 LREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIAL 278
Query: 285 VPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
VP LL +S C A + V +AV + VDR Y+ E + V GP+ N +LL
Sbjct: 279 VPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQELL 336
Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 396
+ YGF E NP+D + + L+ DP Y +K + + GK +Q F ++ R +M
Sbjct: 337 LLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYMDRYP---VEM 393
Query: 397 LPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
+LR + +++Q+ ++ VS E + LD+L +Y +L YP +L +DE
Sbjct: 394 AEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRKYPTSLEDDEK 448
Query: 457 MLTD----YNLHPKKRVATQLVRMEKKMLNA 483
++ D L +R+A + EK++L+A
Sbjct: 449 IIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 121/409 (29%), Positives = 185/409 (45%), Gaps = 35/409 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153
Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFE 257
W + E L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFS 204
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
FK AF V S V+L L FALVP G + SS+ + V+L VD+ Y
Sbjct: 205 AFKAAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRY 261
Query: 318 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
K G+ + + ++ LI YGF+DE + D + +E L + D KR + Q G
Sbjct: 262 KKGDQVFASYAQNLTSADFLIRYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGL 321
Query: 377 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 436
Q F V+ R + +L Y RL + D+ + I V E L L
Sbjct: 322 TVPQKFPVYLNRFP---TQLLTYTRLARIQDSGLFAKITFEKDLI--VCQTNEYETLMLL 376
Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ +L + T+ +D L NL K+RVA QL EK++L +
Sbjct: 377 MADCRTKLLSFSDTMEDDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 116/405 (28%), Positives = 180/405 (44%), Gaps = 33/405 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + L
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
E + L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSA 205
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
FK AF V S V+L L FALVP G + SS+ + V+L VD+ YK
Sbjct: 206 FKTAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYK 262
Query: 319 AGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
G+ + + ++ LI YGF+DE + D + +E L + D KR + Q G
Sbjct: 263 KGDQVFASYAQNLTSADFLIRYGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLT 322
Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
Q F ++ R + +L Y RL + D+ + I VS E L L
Sbjct: 323 VPQKFPLYLNR---FPTQLLTYTRLARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLM 377
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
+ +L T+ ++ L NL K+RVA QL EK++L
Sbjct: 378 ADCRTKLLSSSDTMEDEMQTLRRKNLSYKQRVAAQLRLKEKRILT 422
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 152/315 (48%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P+ P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDA 349
Query: 371 AQRNGKLSVQVFHVH 385
+++G + Q F V
Sbjct: 350 LKKHGLSTPQCFPVR 364
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 151/315 (47%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDA 349
Query: 371 AQRNGKLSVQVFHVH 385
+++G + Q F V
Sbjct: 350 LKKHGLSTPQCFPVR 364
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 151/315 (47%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDA 349
Query: 371 AQRNGKLSVQVFHVH 385
+++G + Q F V
Sbjct: 350 LKKHGLSTPQCFPVR 364
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 149/315 (47%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ V SLV+
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDA 349
Query: 371 AQRNGKLSVQVFHVH 385
+++G + Q F V
Sbjct: 350 LKKHGLSTPQCFPVR 364
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 184/406 (45%), Gaps = 54/406 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A D+ GD +P +L +T + + + + + ++++E +A +L+YE+ +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W PY+ ++D V W + +LA+L GSP A ++REY+
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240
Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
L + G L +YP P EAF F+ ++ AF + S + L+ + ALVP
Sbjct: 241 L-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYA 295
Query: 289 PPLLAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
L+ +S +A + A D+ V L DR Y+ E I + GP+ N++LL+
Sbjct: 296 D-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYGPKSNAELLLL 354
Query: 339 YGFVDEDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQRNGKLSVQVFH 383
YGF E NP++ + V ++ DP ++K ++ G+ + F
Sbjct: 355 YGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQVGRDATVDFP 414
Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
+A R +ML YLRL ++ ++ +S E AVL + +
Sbjct: 415 CYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAVLTSVITAVSRQ 471
Query: 444 LAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 485
L+ YP + +D A++ D +L +R+A + R EK++L +
Sbjct: 472 LSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 145/312 (46%), Gaps = 27/312 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + +K E+ + A ++++ G+ VP SLV+
Sbjct: 71 ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F +YP+ P E F
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVF 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
E FK +F + S +V L S+ R ALVP +L +SS+ + L V
Sbjct: 232 NMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSSEVETFLDYDKSSKGVVFT 288
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K
Sbjct: 289 TDRPYQPGEQVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEA 348
Query: 371 AQRNGKLSVQVF 382
+++G Q F
Sbjct: 349 LKKHGLSVSQCF 360
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 89/327 (27%), Positives = 154/327 (47%), Gaps = 31/327 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ K GL K++L S + A++ L+ G+ VP+ L++T +
Sbjct: 16 LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G ++ L E LA++L+ E + + S W PY L + S L W
Sbjct: 70 GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+E E+ +LT SP + + LE + Y +L ++F ++P P++ +T F
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAF 176
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRP 316
K AF + S +V L V + ALVP +L +S + + L + +V V DR
Sbjct: 177 KWAFGILFSRLVRLPSVG---KLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRA 232
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
Y++GE + + G + + +L + YGF+ E N +D + +E ++++DP ++ K A G
Sbjct: 233 YQSGEQVFISYGKRSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQG 292
Query: 376 KLSVQVFHVHAGREKEAISDMLPYLRL 402
S Q F V R+ + +L Y RL
Sbjct: 293 LSSPQRFPV---RKDGFPAQLLAYARL 316
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 147/315 (46%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 67 ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W++ EL YL S + +ER + YN+L +F +YP P E F
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVF 227
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
E FK +F + S +V L S+ ALVP +L +S + L +
Sbjct: 228 NIESFKWSFGILFSRLVRLP--SMGGNVALVPWAD-MLNHSCDVETFLDYDKTSKGIVFT 284
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV ++ NP D + + +L D Y++K +
Sbjct: 285 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLEL 344
Query: 371 AQRNGKLSVQVFHVH 385
+ G + Q F +
Sbjct: 345 LKNYGLSASQCFPIQ 359
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 148/315 (46%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 65 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
E FK +F + S +V L S+ + ALVP ++ +S + + L + +V
Sbjct: 226 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 282
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV ++ NP D + + +L D Y++K +
Sbjct: 283 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLEL 342
Query: 371 AQRNGKLSVQVFHVH 385
++ G Q F +
Sbjct: 343 LKKYGLSGSQCFPIR 357
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 148/315 (46%), Gaps = 27/315 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 62 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 222
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
E FK +F + S +V L S+ + ALVP ++ +S + + L + +V
Sbjct: 223 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 279
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DRPY+ GE + + G + N +LL++YGFV ++ NP D + + +L D Y++K +
Sbjct: 280 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLEL 339
Query: 371 AQRNGKLSVQVFHVH 385
++ G Q F +
Sbjct: 340 LKKYGLSGSQCFPIR 354
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 151/316 (47%), Gaps = 29/316 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 65 ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P+ P E F
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L +S + + L D + Q VV
Sbjct: 226 NMETFKWSFGILFSRLVRLP--SMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVF 281
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 369
DR Y+ E + + G + N +LL++YGFV + NP D++ + +L D Y++K
Sbjct: 282 TTDRTYQPSEQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSE 341
Query: 370 VAQRNGKLSVQVFHVH 385
+++G + Q F +
Sbjct: 342 AMKKHGLSTSQCFPIQ 357
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 147/311 (47%), Gaps = 29/311 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ NGLP K+ + +K E+ + A ++++ G+ VP SLV+T +
Sbjct: 81 LQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 134
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
E+L + + LA+YL+ E K S W YI L RQ S L W
Sbjct: 135 SCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYW 187
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ EL YL S + +ER + YN+L +F +YP P E F E F
Sbjct: 188 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETF 241
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRP 316
K +F + S +V L S+ + ALVP +L +S + + L D + Q VV DR
Sbjct: 242 KWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQ 297
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
Y+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K +++
Sbjct: 298 YEPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKH 357
Query: 375 GKLSVQVFHVH 385
G + Q F V
Sbjct: 358 GFSASQCFPVR 368
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 175/386 (45%), Gaps = 59/386 (15%)
Query: 9 SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
S+ + PP+R P H P S +S+ S R + R IR S++
Sbjct: 4 STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47
Query: 65 AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
RE + + E L+ W+ +GLP ++ + ++ E+ + A
Sbjct: 48 PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101
Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
++++ G+ VP SLV+T + G + E++ N + + +A YL+ E S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTV 234
YI L RQ S L W+ EL AYL SP + ++R + YN+L D +
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ LF P E + E F +F + S +V L S+ + ALVP +L +
Sbjct: 215 FSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWA-DMLNH 264
Query: 295 SSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
S + + L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 265 SPEVETFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 323
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNG 375
+ + +L+ D Y++K +RNG
Sbjct: 324 SVELLVSLDKSDKCYKEKLQALKRNG 349
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 123/461 (26%), Positives = 194/461 (42%), Gaps = 70/461 (15%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+DL L+ W+ KNGL + E ++ + V A +D + G+ VP L+ T
Sbjct: 66 DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122
Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + N + +LL +K S CLAL+L+ EK SFW PYI+ L + G
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+ L ++ EL L GSPT ++ + +Y + LFQ +
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIH-------DLFQIRKDILHIN 226
Query: 253 AFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
AFT++ F A AV Q+ V +L+ +AL+P + D+
Sbjct: 227 AFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YALIPAWDMCNHDHGDLQTFWDVNSDST 285
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-R 368
+ R YK GE + ++ GP+PNS LL++ GFV E+N +D L + L + +DK R
Sbjct: 286 ESHAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYENNRFDALAIRVRLAPDAEHIKDKLR 345
Query: 369 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS----------- 417
++ N K+ Q + G D++ +LR+ + + E+Q V+ +
Sbjct: 346 LLHLNNMKMDSQYYLYGLG----LAVDLMAFLRI-HAMNEQELQQVLGAYDQQEAKVHNG 400
Query: 418 --------------LGPICPVSPCMERAVLDQLADYFKARLAGYPATLS---------ED 454
P ++ E A L + L+ YP TL ED
Sbjct: 401 EHPASNGEVVASGVFDPRVKLNDRNELAALQLAEAKCLSLLSLYPTTLQVANGVELKQED 460
Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
+A L +L P R T L EK++LN L D I LL
Sbjct: 461 QAALRTTSLTPNMRAVTLLRLKEKEILNRTL----DAIRLL 497
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 145/305 (47%), Gaps = 27/305 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DR Y+ GE + + G + + +LL++YGFV ++ NP D + + +L+ D Y++K
Sbjct: 289 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQA 348
Query: 371 AQRNG 375
+RNG
Sbjct: 349 LKRNG 353
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 106/414 (25%), Positives = 180/414 (43%), Gaps = 58/414 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
A + GD +P L +T + R LG + + E ++E +A L++EK +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264
Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
G+ SF+ Y+ E+D V W + +LA+L GSP A ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316
Query: 228 YNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
Y++L + G L ++P P E +TFE ++ AF + S + L+ + + R A+V
Sbjct: 317 YDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMV 371
Query: 286 PLGPPLLAYSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
P L+ +S+ +A + A ++ V L DR Y+ E + + G + N++L
Sbjct: 372 PYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAEL 430
Query: 336 LINYGFVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVAQRNGKLSVQVF 382
L+ YGF E NPY+ + V ++ EDP +K G+ F
Sbjct: 431 LLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDF 490
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
+A R +ML YLRL ++ +S +S E +VL + K
Sbjct: 491 PCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEASVLRSVVAAVKY 547
Query: 443 RLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQVTADMI 492
+L +P T +D A++ D + +R+A + R EK++L L I
Sbjct: 548 QLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAALEKQI 601
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 150/316 (47%), Gaps = 29/316 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ ++ N R + A ++++ G+ VP SLV++
Sbjct: 65 ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E E+L N + + +A YL+ E K S W YI L RQ
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P P E F
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L ++ + + L D A Q VV
Sbjct: 226 NIETFKWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVF 281
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 369
DR Y+ GE + + G + N +LL++YGFV ++ NP D + + +L D Y++K
Sbjct: 282 TTDRAYQPGEQVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLE 341
Query: 370 VAQRNGKLSVQVFHVH 385
+++G + Q F +
Sbjct: 342 ALKKHGLRASQCFPIQ 357
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/305 (26%), Positives = 144/305 (47%), Gaps = 27/305 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R LVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
DR Y+ GE + + G + + +LL++YGFV ++ NP D + + +L+ D Y++K
Sbjct: 289 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQA 348
Query: 371 AQRNG 375
+RNG
Sbjct: 349 LKRNG 353
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 112/423 (26%), Positives = 189/423 (44%), Gaps = 40/423 (9%)
Query: 49 RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
R R R+++ L+ AGS E+ + E + +LK+W+ + GLPP K+ P
Sbjct: 26 RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
+ A++ + G++ S+P LV+T L + LL L + LAL+
Sbjct: 84 GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L ++ G W PY+R L + G L WSE E+ +L GS ++ LE
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
+ + E+ V A + + + AF + AF + S +V L L + A
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEA 244
Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
L+P LL + + L +A + AV L +R Y+AGE +++ G + + +LL++YGF
Sbjct: 245 LLPWA-DLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGF 303
Query: 342 VDE--DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
+ NP+D RL++E L D K +++G + Q+F + R A +++
Sbjct: 304 CPDLGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL---RMAAAPFELV 358
Query: 398 PY--LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
Y V E + + L + P ++ A L+ + KA LA YP + D
Sbjct: 359 HYTAFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAALAAYPRSFDGDR 418
Query: 456 AML 458
A L
Sbjct: 419 AEL 421
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 147/306 (48%), Gaps = 29/306 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ L EK E+ + A ++++ G+ VP +LV+T
Sbjct: 74 ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N + +++ + + LA YL+ E S W YI L RQ
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TE+ AYL SP + + R + YN+L +F ++P P + +
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRD------RIFSKHPDLFPEKVY 234
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E F+ +F + S +V L+ S+ + ALVP +L +S + A L D + Q +V
Sbjct: 235 NMENFRWSFGILFSRLVRLE--SMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVF 290
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 369
DR Y+ GE + + G + + +LL++YGFV ++ NP D + +L D Y++K
Sbjct: 291 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQ 350
Query: 370 VAQRNG 375
+++G
Sbjct: 351 ALKKHG 356
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 171/401 (42%), Gaps = 46/401 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A + GD +P L +T R + + + + ++E +A L++EK G +
Sbjct: 50 ARRSINDGDELLKIPMDLCIT--RKSARKALGKDALQDGINEYLAIACQLIHEKYVLGDE 107
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W Y+ L V W + +LA+L GSP A ++REY+ L
Sbjct: 108 SEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-- 160
Query: 234 VWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ G L ++P P E FT+E + AF + S + L+ + + R A+VP L
Sbjct: 161 ---LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYAD-L 216
Query: 292 LAYSSKCKAMLAAVD----------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ +S+ A + A + + V L DR Y+ E + + G + N++LL+ YGF
Sbjct: 217 INHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGF 276
Query: 342 VDEDNPYDRLVVEAALNTE-------------DPQYQDKRMVAQRNGKLSVQVFHVHAGR 388
E NPY+ + V ++ DP Q+K G+ F +A R
Sbjct: 277 ALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVEFLASVGRDQTVDFPCYADR 336
Query: 389 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 448
+ML +LRL ++ ++ +SP E AVL + + K +L YP
Sbjct: 337 YP---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTISPANEAAVLSSVVEAVKYQLNLYP 393
Query: 449 ATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 485
+ +D ++ D L +R+A + R EK++L L
Sbjct: 394 QSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLLKRTL 434
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/299 (27%), Positives = 136/299 (45%), Gaps = 26/299 (8%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ A+L+ + K L+ ++ +N+L+ FM + P E F E F
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESF 201
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
K AF + S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258
Query: 318 KAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
++ + + + G + N +LL+ YGFV N D + + L+ D Y+ K + +G
Sbjct: 259 ESNKEVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 110/433 (25%), Positives = 180/433 (41%), Gaps = 45/433 (10%)
Query: 68 REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
R E+ L L WM + G L P I+++ V A D+ G+
Sbjct: 2 RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54
Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
+P + + TLER +G ++L N + LA +L+ EK +G SFW P++
Sbjct: 55 QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL +SE E A + GS LER ++R+ E + +
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEVQRQSFEQE---------Y 151
Query: 243 QQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
Q +P E F FE + A +++ S + L+ +LVPL + + +
Sbjct: 152 AQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGP--SLVPLSD-MFNHRQPPDVL 208
Query: 302 LAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 358
+ +D +++ R AG I G + + L++ GFV D + D + + L
Sbjct: 209 WSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFLLHSGFVPDGNEENDEVYLSVGLP 268
Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM---QSVI 415
DP K+ + + F V + A + +LR+ + S + ++
Sbjct: 269 PGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWSVFSFLRMAHASPDEFLALSNRLL 328
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
S I PVS E VL LA + RL +P TL EDE +L + L P +R L R
Sbjct: 329 SGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLEEDERLLREGPLSPNERSCVLLRR 388
Query: 476 MEKKMLNACLQVT 488
EK++L L++T
Sbjct: 389 QEKRLLGDYLELT 401
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 145/308 (47%), Gaps = 29/308 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
F+ +F + S +V L S+ R ALVP +L +S + + L + DR
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
Y+ GE + + G + + +LL++YGFV ++ NP D + + +LN D Y++K +RN
Sbjct: 292 YQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRN 351
Query: 375 GKLSVQVF 382
G + F
Sbjct: 352 GLSEFESF 359
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/410 (26%), Positives = 173/410 (42%), Gaps = 67/410 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DL W+ NGL K+ L+ + + A+E+++ G+A V S ++T+ER
Sbjct: 66 DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER E + E+ + +P DI A
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALR 219
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ AF + S ++ L ++ ALVP +L + C A + AV L DR
Sbjct: 220 W-----AFDILFSRLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGSAVNLTTDR 271
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
Y AGE + G +P+S+LLI+YGF E +NP D + ++ DP Q K V +R
Sbjct: 272 AYAAGEQVWASYGQRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRR 331
Query: 374 NGKLSVQVFHVH-AGREKEAI-------------SDMLPYLRLGYVSDTSEMQSVISSL- 418
G V+ F + G ++ + S++ R + + QS+ S+
Sbjct: 332 MGLSPVETFPLRLNGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVR 391
Query: 419 -----------GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
G I P E AV + LAD L+ YP +L +D+ +
Sbjct: 392 GLAQGQARGKQGVILGGVPG-EIAVREMLADMCAEALSAYPNSLEKDKGI 440
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 145/308 (47%), Gaps = 29/308 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
F+ +F + S +V L S+ R ALVP +L +S + + L + DR
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
Y+ GE + + G + + +LL++YGFV ++ NP D + + +LN D Y++K +RN
Sbjct: 292 YQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRN 351
Query: 375 GKLSVQVF 382
G + F
Sbjct: 352 GLSEFESF 359
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 151/343 (44%), Gaps = 34/343 (9%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ +E +AL L+ E+ +G +SFW YI L G + W ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
SP + ++ E+ A L D E FTFE ++ AF + S
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265
Query: 271 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 320
+ L+ A+VP P + + + A +D V L DR YK
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSV 379
E + + GP+ N+ LL+ YGF + NP++ + + + +D Y K A+ G+ +S
Sbjct: 326 EQVFISYGPKSNADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSS 385
Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLAD 438
F ++A R + +++ +LR+ + D + + + +S E AVLD + D
Sbjct: 386 AAFPLYADRFPD---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRD 442
Query: 439 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
A +A YPA +D + D L +R+A +LV EK++L
Sbjct: 443 ACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 131/269 (48%), Gaps = 23/269 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +++ G+ VP SLV++ + N + +++ + + + LA YL+ E
Sbjct: 13 LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 73 SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+F ++P P E + E F+ +F + S +V L S+ + ALVP +
Sbjct: 126 ------RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWA-DM 176
Query: 292 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 346
L ++ + A L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ N
Sbjct: 177 LNHNPEVDAFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTN 235
Query: 347 PYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
P D + +LN D Y++K +R+G
Sbjct: 236 PNDSVEFSVSLNKSDDCYREKLQALKRHG 264
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 134/299 (44%), Gaps = 26/299 (8%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ +L+ + K L+ ++ +N+L+ FM + P E F + F
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSF 201
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
K AF + S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258
Query: 318 KAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
++ + + G + N +LL+ YGFV N D + + L+ D Y+ K + +G
Sbjct: 259 ESNREVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/423 (23%), Positives = 184/423 (43%), Gaps = 35/423 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
S +E++ W+ N + V ++ H + + + A+ D + G+ ++P
Sbjct: 71 SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+++T + N + L+ N++ + LAL+++ E SFWLPY++ L
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ SPL ++ +L L SPT +E++ + I R+Y + F L + P
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFNL-FQGHELASKLP 232
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAV 305
I + ++ ++ A +V + + + R AL+PL + + +
Sbjct: 233 --IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCNHTNGQITTDFSLK 290
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
+D + AG + ++ G + N++LLI+ GFV N DRL + ++ DP +
Sbjct: 291 NDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLTIRLGISKNDPLFS 350
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-- 423
K V R + ++F +H G SD L +LR+ +++ ++++ ++ I
Sbjct: 351 MKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLRTALACRQQISKLR 408
Query: 424 -----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 478
VS ER LA L YP + ED +L +L R+A QL EK
Sbjct: 409 DFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLSTHARLAVQLRHCEK 468
Query: 479 KML 481
+L
Sbjct: 469 NIL 471
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 105/455 (23%), Positives = 202/455 (44%), Gaps = 47/455 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 72 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FTF+ ++ A +V + + +R AL+PL +
Sbjct: 230 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 289
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ ++ ++ D Y
Sbjct: 290 EDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLY 349
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVI 415
K V R G + VF +H E + +L +LR+ G++ + +
Sbjct: 350 AMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIF 408
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
+ PVS E + +L + +AR L Y T+ +D +L ++ +A
Sbjct: 409 TLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAI 464
Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
+L R+EK++L L+ +D L + P P
Sbjct: 465 KLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 499
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 171/384 (44%), Gaps = 37/384 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ Q +P+D+ + F F F +C+ + L+PL
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCLE-------GQSLVLIPLADLAN 245
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + A Y+ + + A L + + L P KAG+ +++ + + N++L ++YG
Sbjct: 246 HSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLT 305
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
+ + + + + D Y DK +A+ NG F + E+ ++MLPYLRL
Sbjct: 306 ESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRL 363
Query: 403 GYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
+ D ++S+ S G + P+SP E + + D + L+GY T++EDE +
Sbjct: 364 VALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKL 423
Query: 458 LTDYNLHPKKRVATQLVRMEKKML 481
L + ++ P+ +A + EKK+L
Sbjct: 424 LAEGDIDPRLEIAITIRLGEKKVL 447
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 195/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 72 GKREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L+ T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 126 LLXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 182 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 230 NKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLITTGYNL 289
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 290 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 349
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVI 415
K V R G + VF +H E + +L +LR+ ++ S + +
Sbjct: 350 AXKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCXTEEELKEHLLGDSAIDRIF 408
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L +++L + + A +L
Sbjct: 409 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKXAIKLRL 468
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 469 GEKEILEKAVKSAA 482
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 105/455 (23%), Positives = 202/455 (44%), Gaps = 47/455 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 73 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FTF+ ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVI 415
K V R G + VF +H E + +L +LR+ G++ + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
+ PVS E + +L + +AR L Y T+ +D +L ++ +A
Sbjct: 410 TLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAI 465
Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
+L R+EK++L L+ +D L + P P
Sbjct: 466 KLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 500
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 110/432 (25%), Positives = 184/432 (42%), Gaps = 48/432 (11%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ + KE + V D +AG A VP S V
Sbjct: 48 EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++EL + EL LAL+L E+ +G S W PY++ L +
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152
Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W++ + A L GSP A+ +ER++ + EY + V + P P EA
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEV-------IKSDPSSYPPEA 205
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL----------GPPLLAYSS-----KC 298
+ F + A+ + + A+ +ALVPL P +L S+ +C
Sbjct: 206 YEFLTEARFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRC 265
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAA 356
VD A ++ A S V+ + N +L +N G+VD+ +P D + ++
Sbjct: 266 GPADYDVDTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTD 325
Query: 357 LNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
+ T D + K+ V + G + Q F V+ R + + YLR V D EM +V
Sbjct: 326 IQTSDRLFTAKKQVLEGMGFTAADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVS 382
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
I VS E +L L + +A Y ++ +L + P + + +R
Sbjct: 383 FEEDKI--VSVMNEYEILQILMGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLR 440
Query: 476 M-EKKMLNACLQ 486
M EKK++ A +
Sbjct: 441 MSEKKLIGATMN 452
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 194/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-- 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPQA 230
Query: 247 YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+ +P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 HKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEGELKEHLLGENAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+ L ++ L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLKNHGLSVRATMAVKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVESAA 483
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 173/381 (45%), Gaps = 27/381 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSLARRFALVP 286
+ Q +P+D+ + F F + + +AF ++ ++ L + + +
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPD 253
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDED 345
+ P A+ + A L + + L P KAG+ +++ + + N++L ++YG +
Sbjct: 254 ITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLTESR 312
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
+ + + + D Y DK +A+ NG F + E+ ++MLPYLRL +
Sbjct: 313 SERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRLVAL 370
Query: 406 S--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
D ++S+ S G + P+SP E + + D + L+GY T++EDE +L +
Sbjct: 371 GGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKLLAE 430
Query: 461 YNLHPKKRVATQLVRMEKKML 481
++ P+ +A + EKK+L
Sbjct: 431 GDIDPRLEIAITIRLGEKKVL 451
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 198/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L +++L + ++A +L
Sbjct: 410 TLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 198/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L +++L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 198/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H+ E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED++ L D++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKDHDLSVRAAMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 416
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 417 TLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAVKLRL 476
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 477 GEKEILERAVKSAA 490
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 231 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILERAVKSAA 483
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E E+ + A+ D++A + VP
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 409
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 410 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 469
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 470 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 529
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 530 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 588
Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED + L +++L + +A +L
Sbjct: 589 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLRNHDLSVRAAMAVKLRL 648
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 649 GEKEILERAVKSAA 662
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/431 (23%), Positives = 193/431 (44%), Gaps = 39/431 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 416
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 417 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRL 476
Query: 476 MEKKMLNACLQ 486
EK++L ++
Sbjct: 477 GEKEILERAVK 487
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/412 (23%), Positives = 191/412 (46%), Gaps = 38/412 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 251
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
+DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 252 HDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTE 310
Query: 408 ---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
S + + + PVS E + L D L Y T+ ED+++L
Sbjct: 311 EELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVL 370
Query: 459 TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 510
+++L + ++A +L EK++L ++ A + P + +SP APL
Sbjct: 371 KNHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGKAPL 417
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 416
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 417 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRL 476
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 477 GEKEILERAVKSAA 490
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/452 (23%), Positives = 201/452 (44%), Gaps = 47/452 (10%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
E+ +L W +NG L E P + A+ +++A + VP L++
Sbjct: 1 EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54
Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 55 TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DI 249
++PL ++E E+ YL + ++ + + R+Y +F + Q +P +
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNANKL 158
Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDD 307
P ++FTF+ ++ A +V + + +R AL+PL +S DD
Sbjct: 159 PLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTGYNLEDD 218
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
+ V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ ++ ++ D Y K
Sbjct: 219 RCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMK 278
Query: 368 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSL 418
V R G + VF +H E + +L +LR+ G++ + + +
Sbjct: 279 AEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLG 337
Query: 419 GPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
PVS E + +L + +AR L Y T+ +D +L ++ +A +L
Sbjct: 338 NSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLR 393
Query: 475 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
R+EK++L L+ +D L + P P
Sbjct: 394 RVEKEILEKALKSASDNRKLYSKNSEEGTPLP 425
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 136/327 (41%), Gaps = 57/327 (17%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + ELA + S E + R I+ E+NE+ + FQ+Y + + ++ F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASF 195
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD------------ 307
K A+V C ++ + A+VP + + AML D
Sbjct: 196 KHAYVTATVCS-RAWRIDGLEKLAMVPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVY 253
Query: 308 ---------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
QL D+ Y AGE + + GP N+ L +++GF NP+D++ + ++
Sbjct: 254 SDCLDIALFCAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGIS 313
Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVH 385
D ++K +Q H H
Sbjct: 314 RRDSLRKEK-----------LQYLHAH 329
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKTVKSAA 483
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 101/404 (25%), Positives = 185/404 (45%), Gaps = 58/404 (14%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
+++ +R +H A + + +P S ++TLE + ET +A+ + KL+ L+
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227
Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L+ +L+ E+K K+S W PY+ L + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
E+ IKR+Y+++ +V F +Y TFE F A + S V LQ ++
Sbjct: 281 REKKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-IN 326
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 332
+ A VPL L + K DD +Q + D P GE + G + N
Sbjct: 327 EQKTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCN 382
Query: 333 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKE 391
S+ +NYGF++ DN + + + + EDP + K+ + G + +V+ + +++
Sbjct: 383 SRFFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQ 440
Query: 392 AISDMLPYLRLGYVSDTS-------------EMQSVISSLGP--ICPVSPCMERAVLDQL 436
+S+ + YLR + D S E +S P P+S E + ++
Sbjct: 441 NVSEFMSYLRFILIRDNSKLLMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRI 500
Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
++ + ++ Y TL ED+ +L NL +R L EK++
Sbjct: 501 SNMCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 168/387 (43%), Gaps = 38/387 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ L +AL L+ E +G
Sbjct: 84 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--NE 230
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +E
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGVK-EYVQSE 191
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPL 287
D+V S + D+ + TF+ F AF ++S V + K++L LV
Sbjct: 192 FDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALIPFADLVNH 247
Query: 288 GPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + + S K K + + L K+G+ I + + + N++L ++YGFV
Sbjct: 248 SPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLDKSNAELALDYGFV 306
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
+ + D V ++ DP Y DK +A+ NG F V + MLPYLRL
Sbjct: 307 ESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPLPPQMLPYLRL 364
Query: 403 GYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
+ T SV L P+SP E ++ + D K+ LA Y T+ EDE
Sbjct: 365 LCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALADYHTTIEEDE 422
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLN 482
+ NL P+ +A + EKK+L
Sbjct: 423 ELSGRENLQPRLAIAIGVRAGEKKVLQ 449
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 88 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 141
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 142 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 197
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 198 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 245
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 246 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 305
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 306 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 365
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 366 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 424
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 425 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 484
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 485 GEKEILEKAVKSAA 498
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 352
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 353 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 411
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 412 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 471
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 472 GEKEILEKAVKSAA 485
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 352
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 353 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 411
Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 412 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 471
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 472 GEKEILEKAVKSAA 485
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 198/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQSLPGE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E S E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y + ED++ L +++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLKNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+A+L L + ++A +L
Sbjct: 410 TLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLKGPELPTRMKMAVKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L +Q A
Sbjct: 470 GEKEILERTVQSAA 483
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+A L ++L + +A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKGHDLSIRATMAVKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILERAVKSAA 483
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409
Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+ +L +++L + +A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/438 (24%), Positives = 189/438 (43%), Gaps = 46/438 (10%)
Query: 70 VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
S + L +L W+ + G P +++ +E V A + AG+
Sbjct: 10 AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61
Query: 128 VPNSLVVTLERVLGNET-----IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
VP + ++TLE L E+ IAE L + +E LA +L+ EK + + SFW PYI
Sbjct: 62 VPRTHMLTLE--LARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDS 116
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL + E A L G + +A+ ++ +Y SL
Sbjct: 117 LPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYL----------SLC 160
Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML 302
Q P E FT F A ++V S + L+K + LVP+ +L + +
Sbjct: 161 QNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TLVPMAD-MLNHRRPPDVLW 215
Query: 303 AAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
+D V+ + AG+ + G + N +L+++GFV +DN +D + +
Sbjct: 216 ETTEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLLHFGFVTDDNEHDEAFLGLRILDG 275
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQS-VISS 417
DP K+M+ + + F + +LR+ +D ++ S V+S
Sbjct: 276 DPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRIAAAVPNDIEDISSRVMSG 335
Query: 418 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
+ P+S E VL+ LA +ARL+ +P +L++DE +L +L P R + R E
Sbjct: 336 ERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLRGESLSPNARNCVLVRRAE 395
Query: 478 KKMLNACLQVTADMIMLL 495
K+++ L++T + LL
Sbjct: 396 KQLIEDYLEMTRVCLKLL 413
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/439 (23%), Positives = 189/439 (43%), Gaps = 65/439 (14%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W +NG V +K++P + + + A E ++ +P LV+T ++
Sbjct: 50 WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
++ EL L+GS E L+ I R+Y + F L + P+ + FT++++
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLY 210
Query: 260 KQAFVAV---QSCVVHLQKVSL----------ARRFALVPL--------GPPLLAYSS-- 296
+ A AV Q+ V + L A ALVPL G L Y S
Sbjct: 211 RWAVSAVMTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSA 270
Query: 297 ---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+C AM R + GE + ++ G + N++ I+ GFV EDN YD + +
Sbjct: 271 SMVRCYAM-------------RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDI 317
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
+ ++ +DP + K + + + LS+ R++ D+ +LR+ + D S+ ++
Sbjct: 318 KLGVSKKDPLFAVKSKLCE-DHDLSLSGTFALVARDRPVSEDLSTFLRILVLKDASQPEA 376
Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
S I S R L L + L +P + E E ++ D + + ++A +L
Sbjct: 377 F--SAEHILTSSDSNARDALTFLVVRIELLLKAFPKSDEEYEDIIKDGASNARVKMAARL 434
Query: 474 VRMEKKMLNACLQVTADMI 492
+E K+L + L+ + +
Sbjct: 435 RLLESKVLASVLETLGNHV 453
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+ +L +++L + +A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 170/397 (42%), Gaps = 59/397 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 348
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
S F+ +I +E TF F AF ++S V + + AL+
Sbjct: 349 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 398
Query: 286 PLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 333
P L+ +S K K + D L K+GE I + + + N+
Sbjct: 399 PFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNA 456
Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 393
+L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G E++
Sbjct: 457 ELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG---ESL 513
Query: 394 S-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
MLPYLRL + T +V L PVS E A+ + + K+ L
Sbjct: 514 PPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAICQVIRNACKSALG 571
Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 572 AYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 608
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H + + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/433 (23%), Positives = 194/433 (44%), Gaps = 47/433 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 305
+P +AFT+E ++ V S + + L + G P + +
Sbjct: 231 NKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKPQITTGYNLE------ 281
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 282 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 341
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
K V R G + VF +H E + +L +LR+ +++ + + I +
Sbjct: 342 MKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFT 400
Query: 418 LG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
LG PVS E + L D L Y T+ ED+A L ++NL + +A +L
Sbjct: 401 LGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKNHNLSVRATMAIKLRLG 460
Query: 477 EKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 461 EKEILEKAVKSAA 473
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H + + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/395 (25%), Positives = 167/395 (42%), Gaps = 55/395 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 90 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 194
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
S F+ +I +E TF F AF ++S V + + AL+
Sbjct: 195 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 244
Query: 286 PLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSK 334
P + ++ K K + D L K+GE I + + + N++
Sbjct: 245 PFADLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNAE 303
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G +
Sbjct: 304 LALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG--ESLPP 361
Query: 395 DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 447
MLPYLRL + T +V L PVS E A+ + + K+ L Y
Sbjct: 362 QMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACKSALGAY 419
Query: 448 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 420 HTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 454
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/418 (23%), Positives = 189/418 (45%), Gaps = 39/418 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
+ PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKL 467
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 194/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 87 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 244
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 245 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLITTGYNL 304
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 305 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 364
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ + + +
Sbjct: 365 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDRIF 423
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED A L + +L + +A +L
Sbjct: 424 TLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLKNSDLSVRATMAIKLRL 483
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 484 GEKEILEKAVKSAA 497
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H + + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H + + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED++ L +++L + +A +L
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/437 (23%), Positives = 193/437 (44%), Gaps = 45/437 (10%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
DD + V R + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDR 348
Query: 363 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI- 421
Y K V R G + VF +H E + +L +LR+ + E++ + G I
Sbjct: 349 LYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTEEELREHLLGDGAID 406
Query: 422 ---------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
PVS E + L D L Y T+ ED+A+L L + +A +
Sbjct: 407 RIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVLRSPALSARAAMAVK 466
Query: 473 LVRMEKKMLNACLQVTA 489
L EK++L ++ A
Sbjct: 467 LRLGEKEILEKAVRSAA 483
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/436 (23%), Positives = 187/436 (42%), Gaps = 45/436 (10%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T+ GSRE + DL SW +NG + + R + D++A
Sbjct: 69 TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
+ VP +++T+E N + + + +++ + LAL+L+ E+ SFW P
Sbjct: 117 EELFLWVPRKMLMTVESAQ-NSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L ++ ++PL + + ++ L G+ ++L + + R+Y +F
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY-----AYFY- 221
Query: 239 GSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAY 294
L Q +P +P + F+F+ ++ A +V + + V +R AL+PL
Sbjct: 222 -KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHT 280
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+ DD + V + YK E I ++ G + N++ +I+ GF +DN +DR+ ++
Sbjct: 281 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHDRVKIK 340
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYV 405
++ + Y K V R G + VF +H + + +L +LR+ Y+
Sbjct: 341 LGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVFCMTEEELKDYL 399
Query: 406 SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 465
+ + + PVS E + L L Y T ED ++L +L
Sbjct: 400 LGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSILEKPDLSL 459
Query: 466 KKRVATQLVRMEKKML 481
R+A QL EK++L
Sbjct: 460 HTRLAVQLRLAEKQIL 475
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 182/391 (46%), Gaps = 33/391 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 251
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
+DR+ ++ ++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 252 HDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTE 310
Query: 408 TSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
+ + I +LG PVS E + L D L Y T+ ED+ +L
Sbjct: 311 EELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVL 370
Query: 459 TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
+ +L + ++A +L EK++L ++ A
Sbjct: 371 KNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 164/395 (41%), Gaps = 50/395 (12%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 83 DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + PLLW+ EL L GSP + + + R E EY +
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASI----- 189
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL-------- 287
+ P D P +A+ F + + AFV A+ + + ++ A +A+VPL
Sbjct: 190 --ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAMVPLVDLLPLVG 246
Query: 288 -----------------GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
G P LA ++ AA + + + + + V +
Sbjct: 247 SPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLAR 306
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGRE 389
L + G VDE + D L A+ D Y+ KR + + G Q F V A R
Sbjct: 307 NAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADRM 366
Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
+L Y+R V D E+ SV I VSP E VL L + LA Y +
Sbjct: 367 P---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQDAREMLAEYES 421
Query: 450 TLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 483
+ E E + L + L ++RVA +L EK+++NA
Sbjct: 422 SSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 100/434 (23%), Positives = 194/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG V E + E+ + A+ D++A + VP
Sbjct: 62 GKREDYFPNLMKWASENG---ASVEGFEMFNFKEEGFGLR---ATRDIKAEELFLWVPRK 115
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 116 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 171
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 172 -----YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 219
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 220 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 279
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 280 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 339
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ + + +
Sbjct: 340 AMKAEVLARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDKIF 398
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T ED++ L +++L + R+A +L
Sbjct: 399 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKNHDLSVRARMAIKLRL 458
Query: 476 MEKKMLNACLQVTA 489
EK++L + A
Sbjct: 459 GEKEILEKAVTSAA 472
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/393 (23%), Positives = 184/393 (46%), Gaps = 33/393 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISAQLLAFLRVFCM 390
Query: 406 SDTSEMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LAGYPATLSEDEA 456
S+ + +I ++G I + D +L + +AR L Y T+ D++
Sbjct: 391 SEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVDKS 450
Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
L ++L P +A +L EK++L ++ A
Sbjct: 451 FLETHDLTPHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 134/309 (43%), Gaps = 40/309 (12%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ LP K+ L+ + + A+E+++ G+A VP + ++T+ER
Sbjct: 83 ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER + + +E+ + +P T
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------IT 231
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ AF + S ++ L ++ ALVP +L + C A + DAV L DR
Sbjct: 232 VGALRWAFDILFSRLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDR 288
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
Y GE + G +P+S+LLI+YGF E +NP D + ++ DP K V +
Sbjct: 289 SYVKGEQVWASYGQRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRD 348
Query: 374 NGKLSVQVF 382
G V+ F
Sbjct: 349 MGLSPVETF 357
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/438 (23%), Positives = 189/438 (43%), Gaps = 37/438 (8%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
VAG S +E W++ NG+ V + K + + A++D++
Sbjct: 66 VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
++P +++T + + + TI +L+ ++L + LA++++ EK + SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L + PL ++ E+ GS E L++ + I R+Y L F +
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL----FKLLN 226
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLLAYSSKCK 299
L + I E FT++ ++ A V + + K +L+PL + + K
Sbjct: 227 LPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCNHANGEMK 285
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
D+ + R + GE I + G + ++ LL+ GFV N YD + ++ L++
Sbjct: 286 TDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLYSGFVYPGNVYDGMAIQLGLSS 345
Query: 360 EDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSD---------T 408
D Y K + KL V Q +H+ AG+E + ++L +LR+ + D
Sbjct: 346 SDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQDLELRDRLLGD 403
Query: 409 SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 468
+ Q++ S + +S E LA Y ++ EDE L D NL ++R
Sbjct: 404 NRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEKLKDGNLSAQER 463
Query: 469 VATQLVRMEKKMLNACLQ 486
A QL+ +EK L L+
Sbjct: 464 SALQLLLIEKCTLENVLE 481
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 115/431 (26%), Positives = 175/431 (40%), Gaps = 52/431 (12%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ ++ KE R VA D+ AG +VP V
Sbjct: 47 EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++ L EL LAL+L E+ +G S W PY++ L
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTLAAN-------P 151
Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W+E E A L GSP + +ER+ + EY + V + P P EA
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEA 204
Query: 254 FTF---EIFKQAFVAVQSCVVHLQKVSLARRFALVPL-------GPPLLAY---SSKCKA 300
+ F E F A V + L S +ALVPL G P+ S+K
Sbjct: 205 YEFFTEERFVDALATVCAKATWLPTASC---YALVPLLDVITIAGSPVPGVSPPSAKDGI 261
Query: 301 MLAAVD---DAVQLVVDRPYKA-GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVE 354
A D D+ +V+ KA S VV P + N +L +N G VD+ +P D L +
Sbjct: 262 ARCAADYDVDSACVVLSAVVKAPANSRVVQLDPLQRNNGELFLNTGRVDQKHPGDYLYMR 321
Query: 355 AALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
+ D + K+ V + G Q F V+ E + + YLR V D EM +
Sbjct: 322 TEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY---EDRMPTQLYSYLRFARVQDPGEMMA 378
Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
V I VS E +L L + ++ Y ++ +L + + +
Sbjct: 379 VSFEEDKI--VSVMNEYEILQLLMGDCRELMSEYDTNEEDELNLLKLSDTMRVREIEAAK 436
Query: 474 VRMEKKMLNAC 484
+RM +K L C
Sbjct: 437 LRMSEKKLIGC 447
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 76 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 129
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 130 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 185
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 186 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 233
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 234 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 293
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 294 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 353
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H+ E + +L +LR+ +++ + + I
Sbjct: 354 AMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 412
Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 413 TLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 472
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 473 GEKEILEKAVKSAA 486
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/434 (23%), Positives = 195/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H+ E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 410 TLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 125/251 (49%), Gaps = 23/251 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ +
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ P+ + E FK A+ V S ++ + + ALVP +
Sbjct: 117 LLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLALVPF-VDFFNH 164
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+ C+AML+ +D ++V DR Y G+ +V+ G N+ L +++GF NP+D++
Sbjct: 165 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVA 224
Query: 353 -VEAALNTEDP 362
+ +L+ +DP
Sbjct: 225 GIWLSLSEKDP 235
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 100/434 (23%), Positives = 194/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 102/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAESVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
K V R G + VF +H+ E + +L +LR+ +++ + + I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 410 TLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 167/377 (44%), Gaps = 39/377 (10%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + + V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + + E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSLA----RRFALVPLGPPLLAYSS 296
+T E F AF ++S +L V LA F L P +
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDP---GWKV 196
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVE 354
K + A + + E ++ + + N++L +YGFVD D N D +
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLT 256
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQ 412
++ + DK +AQ G S F+++ R + DM+ YLRL + SD+ ++
Sbjct: 257 LQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLE 314
Query: 413 SVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
++ + P+S E A+ + + + +A L Y +T+ ED +L L +K++
Sbjct: 315 ALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKM 374
Query: 470 ATQLVRMEKKMLNACLQ 486
A + EK++L LQ
Sbjct: 375 AVVVRLGEKRILQEQLQ 391
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 168/380 (44%), Gaps = 28/380 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L S + WSE EL + G+ L E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ S +P + + F I + +AF ++ Q + L L+ P +
Sbjct: 190 EEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPSI 244
Query: 292 LA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
Y+ + K A L + D L KAGE +++ + + N++L ++YGF++
Sbjct: 245 TTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPN 304
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS- 406
+ + ++ DP + DK +A+ NG + F + G+ A MLPYLRL +
Sbjct: 305 RNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGG 362
Query: 407 -DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
D ++S+ + G + PVS E + + D K+ L+GY T+ EDE + + N
Sbjct: 363 PDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGN 422
Query: 463 LHPKKRVATQLVRMEKKMLN 482
LHP+ +A + EKK+L
Sbjct: 423 LHPRLEIAVGVRTGEKKVLQ 442
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 170/385 (44%), Gaps = 38/385 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L S + WSE EL + G+ L E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ S Q +P+ + T + F AF ++S + + L+PL
Sbjct: 190 EEVILPHS--QLFPFPV-----TLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLIN 239
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + Y+ + K A L + D L KAGE +++ + + N++L ++YGF+
Sbjct: 240 HSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFI 299
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
+ + + ++ DP + DK +A+ NG + F + G+ A MLPYLRL
Sbjct: 300 ESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--MLPYLRL 357
Query: 403 GYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
+ D ++S+ + G + PVS E + + D K+ L+GY T+ EDE +
Sbjct: 358 VALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKL 417
Query: 458 LTDYNLHPKKRVATQLVRMEKKMLN 482
+ NLHP+ +A + EKK+L
Sbjct: 418 KEEGNLHPRLEIAVGVRTGEKKVLQ 442
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 168/377 (44%), Gaps = 39/377 (10%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + ++ V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDVDTVRRSE-IGECCAG--LRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + I E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSLA----RRFALVPLGPPLLAYSS 296
+T E F AF ++S +L V LA F L P +
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDP---YWHV 196
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVE 354
K + A + + E ++ + + N++L +YGFVD D N D +
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLT 256
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQ 412
++ + DK +AQ G S F+++ R + DM+ YLRL + SD+ ++
Sbjct: 257 LQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLE 314
Query: 413 SVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
++ + P+S E A+ + + + +A L Y +T+ ED +L L +K++
Sbjct: 315 ALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKM 374
Query: 470 ATQLVRMEKKMLNACLQ 486
A + EK++L LQ
Sbjct: 375 AVVVRLGEKRILQEQLQ 391
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 89/383 (23%), Positives = 168/383 (43%), Gaps = 29/383 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNA 333
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----- 402
+DR+ ++ ++ + Y K V R G + +F +H E + +L +LR+
Sbjct: 334 HDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTE 392
Query: 403 ----GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
Y+ + + + PVS E + L L Y ED +ML
Sbjct: 393 EELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSML 452
Query: 459 TDYNLHPKKRVATQLVRMEKKML 481
+L R+A +L EK++L
Sbjct: 453 EKPDLSLHSRIAIKLRLAEKEIL 475
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 99/434 (22%), Positives = 194/434 (44%), Gaps = 39/434 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
K V R G + VF +H E + +L +LR+ +++ S + +
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRL 469
Query: 476 MEKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 470 GEKEILEKAVKSAA 483
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 175/392 (44%), Gaps = 43/392 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L +++ + LAL+L+ E+
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERAN- 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + E+ L G+ ++L + R+Y
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY--- 217
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
+F L Q +P +P ++FTF+ ++ A +V + + ++V+LA
Sbjct: 218 --AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA---- 269
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+PL + DD + V + YK E I ++ G + N++ +I+ GF
Sbjct: 270 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFY 329
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 403
++N +D++ ++ ++ + Y K V R G +F ++ E+ + +L +LR+
Sbjct: 330 QENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPISAQLLAFLRV- 387
Query: 404 YVSDTSEMQSV---------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
+ E++ I +LG + PVS E + L L Y T E
Sbjct: 388 FCMKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEE 447
Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
D + L L P R+A QL EK +L L
Sbjct: 448 DSSTLEKSELSPHSRMAIQLRLAEKWILEKAL 479
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 94/397 (23%), Positives = 184/397 (46%), Gaps = 41/397 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + +PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCM 390
Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
S+ + +I +LG PVS E +L + +AR L Y T+
Sbjct: 391 SEEELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVE 446
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
+D++ L ++L +A +L EK++L ++ A
Sbjct: 447 DDKSFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/383 (22%), Positives = 167/383 (43%), Gaps = 29/383 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNA 333
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----- 402
+DR+ ++ ++ + Y K V R G + +F +H E + +L +LR+
Sbjct: 334 HDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTE 392
Query: 403 ----GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
Y+ + + + PVS E + L L Y ED +ML
Sbjct: 393 EELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSML 452
Query: 459 TDYNLHPKKRVATQLVRMEKKML 481
+L R+ +L EK++L
Sbjct: 453 EKPDLSLHSRITIKLRLAEKEIL 475
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 128/274 (46%), Gaps = 28/274 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++ G+ V L++T ++ + ELL T ++E A LAL+++ E+ G+ S
Sbjct: 5 AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L A+ S + W + EL + + E ++R I E+ + V
Sbjct: 62 QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
Q+ P+ I E FKQA+ +S + S R VP +
Sbjct: 117 -------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILTVPF-VDFFNH 162
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
S C+A+L+ ++ +++ D+ Y GE +V+ G PN+ L +++GF NPYD++
Sbjct: 163 DSNCRALLSYDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFTISCNPYDQVE 222
Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 386
V AL+ DP + K + +G +V VHA
Sbjct: 223 VWMALSHRDPLRKMKLALLHAHGMPTV----VHA 252
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 173/391 (44%), Gaps = 41/391 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L T +++ + LAL+L+ E+
Sbjct: 28 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + ++ L G+ ++L + R+Y
Sbjct: 86 PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
+F L Q +P +P ++FTF+ ++ A +V + + ++V+LA
Sbjct: 136 --AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA---- 187
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+PL + DD + V + YK E I ++ G + N++ +I+ GF
Sbjct: 188 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFY 247
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL- 402
++N +D++ ++ ++ + Y K V R G VF ++ E + +L +LR+
Sbjct: 248 QENAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPISAQLLAFLRVF 306
Query: 403 --------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
Y+ + +++ PVS E + L L Y T ED
Sbjct: 307 CMMEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALLLKAYKTTAEED 366
Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ L +L P R+A QL EK +L L
Sbjct: 367 SSTLDKTDLSPHSRMAVQLRLAEKAILEKAL 397
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 125/251 (49%), Gaps = 25/251 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ + V
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L QQ + + T ++ V S ++ + + ALVP +
Sbjct: 117 CM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLALVPF-VDFFNH 162
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+ C+AML+ +D ++V DR Y G+ +V+ G N+ L +++GF NP+D++
Sbjct: 163 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVA 222
Query: 353 -VEAALNTEDP 362
+ +L+ +DP
Sbjct: 223 GIWLSLSEKDP 233
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 185/397 (46%), Gaps = 41/397 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCM 390
Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
++ + +I +LG P+S E +L + +AR L Y T+
Sbjct: 391 NEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVE 446
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
+D++ L ++L +A +L EK++L ++ A
Sbjct: 447 DDKSFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 169/387 (43%), Gaps = 38/387 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ +L G+ VP L + + V ++ + L ++L ++ E +G
Sbjct: 86 LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ G +P + TF+ F AF ++S V + + AL+P L+
Sbjct: 199 AK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLALIPFAD-LI 247
Query: 293 AYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYG 340
+S+ + K L D L K+GE + V + + N++L ++YG
Sbjct: 248 NHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLDKSNAELALDYG 306
Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 400
F + ++ D + ++ DP Y DK +A+ NG F V G + M+ YL
Sbjct: 307 FTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYL 364
Query: 401 RLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
RL + T ++++ + G + PVS E ++ + K+ L Y T+ EDE
Sbjct: 365 RLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALTAYHTTIEEDE 424
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLN 482
+L +L + ++A ++ EKK+L
Sbjct: 425 ELLKREDLQSRHQIAVEVRAGEKKVLQ 451
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/417 (23%), Positives = 175/417 (41%), Gaps = 35/417 (8%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W NG V +K+ P + A E ++ + VP L++T
Sbjct: 80 WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+S EL L GS E L+ I R+Y+ + F L + PY + FT++++
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLY 240
Query: 260 KQAFVAVQS--CVVHLQKVSLARR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ A AV + V L + A+VPL K + ++
Sbjct: 241 RWAVSAVMTRQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLR 300
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
R ++ G+ + ++ G + N++ I+ GFV +N +D + ++ ++ +DP Y K +
Sbjct: 301 CYAMRDFEKGQEVTIFYGRRTNAEFFIHNGFVFPENRHDSVDIKLGISKQDPLYAVKAKL 360
Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
+ +F + RE+ D+ +LR+ + D S+ S I + R
Sbjct: 361 CDDHELTPSGIFAL-VPRERPVCEDLSTFLRILVLKDASQAASFTDE--HIMVATDDNAR 417
Query: 431 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 487
L+ L + L +P + E E ++ D + + ++A QL +E+K+L A L+
Sbjct: 418 EALNFLIVRIQLLLRAFPKSDQEYENIIADEGSNGRLKMAAQLRLLERKILTAVLET 474
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 102/438 (23%), Positives = 195/438 (44%), Gaps = 47/438 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG L N K + A+ +++A + VP
Sbjct: 80 GKREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 190 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 237
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 238 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 297
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + + GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 298 EDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS-------- 416
K V R G + VF +H E + +L +LR+ +++ + +I
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIF 416
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
+LG PVS E +L + +AR L Y T+ ED++ L ++L +A
Sbjct: 417 TLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAI 472
Query: 472 QLVRMEKKMLNACLQVTA 489
+L EK++L ++ A
Sbjct: 473 KLRLGEKEILEKAVKSAA 490
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 92/397 (23%), Positives = 184/397 (46%), Gaps = 41/397 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E + + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCM 390
Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
++ + +I +LG P+S E +L + +AR L Y T+
Sbjct: 391 NEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVE 446
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
+D+ L ++L +A +L EKK+L ++ A
Sbjct: 447 DDKLFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 94/397 (23%), Positives = 182/397 (45%), Gaps = 41/397 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATRDIKAEELFLWVPRKLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER- 165
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + + L + E E+ YL + +I + + R+Y
Sbjct: 166 YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR+ +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCM 390
Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
++ + +I +LG PVS E +L + +AR L Y T+
Sbjct: 391 TEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIH 446
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
+D+ +L +L +A +L EK++L ++ A
Sbjct: 447 DDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 102/438 (23%), Positives = 194/438 (44%), Gaps = 47/438 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG L N K + A+ +++A + VP
Sbjct: 73 GKREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + + GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 291 EDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS-------- 416
K V R G + VF +H E + +L +LR+ +++ + +I
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIF 409
Query: 417 SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
+LG PVS E +L + +AR L Y T+ ED++ L ++L +A
Sbjct: 410 TLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAI 465
Query: 472 QLVRMEKKMLNACLQVTA 489
+L EK++L ++ A
Sbjct: 466 KLRLGEKEILEKAVKSAA 483
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 103/455 (22%), Positives = 184/455 (40%), Gaps = 52/455 (11%)
Query: 67 SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
+++ K+E D + + W+ +G K +K + E + ++ D++ G+
Sbjct: 56 GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111
Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
+P L ++T + G LL N + + LALYL+ E S P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
Y++ L + + W + L GSP + G R+Y +F
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY- 217
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSS 296
LF P + T FT+E F A VQS V + AL+P ++
Sbjct: 218 -QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGG 273
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
K + V + + YK GE + ++ GP+PNS+ + GF + N D +
Sbjct: 274 KITTFIDPVKHVLTCSAAKSYKKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMD 333
Query: 357 LNTEDPQ--YQDK-RMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVS--DTSE 410
L+ ED + DK ++ +R G Q + E + ++++P+ R+ +S +T +
Sbjct: 334 LDNEDDRDIAHDKIHILEERCGLRVGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKK 393
Query: 411 M-----------QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
+ + P +S E+ L D KARL+GYP TL++DE
Sbjct: 394 LAPPQEEGHHHHHQGPMDMKPEAFNIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQE 453
Query: 458 LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
+ + N ++R ++ EKK+L ++ +I
Sbjct: 454 MKN-NPTTQRRYVLYILINEKKILERNIKYVEQLI 487
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 171/389 (43%), Gaps = 37/389 (9%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S+ + G+ F+VP + +T + ++ + L L +AL+L++E+ G S
Sbjct: 4 SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L G SP+ W E +LA L GS + ++ +++L
Sbjct: 61 WAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEV 113
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSLARRFALVPLGP 289
F S +D P F F+ F A V++ ++ V LA P P
Sbjct: 114 FGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWP 167
Query: 290 PLLA-YSSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWCGPQ-PNSKLLINYGFVDE- 344
P A + K L LV++ AG++I + GPQ + +LL+++G +D
Sbjct: 168 PDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPL 227
Query: 345 -DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRL 402
+ P L +E L+ ED Y DK + + N +L+ H+ A R +A + L
Sbjct: 228 VNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPDAGLLPVLRLLN 284
Query: 403 GYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
+D ++S+ + PVS ER QL D A LA YP ++ ED A++
Sbjct: 285 LSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYPTSIDEDLALMA 344
Query: 460 DYNLHPKKRVATQL-VRM-EKKMLNACLQ 486
+L P R + + VR+ EK+ L+A L+
Sbjct: 345 SGSLQPGSRRQSAVRVRLGEKEALDATLR 373
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 175/389 (44%), Gaps = 39/389 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A +D++A + ++P L++T E ++ L+ +++ ++ LAL+++ EK
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTE-TARESSLGPLIKKDRILQVMANVSLALHVLCEK- 178
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFW PYI +PL + E E+ +L GS +++L + + I R+Y
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQY- 230
Query: 230 ELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV--SLARRF-- 282
+F LFQ P +P E FTF+ ++ A V + + +V S R
Sbjct: 231 ----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWA---VSTVMTRQNQVPTSDGRHLIT 281
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
AL+P+ + + D+ + + R + + ++ G + N++ LI+ GFV
Sbjct: 282 ALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAEFLIHNGFV 341
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
+N +DR+ V+ ++ D + K V R G + F VH G++ ++L +LR+
Sbjct: 342 YPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPPELLVFLRV 400
Query: 403 -----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
G + D TSE QS +S LG C V+ E L + Y ++ +
Sbjct: 401 FTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIRSYRTSIKD 460
Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
E L ++ R A QL E ++L+
Sbjct: 461 VETELQAPDMTYHSRAALQLKLAEMQILS 489
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 105/432 (24%), Positives = 181/432 (41%), Gaps = 61/432 (14%)
Query: 93 CKVILKEKPSHNEKHRPIHYVAASEDL--------QAGDAAFSVPNSLVVTLERVLGNET 144
+ I SH + + SE L AGD +VP S+ + L V N +
Sbjct: 43 VQTIWSWAQSHGIQGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ-NSS 101
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ + L +AL+L++E S W PY+ L + +++SPL WS+ E
Sbjct: 102 LGK--ACEGLKPWVAVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEE 151
Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
LA L G+ + E ++ EYN L + V +F Y TF+ FK AF
Sbjct: 152 LAELVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAF 204
Query: 264 VAVQSCVVHLQKVSLARRFALVPL------------GPP--LLAYSSKCKAMLAAVDDAV 309
++S ALVP+ G P + +S+ + D +
Sbjct: 205 GILRSRTFSPLT---GEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLL 261
Query: 310 QLVVDRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTE 360
+ + AGE +++ G + N+ L ++YGFV+ D D L + ++ +
Sbjct: 262 TVRASANFSAGEQVLMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPD 321
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--S 416
D DK + + NG F + G+ +M+ +LRL +S D+ ++++
Sbjct: 322 DRFVDDKADILEINGFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDSFLLEALFRNE 379
Query: 417 SLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ G + PVS E A+ + + KA L GY T+ +D +L +L + +A +VR
Sbjct: 380 AWGHVSLPVSRDNEEALCTSMLEGLKAALDGYSTTVEQDMELLARGDLSTRMEIAV-VVR 438
Query: 476 MEKKMLNACLQV 487
+ +K + LQ
Sbjct: 439 LGEKRVMQELQT 450
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 119/255 (46%), Gaps = 19/255 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +T + +L L ++ +A LA ++ EKK G+ S
Sbjct: 65 ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L +Q G+L + + W+E+EL + S E +++ I++++ + +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ + F Y A T +F V + V + AL+P L +
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLALIPFAD-FLNH 232
Query: 295 SSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+A++ + DD ++Q++ DR Y GE +++ G N+ L++++GF N
Sbjct: 233 DGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIPYNI 292
Query: 348 YDRLVVEAALNTEDP 362
YD++ ++ + DP
Sbjct: 293 YDQVQIQFDIPKHDP 307
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 69/251 (27%), Positives = 113/251 (45%), Gaps = 39/251 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W + ELA+L SPT E +E + IK E++ L+
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
F + D+ E + F A+ Q +P
Sbjct: 120 NVF-------PWCRDVLGEV-SLTDFMHAYSTDQ-----------------IPFA-DFFN 153
Query: 294 YSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ C+ L+ D V D+ YKAG+ I + G PNS L ++YGF NP++++
Sbjct: 154 HDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQV 213
Query: 352 VVEAALNTEDP 362
V ++ DP
Sbjct: 214 EVPMGVSLTDP 224
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 65/259 (25%), Positives = 118/259 (45%), Gaps = 22/259 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ G+ V L++T R+ + E + ++E + LAL+ + K GK S W P
Sbjct: 70 IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L RG +++ + W + EL L S + R I +++ + V
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFDLVQAV---- 177
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
+YP ++ E T E FK A+ S ++ + +VP + + S
Sbjct: 178 ---VNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITMVPF-VDMFNHDSSA 229
Query: 299 KAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
+A+LA ++ ++V D+ Y G +V+ G PNS L +++GF DNP+D + +
Sbjct: 230 RALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLPNSSLALDFGFTLPDNPHDEVQIWME 289
Query: 357 LNTEDPQYQDKRMVAQRNG 375
+ DP +K + + +G
Sbjct: 290 APSGDPLRAEKLKLLRDHG 308
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 138/298 (46%), Gaps = 36/298 (12%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+ D +L WM +NG K L + + E R + A++ Q GD S+P L++
Sbjct: 28 DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81
Query: 135 TLERVLGNETIAELL---TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T + VL + + + + T KLS + + YL+ E+ + K SFW YI+ L +
Sbjct: 82 TTQNVLSS-YLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------ 134
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ +P+ ++ E+ +L K ++ + E I Y EL ++ + S F +
Sbjct: 135 -SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFK----- 187
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLA 303
F + F+ A+ V + V++ + +S+ R +AL P LL +++ + KA
Sbjct: 188 GIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYN 246
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
V ++ K + + ++ GP N KL I YGFV N ++ VVE L+ ED
Sbjct: 247 PVSKCYEIFTCTACKKYDQMFIYYGPHDNVKLFIEYGFVLPQNQHN--VVE--LDFED 300
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/340 (23%), Positives = 153/340 (45%), Gaps = 34/340 (10%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y V Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV-------IQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
DD + V R + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDR 348
Query: 363 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
Y K V R G + VF +H E + +L +LR+
Sbjct: 349 LYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 104/417 (24%), Positives = 181/417 (43%), Gaps = 69/417 (16%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
+R +H A + ++ G+ +P ++TLE L E I +L+ + + L+ L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEIL 218
+Y++ EKK KSFW P++ L VE P+L+++ EL +L GSP ++
Sbjct: 428 IYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVK 477
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKV 276
ER E I ++Y Q IP A T + F A + S + L +
Sbjct: 478 ERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FI 522
Query: 277 SLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ R A VPL AY++ K ML A +D + G+ I
Sbjct: 523 NKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIYYS 573
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVH 385
CG + NS+ L+NYGFV ++N + + + + +D K +M+ +R K +F +H
Sbjct: 574 CGRKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFRIH 631
Query: 386 AGREKEAISDMLPYLRLGYVSDTSEMQ-----SVISSLGPIC--PVSPCMERAVLDQLAD 438
E++++ + +LR + D ++ S P+ P S E+ + ++
Sbjct: 632 INYEEKSVLEFFGFLRFVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIENEKQMWTEIHK 691
Query: 439 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
+ YP TL ED+ +L L ++ L EK++L + + M LL
Sbjct: 692 ICAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEILMYYITMADRMKKLL 748
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 128/273 (46%), Gaps = 25/273 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F+A + Q P I TE E F A+ V S + ++R +L+P +
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIPFADFMNHD 233
Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+L D+ + ++ DR Y G+ + + G N+ L++++GF N +D + +
Sbjct: 234 GLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQI 293
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 383
+ + +DP K + Q + +V+ +FH
Sbjct: 294 QMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 101/422 (23%), Positives = 180/422 (42%), Gaps = 72/422 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + A + +P S ++TLE + ET +A+ + +L L+ L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
K + SFW PY+ L Q P+ ++ +L +L GSP +I ++ +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
YN++ V F QY +F F A + S + + + + A VPL
Sbjct: 276 YNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAFVPL 321
Query: 288 G-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
P L + YS + + + D+ + DR G+ I G + NS+ L+N
Sbjct: 322 ADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRKCNSRFLLN 373
Query: 339 YGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQRNGKLSVQVFHV 384
YGFV +DN + + V A DP Q K R++ +G + F
Sbjct: 374 YGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGINEITHFL- 432
Query: 385 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCMERAVLDQLA 437
+ + + + Y+R + D +++Q +++ + P I P+ E + D +
Sbjct: 433 -----EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNELDMWDLIR 487
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 497
L+ YP TL +D+ +L +L +R L EK++L Q + M LL +
Sbjct: 488 RICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSEKMKQLLSN 547
Query: 498 VT 499
Sbjct: 548 FN 549
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 251
Query: 395 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 311
Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 312 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 98/393 (24%), Positives = 175/393 (44%), Gaps = 38/393 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ D+ + T + F AF ++S + + L+PL
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + Y+ + K A L + D L KAGE + + + + N++L ++YGFV
Sbjct: 238 HNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFV 297
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
+ + + + + DP + DK +A+ N F V G+ A ML YLRL
Sbjct: 298 ESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQTLPA--GMLQYLRL 355
Query: 403 GYV--SDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
+ SD ++S+ ++ G + PVS E + + D K+ L+G+ T+ EDE +
Sbjct: 356 VALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSALSGFSTTIEEDEKL 415
Query: 458 LTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
L + L P+ +A ++ EK++L Q+ D
Sbjct: 416 LEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 119/249 (47%), Gaps = 22/249 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP ++ +T + + + ++ T+++ + LA L+ EKK+G+KS
Sbjct: 75 ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E E + + S E +++ I++E++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F+A + Q YP I E E F A+ V S + ++ +L+P +
Sbjct: 182 -FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIPFADFMNHD 233
Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+L+ D+ + ++ DR Y G+ + + G N+ L++++GF N +D + +
Sbjct: 234 GLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTVPYNIHDEVQI 293
Query: 354 EAALNTEDP 362
+ + +DP
Sbjct: 294 QMDVPNDDP 302
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 104/405 (25%), Positives = 172/405 (42%), Gaps = 56/405 (13%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S++++ GD S+P VT + +A L+ + EL LAL+L EK + K S
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTV 234
W PY++ L+ S L ++E E L G+ E +R + K EY L
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL------ 287
+ P P + + F + + AF+ A+ Q ++ A +A+VPL
Sbjct: 233 -------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPI 284
Query: 288 --GPPLLA-------------------YSSKCKAMLAAVDDA-VQLVVDRPYKAGESIV- 324
PP ++ + +C VD A V L + AG I+
Sbjct: 285 CGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQ 344
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 383
+ + NS+L +++G VD+ +P D L+ DP Y K+ V + G Q F
Sbjct: 345 IDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQGFADKGQTFP 404
Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
V+ R + L YLR V+++ E+ +V + + VSP E L L + R
Sbjct: 405 VYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQLLMADCRDR 459
Query: 444 LAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 487
++ Y T EDE +L ++ K R A++L R EK+++ +
Sbjct: 460 MSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 121/288 (42%), Gaps = 43/288 (14%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
+E + K E +L SW+ N + LK +HN + + +QA +
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
S+P V+TL+ V + ++LT K ++L A AL+L+ E+K+ S + YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKILT-EKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P L+SE EL YL G+ + E+ E IK +Y + V
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV------ 297
Query: 241 LFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
IP F+FE F+ AF+ S V + KV + +VPL L S
Sbjct: 298 --------IPEFKSEFSFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQ 348
Query: 299 KAMLAAVDDAVQLVVDRPYKA---GESIVVWCGPQPNSKLLINYGFVD 343
+ DDA + K + I G + NSKL +NYGFVD
Sbjct: 349 EDSEWVFDDATNCFTVKALKKIQRNQQIHFSYGSKCNSKLFLNYGFVD 396
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 46/256 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ + L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTS--NPKTRLSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W ELA+L SPT E +E + IK E++ L
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLT- 118
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL-----ARRFALVPLG 288
+A Q +P+ C L +VSL A +P
Sbjct: 119 ---LANK--QVFPW---------------------CRDALGEVSLTDFMHAYSTDQIPFA 152
Query: 289 PPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ C+ L+ D V D+ YKAG+ I + G PNS L ++YGF N
Sbjct: 153 -DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAVASN 211
Query: 347 PYDRLVVEAALNTEDP 362
P++++ V ++ DP
Sbjct: 212 PHEQVEVPMGVSLTDP 227
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 168/381 (44%), Gaps = 31/381 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ G+ VP + + V +E I + + L +AL+L+ EK +
Sbjct: 87 LVAKRDIAKGETVLQVPKRFWINPDAVAESE-IGNVCS--GLKPWISVALFLLREKWR-D 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L + + +S + WSE EL+ + G+ + + + ++ E+ +++
Sbjct: 143 DSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
+ Q +P+ I + F F I + +AF +++ ++ + L A V
Sbjct: 196 EEVILRNK--QLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTE 253
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
A+ + A L + D L KAG+ + + + + N+ + ++YGF++ +
Sbjct: 254 DH--AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSNADMALDYGFIEPSSA 311
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
D + ++ D Y DK +A+ NG F + G+ M+PYLRL +
Sbjct: 312 RDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT--MIPYLRLVALGG 369
Query: 408 TSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
T SV LG PVS E + + D K+ L+GY T+ EDE ++ +
Sbjct: 370 TDAFLLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSGYHTTIEEDEKLMEE 427
Query: 461 YNLHPKKRVATQLVRMEKKML 481
NL + ++A + EK++L
Sbjct: 428 GNLSTRLQIAVGIRLGEKRVL 448
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 102/451 (22%), Positives = 182/451 (40%), Gaps = 78/451 (17%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW +NG C+ + + + A+ D++A + +P ++
Sbjct: 75 REDHFPELMSWAKENG-ASCECFTVANFG-----KEGYGLRATRDIKAEELFLWIPRKML 128
Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+T+E N + L + +++ + LAL+L+ E+ SFWLPYIR L ++
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----------NELDTVWF---- 236
+ PL + + ++ L G+ ++L + + R+Y L +V
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFA 239
Query: 237 -----MAGSLFQQYPY--------DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
M G LF Q+ IPTE + +V+LA
Sbjct: 240 SLTPVMGGKLFDQWAVSSVMTRQNQIPTEDGS-------------------RVTLA---- 276
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+PL + DD + V + YK E I ++ G + N++ +I+ GF
Sbjct: 277 LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFYGTRSNAEFVIHNGFFF 336
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 403
+D+ +DR+ ++ ++ + Y K V R G + VF +H E + +L +LR+
Sbjct: 337 QDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPPISAQLLAFLRVF 395
Query: 404 ---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
Y+ + + + PVS E + L L Y T ED
Sbjct: 396 CMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLKTYKTTSEED 455
Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ML +L R+A QL EK++L L
Sbjct: 456 RSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 133/306 (43%), Gaps = 42/306 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA--ELLTTNKLSELACLALYLMYEKKQ 170
AAS DL AG A ++P+S ++T L + T T L E + L+L+YEK
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254
Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
G +S W P + L + G L + +P W +E A L G+P + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
E R++ L F A L + +P PTE +T F+ A A + + +Q ++
Sbjct: 314 ARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVG 367
Query: 280 RRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAGESIVVW 326
PP A+L AV DDA+ L + R +AGE I V
Sbjct: 368 GASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVS 427
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR-----NGKLSVQV 381
G + N++LL+ YGF DNPYD + L+ E PQ + + + A R KLS+
Sbjct: 428 YGAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRERVLHRAKLSLSP 483
Query: 382 FHVHAG 387
V G
Sbjct: 484 HSVRCG 489
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 109/424 (25%), Positives = 176/424 (41%), Gaps = 50/424 (11%)
Query: 83 SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+W K G K IL +KP + AS D+Q G++ VP++ V++
Sbjct: 46 AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V T+ +L ++ L LAL L+ E+ KS Y L G +P
Sbjct: 96 NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLWSE E L G+ + + + +L LF P P FT
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLP 201
Query: 258 IFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
F A AV+S + K++LA LV L A ++K + + Q+ V
Sbjct: 202 NFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVV 258
Query: 314 --DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
R + GE++ + P + + +L++YG +D +P + L+ D DK +
Sbjct: 259 EATRAIRKGEALGMDYAPGKLDGPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKADI 318
Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCM 428
+ G + + +++ +M+ +LRL + D ++S+ + VS
Sbjct: 319 VEGAGLRPSMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSEGN 371
Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNACLQ 486
E AV LA+ +A LAGYP TL +D A L + R A LVR+ EK+ L+A +
Sbjct: 372 EEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAVAR 431
Query: 487 VTAD 490
D
Sbjct: 432 FFED 435
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 113/252 (44%), Gaps = 25/252 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIV 324
DR Y+ G I+
Sbjct: 289 TDRSYQPGIYIL 300
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 251
Query: 395 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 311
Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 312 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 10 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 62 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 174
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H E +
Sbjct: 175 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 233
Query: 395 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
+L +LR+ +++ S + + + PVS E + L D L
Sbjct: 234 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 293
Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 294 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/338 (23%), Positives = 153/338 (45%), Gaps = 30/338 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 12 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 66 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y V Q +P+
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 169
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 170 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 229
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 230 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 289
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
K V R G + VF +H E + +L +LR+
Sbjct: 290 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 118/288 (40%), Gaps = 46/288 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + EL + S E + R I+ E+NE+ +V F +D + A
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDT 198
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
V+ V+ + +A A QL D+ Y A
Sbjct: 199 DNGSTEVEEVKVYSDCLYIALFCA--------------------------QLFADKNYAA 232
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
GE + + GP N+ L +++GF NP+D++ + ++ D ++K
Sbjct: 233 GEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 97/430 (22%), Positives = 189/430 (43%), Gaps = 45/430 (10%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW+ +NG V +K NE + + A++DL+ + ++P +++
Sbjct: 81 REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134
Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+TL+ V + L N + +A LAL+L+ E + + SFW YI L
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD- 248
+ L + + + SPT + + I R+Y +F +LFQ +
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQY-----AYF--NNLFQNSNDEA 238
Query: 249 --IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAM 301
I FT+++++ A V + + S + L+PL + +++ +
Sbjct: 239 SLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVENGINGLIPLWD-MCNHTNGYLST 297
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
VD + L +P+K GE ++++ G + NS L++ GFV ++NP+D + ++ D
Sbjct: 298 QYKVDRSECLAC-KPFKKGEQVLIFYGERSNSDFLVHNGFVYDENPHDSFRLRLGISKSD 356
Query: 362 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY--VSDTSEMQSVISSLG 419
+ + + + G F++++G E ++L +LR+ V + + +S S L
Sbjct: 357 KLHGLRCELLKDLGIPDSGDFYLYSGSEP-VRENLLAFLRIFNMDVENLNHWKSHSSRLS 415
Query: 420 PI----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+ C + +E V + L D L Y E E + D N +R+ ++
Sbjct: 416 DLMWKDCALDTKIESKVWNFLYDRINLLLKTYKG--DEVEVRVEDSNSTECRRLVRAQLK 473
Query: 476 MEKKMLNACL 485
EKK L++ L
Sbjct: 474 CEKKFLSSIL 483
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 96/394 (24%), Positives = 171/394 (43%), Gaps = 40/394 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ + T + F AF ++S + + L+PL +
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 238 HNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGF 296
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
V+ + + + + DP + DK +A+ N F + G+ A ML YLR
Sbjct: 297 VESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLR 354
Query: 402 LGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
L + D ++S+ ++ G + PVS E + + D K+ L+G+ T+ EDE
Sbjct: 355 LVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEK 414
Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
+L L P+ +A ++ EK++L Q+ D
Sbjct: 415 LLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 109/440 (24%), Positives = 180/440 (40%), Gaps = 57/440 (12%)
Query: 84 WMHKNGLPPC-KVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL-G 141
W+ K G+ K+ LK+ S V A+ D + + F +P + V+ + V G
Sbjct: 16 WLSKIGVRINPKMTLKDLKSEGRGR----GVVAAADFEEDEVVFCIPRTAVLNVNNVFAG 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
++ A ++ L +M E +Q S W PY+ L ++ ++S + WS
Sbjct: 72 QDSGASKEALLQMPNWLALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWS 123
Query: 202 ETELAYLTGSPTKAEILERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEA 253
E ELA L S +I + E + + L F S+ Y +DIP E
Sbjct: 124 EEELAELQASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE- 182
Query: 254 FTFEIFKQAFVAVQSCVVHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
E KQ + L +K L+ ++PL L A + + A + +
Sbjct: 183 ---EPAKQENGGAEGETDDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYEN 235
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAA----LNTED 361
+ +++ +P AGE I G P S LL YG+V E+ YD + + +A L TE
Sbjct: 236 EDLEMRTIKPIMAGEEIFNDYGQLPRSDLLRRYGYVTENYAQYDVVEISSASIKSLMTEK 295
Query: 362 PQ---------------YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
PQ +++ +A R G L A E+ AI D L L ++
Sbjct: 296 PQEIQSGQFLDPLTSAEAEERVALADREGILEDSYDVNIANAEERAIPDELLALLYLFLL 355
Query: 407 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 466
D ++++++S + S V L + R A Y TL EDE +L NL +
Sbjct: 356 DNENLEAIVTSQSALPSRSKLATELVGKVLVKVLRHREAEYATTLEEDEKLLQAANLPRR 415
Query: 467 KRVATQLVRMEKKMLNACLQ 486
+A Q+ EK++L ++
Sbjct: 416 TAMAIQVRHGEKRVLRLAVE 435
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 101/438 (23%), Positives = 182/438 (41%), Gaps = 50/438 (11%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
+ + FT+E + A V + ++ + SL AL+P+ SK
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHEDSKITTD 288
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
A + + R +K E I + GP+ NS ++ GFV DN D + ++ D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISYGPRTNSDFFVHSGFVYMDNEQDGFKLRLGISKAD 348
Query: 362 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--L 418
P ++++ + + +V F + G E ISD +L +LR+ + E+ I S +
Sbjct: 349 PLHKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKEELAHWIQSDRV 405
Query: 419 GPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
+ + +E V + + + RL A YP TL ED +L + L K++A QL
Sbjct: 406 NDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLR 464
Query: 475 RMEKKMLNACLQVTADMI 492
EK++L L+ I
Sbjct: 465 VTEKRILQGALEYVQQWI 482
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 171/388 (44%), Gaps = 36/388 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ + T + F AF +++ + ++ F + P +
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIK 235
Query: 292 ---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
AY K A L + D L KAGE + + + + N++L ++YGFV+ +
Sbjct: 236 TEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPK 294
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS- 406
+ + + DP + DK +A+ N F + G+ A ML YLRL +
Sbjct: 295 RNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGG 352
Query: 407 -DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
D ++S+ ++ G + PVS E + + D K+ L+G+ T+ EDE +L
Sbjct: 353 PDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGK 412
Query: 463 LHPKKRVATQLVRMEKKMLNACLQVTAD 490
L P+ +A ++ EK++L Q+ D
Sbjct: 413 LEPRLEMALKIRIGEKRVLQQIDQIFKD 440
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 164/391 (41%), Gaps = 40/391 (10%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+ + A +L G+ VP L + + V + + L ++L ++ E
Sbjct: 39 PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+
Sbjct: 99 ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ ++ AG + D+ TF+ F AF ++S V + + AL+P
Sbjct: 152 DNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFA 201
Query: 289 PPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLI 337
+ + K K L D L K+GE I V + + N++L +
Sbjct: 202 DLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELAL 260
Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
+YGF + ++ D + ++ DP Y+DK +A+ NG F V G + M+
Sbjct: 261 DYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQMI 318
Query: 398 PYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 450
YLRL + T V L PVS E ++ + + K+ LA Y T
Sbjct: 319 TYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHTT 376
Query: 451 LSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
+ EDE +L +L + ++A ++ EKK+L
Sbjct: 377 IEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 132/306 (43%), Gaps = 41/306 (13%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEK----HRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+ SW+ + G + + S + + HRP + AG+ +LV+T
Sbjct: 41 EFMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRP---------MCAGECMIKFSQNLVLT 91
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E+ L E IA L N+ + ++ L +M EK++G+ S W PYI L G+ +
Sbjct: 92 PEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYIECLP---SFGE--IH 142
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + W ELA L SP ER ++ EY E+ V S Y D+ +
Sbjct: 143 STIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----S 194
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-D 314
E FK + V S S ++PL + + + + + DD +VV
Sbjct: 195 LEQFKHEYATVSSRAWGQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVAS 250
Query: 315 RPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-R 368
R Y+ G+ + + G N+ L ++YGFV DNP+D + + +EDP + K +
Sbjct: 251 RDYQTGDENFHLKVHICYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQ 310
Query: 369 MVAQRN 374
+AQ N
Sbjct: 311 YMAQNN 316
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 120/275 (43%), Gaps = 37/275 (13%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
HRP + AG+ LV+T E+ L E IA L N+ + ++ L +M
Sbjct: 72 HRP---------MCAGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
EK++G+ S W PYI L G+ + S + W ELA L SP ER ++
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
EY E+ V S Y D+ + E FK + V S S ++P
Sbjct: 174 EYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTMIP 222
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLINYG 340
L + + + + + DD +VV R Y+ G+ + + G N+ L ++YG
Sbjct: 223 L-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALDYG 281
Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 374
FV DNP+D + + +EDP + K + +AQ N
Sbjct: 282 FVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 116/444 (26%), Positives = 172/444 (38%), Gaps = 87/444 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
A D+QAG+ F VP L T + R + EL + LA L LY
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179
Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKA 215
+ +G SFW PY+ L +G ES PL LW+E E+ +L GSPT
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQ 274
++REY E L++++P+ E AF E F AF + S V L
Sbjct: 240 SARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLP 293
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VD 306
+ ALVP L +S+ C + + A D
Sbjct: 294 AENGM--LALVPYAD-LANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSD 350
Query: 307 DA------------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
DA V DR Y E + V G + N++LL+ YGFV + NPY
Sbjct: 351 DAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFVSDRNPY 410
Query: 349 DRLVVEAALNTEDPQ----YQDKRMVAQRNGK--LSVQVFHVHAGREKEAISDMLPYLRL 402
+ + V +L+ + KR G+ + F ++A R + +L + L
Sbjct: 411 NSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELMQLLRFASL 470
Query: 403 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
+ S + + PV+ E A L K L YP + ED+A L D +
Sbjct: 471 --TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSADEDDAALKDKS 528
Query: 463 ----LHPKKRVATQLVRMEKKMLN 482
L K+R++ +L R EK++L
Sbjct: 529 MAQLLSRKQRLSVRLRRSEKRILE 552
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 97/449 (21%), Positives = 175/449 (38%), Gaps = 84/449 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A +D+ A F+VP S +V +E L +T E+ + + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RA 221
L+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I + A
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNA 643
Query: 222 EGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
E + + EL + GS Y +D F+
Sbjct: 644 EEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQN 693
Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
+ + A+ +VP+ +L ++ A + DDA+ + R KAG
Sbjct: 694 EDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAG 752
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED------- 361
E I+ + GP PNS+LL YG+V + P+ D L L++E
Sbjct: 753 EEILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLDSAREH 812
Query: 362 ---PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 418
++++ ++ + + + + + + E D+ L+ M I +
Sbjct: 813 LDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLKAIRKV 864
Query: 419 GPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
P C V V L A + YP T+ EDE +L+ NL +++ A + E
Sbjct: 865 DPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVTVRLGE 924
Query: 478 KKMLNACLQVTADMIMLLPDVTVSPCPAP 506
K++L + ++ + D + PAP
Sbjct: 925 KRLLQEARVLLSE---IASDAILDDAPAP 950
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/277 (25%), Positives = 117/277 (42%), Gaps = 35/277 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
+ A+E+++ G++ +P S ++T+ER + + L E + LA +L +
Sbjct: 24 LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
S + Y+R L R+ G L W E ++ L GSP++ +ER +
Sbjct: 82 AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+E+ +P P + AF + S ++ L A ALVP
Sbjct: 135 IDEIRA----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPW 176
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 345
+L + C A + AV L DR YK GE + GP+P+S+LLI+YGF +
Sbjct: 177 AD-MLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGE 235
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
NP D V ++ D K +R G V+ F
Sbjct: 236 NPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 89/407 (21%), Positives = 171/407 (42%), Gaps = 51/407 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A + + A + +P S ++TLE + T+A+ + +L L+ L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ SFW PYI L P+ ++ ++L +L GSP +I ++ ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
N++ V F QY F F A + S + + ++ + A VPL
Sbjct: 277 NDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLA 322
Query: 289 -------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
P L + YS + + + D+ ++ G+ I G + NS+ +NY
Sbjct: 323 DMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRKCNSRFFLNY 374
Query: 340 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
GFV E N + + + + DP Q K + + + + F + ++ A+ D + +
Sbjct: 375 GFVVEGNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDETAVIDFMSH 433
Query: 400 LRLGYVSDTSEMQSVISSLG-------PICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
+R + D ++++ +++ P+ E + + K L YP T
Sbjct: 434 IRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKTLKQYPTTFE 493
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 499
+D+ +L L +R L EK++L Q + M LL +
Sbjct: 494 QDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 97/409 (23%), Positives = 165/409 (40%), Gaps = 36/409 (8%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A L +G VP L + E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +SF+ P+I + + P+ W+E EL L G+ + ++R++
Sbjct: 95 GGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHLERDFEN 147
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ + F +PT T + + A + S VS ++ V L P
Sbjct: 148 IHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSKGGKYLHV-LCPA 198
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPY---------KAGESIVVWCGPQPNSKLLINYGF 341
+ ++ + +DD V ++ AG ++ + G N+KLL +YGF
Sbjct: 199 MDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQYSNAKLLYSYGF 257
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYL 400
V +N + + DP ++ K+ V N Q + H + + +L L
Sbjct: 258 VSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLLSNDVDERLLATL 317
Query: 401 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
R+ +++ Q + I V E AV + L + +L+ Y TL EDEA+L +
Sbjct: 318 RVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYATTLEEDEAILAE 375
Query: 461 YNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYA 508
K R+A + VRME K QVT +I L S P A
Sbjct: 376 TETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPDA 418
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 99/438 (22%), Positives = 197/438 (44%), Gaps = 56/438 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
KSW+ ++G+ + ++E S E V A +D++ ++P ++T E
Sbjct: 83 FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136
Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +A L+ N++ + CLALYL E+ S + PYI + ++ +
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---F 254
L + E+ YL G+ + + + + I R++ L V+ GS ++ +P +A F
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAF 246
Query: 255 TFEIFKQAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKC 298
TF+ ++ AV + H+ V AL+P+ GP AY+
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--- 303
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
A+ ++ + + +K GE + + G + NS LLI+ GFV +++P+D++ + ++
Sbjct: 304 -----ALTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVS 358
Query: 359 TEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEM 411
+DP Y K +++ + N ++S Q +L +LR+ ++++ +
Sbjct: 359 QKDPLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQ 418
Query: 412 QSVISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 468
++ +SSL I V + V + L + K L G+ E M+ D +L + +
Sbjct: 419 KNELSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSK 478
Query: 469 VATQLVRMEKKMLNACLQ 486
+A Q E ++L+AC+
Sbjct: 479 LALQFRIEEHRILSACVN 496
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 91/408 (22%), Positives = 172/408 (42%), Gaps = 51/408 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + + + VP S ++TLE + +T +A+ + +L L+ L+ +L+ E
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE--MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQE 247
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
KK + SFW PY+ L + P+ +++++L +L GSP ++ ++ +K++
Sbjct: 248 KK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKD 300
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
Y ++ V A Q +F+ F A + S + + + + A VPL
Sbjct: 301 YCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAFVPL 346
Query: 288 GPPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
L YS + + + D+ ++ G+ I G + NS+ L+N
Sbjct: 347 ADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSKCNSRFLLN 398
Query: 339 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 398
YGFV +DN + + V + Q K +++ + + F + + SD +
Sbjct: 399 YGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPNDVSFSDFMS 457
Query: 399 YLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKARLAGYPATLSE 453
++R + + E +++ I P +S E A + + + L YP TL +
Sbjct: 458 FIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALNQYPTTLEQ 517
Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 501
D +L L +R L EKK+LN Q + M L + S
Sbjct: 518 DLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 95/427 (22%), Positives = 172/427 (40%), Gaps = 68/427 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ + E + EL + L + L L +M
Sbjct: 694 IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 754 YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 806
Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
R +L + GS Y +D+ E + +V
Sbjct: 807 FRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGWV 866
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + V++A +L ++ A + D+ + + R KAGE I+
Sbjct: 867 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 916
Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
+ GP PNS+LL YG++ E + P+D V+ +L +E QD ++ + K
Sbjct: 917 NYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQD--IMTETMDK 972
Query: 377 L------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPVS 425
+ + V +G + P + G D E +++ I L G +
Sbjct: 973 MDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKVDGNLISDK 1032
Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ + + + + + Y T++EDE +L +L ++R+A ++ EKK+L
Sbjct: 1033 RKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVRLGEKKLLQEAF 1092
Query: 486 QVTADMI 492
++M+
Sbjct: 1093 DHFSEMV 1099
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 93/382 (24%), Positives = 160/382 (41%), Gaps = 40/382 (10%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
+L G+ VP L + + V + + L ++L ++ E +G S W
Sbjct: 52 NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE----- 159
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL------ 291
AG + D+ TF+ F AF ++S V + + AL+P +
Sbjct: 160 AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFADLINHDGDI 214
Query: 292 ----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDN 346
+ K K L D L K+GE I V + + N++L ++YGF + ++
Sbjct: 215 TSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNS 273
Query: 347 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
D + ++ DP Y+DK +A+ NG F V G + M+ YLRL +
Sbjct: 274 SRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLG 331
Query: 407 DTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
T V L PVS E ++ + + K+ LA Y T+ EDE +L
Sbjct: 332 GTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLE 389
Query: 460 DYNLHPKKRVATQLVRMEKKML 481
+L + ++A ++ EKK+L
Sbjct: 390 REDLQSRHQIAVEVRVGEKKVL 411
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 179/426 (42%), Gaps = 33/426 (7%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
D D+ W NG KV+L++ + + PI + A ED++AG+ S+P +L+
Sbjct: 25 DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
RV + + ++ + + + LYL+ E+ S W P+++ L +
Sbjct: 78 TRVSDHSPVVHMIENTTIGRITAICLYLISERADSS-SHWKPWLQSLPPRFFHA------ 130
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIP 250
L +SE ++ + S K + + +++EY + F P ++
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVT 189
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVD- 306
E FT+E F+ A+ V + + + R VPL GP ++ + + D
Sbjct: 190 REDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDA 249
Query: 307 DAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
+ V +K ++ I + G N +LL N GF+ ++N + ++++ L+ +
Sbjct: 250 QEHRCVFSALHKVAKNSPISIGVGMSSNMELLANRGFMMQNNGNNFVLMKFQLDRNSDMH 309
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 424
R + LS + +V R E +L LR+ +S E S +L PV
Sbjct: 310 ASARESMMKQLNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGKALA--TPV 364
Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 484
+ E L + LA YP T+ EDE +LT R A L+R E+K++
Sbjct: 365 TLENEWRAYRLLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRREEKLIYES 423
Query: 485 LQVTAD 490
++ A+
Sbjct: 424 IKTWAN 429
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 122/282 (43%), Gaps = 28/282 (9%)
Query: 81 LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L +WM NG K+ L++ P V A E L G+ +P SL+++
Sbjct: 31 LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84
Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L +++ T KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 85 L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ T L + E+ R I+R + +L + + G + ++ + FT+
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTW 192
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F A+ AV + + Q + + + AL P L + K A V + ++
Sbjct: 193 NNFVWAWTAVNTRCIFAQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEI 251
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ + + A E + + GP N +L ++YGFV DNP D +VV
Sbjct: 252 LSHKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 293
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/382 (20%), Positives = 170/382 (44%), Gaps = 40/382 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
+ A + +Q G+ +P +L+++++R + NE + L+E L ++L+ + G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KS W Y L R+ + W+ ++ +L GS T L E IK ++ L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
+ F L +YP I F ++ A + S + LQ + ++ +LVP
Sbjct: 212 EKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADFM 262
Query: 290 ---PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
P K + + + + + D+ Y + I G + N +LL+ YGF+ E N
Sbjct: 263 NHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELLLLYGFILERN 322
Query: 347 PYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 403
P+D + + +L+ +D ++ K+ ++ +++ +F+ +E + +LR
Sbjct: 323 PFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE------LYEFLRFC 376
Query: 404 YVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AMLT 459
+S+ E+ S +S + +E+ + + + L Y +SE++ ++ +
Sbjct: 377 -ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKVSEEKILNSLNS 435
Query: 460 DYNLHPKKRVATQLVRMEKKML 481
++ + +++A + + EKK++
Sbjct: 436 NFLISKNQKMALKQSKCEKKII 457
>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
Length = 106
Score = 70.9 bits (172), Expect = 2e-09, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPI 110
+LKE+PS + KH+PI
Sbjct: 91 MLKERPSPDGKHKPI 105
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 100/438 (22%), Positives = 181/438 (41%), Gaps = 50/438 (11%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
+ + FT+E + A V + ++ + SL AL+P+ +SK
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHENSKITTD 288
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
A + + R +K E I + G + NS ++ GFV DN D + ++ D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISYGARTNSDFFVHSGFVYMDNEQDGFKLRLGISKAD 348
Query: 362 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--L 418
P +++ + + +V F + G E ISD +L +LR+ + E+ I S +
Sbjct: 349 PLQKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKEELAHWIQSDRV 405
Query: 419 GPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
+ + +E V + + + RL A YP TL ED +L + L K++A QL
Sbjct: 406 NDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLR 464
Query: 475 RMEKKMLNACLQVTADMI 492
EK++L L+ I
Sbjct: 465 VTEKRILQGALEYVQQWI 482
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 125/310 (40%), Gaps = 24/310 (7%)
Query: 52 RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
RF R S+ VS + L++W+ K C + L+ P
Sbjct: 49 RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
A + G VP ++T E + + L+ + L+ + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNE 230
+ SFW PYI L ++ +L P+LWS+ +L GSP ++ ER I RE E
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223
Query: 231 LDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------AR 280
+ AG L + P I A + + +S ++LQ L
Sbjct: 224 A-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLRPGSFAED 279
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPN-SKLL 336
ALVP L SS + D L R Y GE + GP + S+LL
Sbjct: 280 TIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQVFDSYGPSCSPSRLL 339
Query: 337 INYGFVDEDN 346
++YGFVDE+N
Sbjct: 340 LDYGFVDEEN 349
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 102/409 (24%), Positives = 169/409 (41%), Gaps = 57/409 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE----LACLALYLMYEK 168
V A+ D+ + FS+P + V L + +A L + +L E L LM E
Sbjct: 42 VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
Q + S W PY+ L + ++S + WS++ELA L S +I ++
Sbjct: 96 -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147
Query: 222 ----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQ 274
+G+K E+ S+ Y +DIP + + A V +
Sbjct: 148 YIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDE 204
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
K L+ ++PL L A + + A L ++ +++ +P GE I G P S
Sbjct: 205 KTILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSD 260
Query: 335 LLINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQDKRMVAQRNGKL 377
LL YG+V D + YD L+V N T+D + + +A+R G
Sbjct: 261 LLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTRLELAEREGVY 319
Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
VH+ ++ +I D L + D S ++++++S I S LA
Sbjct: 320 EDSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKLTTELAGQVLA 379
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
+AR Y TL EDE +L + +L + +A Q+ EKK+L A +Q
Sbjct: 380 TLLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/378 (21%), Positives = 160/378 (42%), Gaps = 36/378 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +Q G+ +P +L+ L++ L E +E L+ L+E LA+ + E+ G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W Y L +++ + W +++ +L GS E IK ++ ++
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP----- 289
F L P + F + ++ A + S + LQ + ++ ALVP
Sbjct: 210 IFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVPYADFINHN 260
Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
P K + + ++ + + D+ Y + I G + N +LL+ YGF+ E NP+D
Sbjct: 261 PFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGFIIERNPFD 320
Query: 350 RLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
+ + AL+T+D Y K ++ +++ VF+ +E + ++RL
Sbjct: 321 SIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------LYEFMRLCLSG 374
Query: 407 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN---L 463
S+L + +E+ + + K L Y T++E++ + N L
Sbjct: 375 PRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEKILNNLSNIIVL 434
Query: 464 HPKKRVATQLVRMEKKML 481
++ + + + EKK+L
Sbjct: 435 TKNQKTSIKQRKCEKKIL 452
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 117/280 (41%), Gaps = 32/280 (11%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
EE+ L W +NG+ V K +P+ + R + A E + + + SVP L+
Sbjct: 47 EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99
Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T LE +GN + K S L L+LMYEK K SFW PYIR L
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
+P ++ EL +L + + E+ IK+ Y + + Q + +
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF 208
Query: 250 PTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AML 302
FE FK A+ V + V H + A+ AL PL LL + K +
Sbjct: 209 -WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRF 266
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+ ++ V Y+ G + + GP N++L + YGFV
Sbjct: 267 NSSSKNYEINVITEYQKGTQVFINYGPHDNTRLFLEYGFV 306
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 105/257 (40%), Gaps = 43/257 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ G+ SVP SLVV + R+L E N L
Sbjct: 59 VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV K L
Sbjct: 163 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGL 216
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKL 335
+ + P + + K + AVD + P K GE I V G N++L
Sbjct: 217 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQL 272
Query: 336 LINYGFVDEDNPYDRLV 352
L++YGFV E+NP+D +V
Sbjct: 273 LMDYGFVLENNPHDAVV 289
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 167/394 (42%), Gaps = 40/394 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ T + F AF ++S + + L+P +
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
A+ K A L + D L KAG+ + + + + N+ L ++YGF
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGF 318
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
+++ + + + + D + DK +A+ NG F + E+ MLP+LR
Sbjct: 319 IEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLR 376
Query: 402 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
L + T ++S+ S G + PVS E + + + +A L+GY T+ EDE
Sbjct: 377 LLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK 436
Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
L + NL + R+A + EK++L +Q+ D
Sbjct: 437 -LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 167/394 (42%), Gaps = 40/394 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ T + F AF ++S + + L+P +
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
A+ K A L + D L KAG+ + + + + N+ L ++YGF
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGF 318
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
+++ + + + + D + DK +A+ NG F + E+ MLP+LR
Sbjct: 319 IEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLR 376
Query: 402 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
L + T ++S+ S G + PVS E + + + +A L+GY T+ EDE
Sbjct: 377 LLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK 436
Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
L + NL + R+A + EK++L +Q+ D
Sbjct: 437 -LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 96/433 (22%), Positives = 172/433 (39%), Gaps = 82/433 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ F++P ++ +E GN+ E + L L L
Sbjct: 44 IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++YE QG S W Y L + ++ + W +L YL GS ++I +
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR- 281
L + AG F Q P+E+ ++ + + + L+ +
Sbjct: 157 DEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQE 215
Query: 282 -------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
+VP+ +L ++ A + +D + + RP KAGE I+ + G
Sbjct: 216 DEEWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYYG 274
Query: 329 PQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK--RMVAQRNGKLS 378
P PNS+LL YG+V + P+D +V++ L TE + D+ + VA+
Sbjct: 275 PHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVWKQVAEHVDPED 331
Query: 379 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---------------GPICP 423
++ V E S+ G++ +++Q V + L G + P
Sbjct: 332 LEDVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLKAIKKVRGDLIP 384
Query: 424 -------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
V C+ + L +L LA YP T EDEAML N+ ++++A ++
Sbjct: 385 DKRKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTSRQKLAVEVRLG 437
Query: 477 EKKMLNACLQVTA 489
EK+++ LQV
Sbjct: 438 EKRLIKEALQVAG 450
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 107/401 (26%), Positives = 177/401 (44%), Gaps = 36/401 (8%)
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
+ RP+ + AS D Q GD FSVP+S ++ E V + +L L +AL L
Sbjct: 65 QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ ++ KS Y + +++PLLWSE EL L G+ ++L+ G
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARR 281
+ T + LF P P FT F A AV+S + K++LA
Sbjct: 171 LTFFRS--TFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALA-- 226
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGP-QPNSKLLI 337
L L A +SK A + Q++V R + GE + + GP + + +L+
Sbjct: 227 -PLTELVSHRRAANSKLSVRSAGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLV 285
Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
+YG +D +P + + D DK + + N V+++ +++ +ML
Sbjct: 286 DYGVMDVTSPKPGYSLTLKMPDSDRFIDDKLDILESNDLPQSVVYNLTP--DEQPTIEML 343
Query: 398 PYLRLGYV--SDTSEMQSVISS--LGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLS 452
+LRL + SD ++S+ + G + PVS E AV + L++ +A L GY T+
Sbjct: 344 AFLRLMQLKGSDAFLLESIFRNDVWGFMQEPVSEGNEEAVCNTLSEGARAALGGYGTTID 403
Query: 453 EDEAMLTDYNLHPK--KRVATQLVRM-EKKMLNACLQVTAD 490
+D A L K +R A L+R+ EK+ L+A + D
Sbjct: 404 QDLAELRAQGSRAKGSRREAALLIRLGEKEALDAVARFFED 444
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 95/435 (21%), Positives = 181/435 (41%), Gaps = 42/435 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
SK+ + +G +W+ +NG + E P ++ + A + + +P
Sbjct: 75 SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128
Query: 132 LVVTLERVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
L+ ++ + EL+T + ++A LA+ L+ E+ + + S W PY+ L
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
+ L + ++ L GSPT L++ I R+Y+ + V+ +
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAA 304
D+ FT+E + A V + + +R AL+P+ + + A
Sbjct: 235 LRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNA 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
+ + R +K GE I + GP+ NS ++ GFV +N D + ++ D
Sbjct: 291 TSNYCECYALRDFKKGEQIFISYGPRTNSDFFVHSGFVYMENKQDGFKLRLGISKADSLQ 350
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPI 421
+++ + + +V F + G E ISD +L +LR+ + +E+ I S + +
Sbjct: 351 KERIELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FSMRKAELAHWIRSDRVNDL 407
Query: 422 CPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
+ +E V + + + RL A YP TL ED +L + L K++ QL E
Sbjct: 408 KHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPQIKKLTIQLRVTE 466
Query: 478 KKMLNACLQVTADMI 492
K++L L+ I
Sbjct: 467 KRILQGALEYVEQWI 481
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 64/261 (24%), Positives = 117/261 (44%), Gaps = 25/261 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
FQ P+ FT++ F A V S + + +L+P L +
Sbjct: 178 -------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSLIPFAD-FLNH 222
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+A++ + DD ++ DR Y GE +++ G N+ L++++GF N YD++
Sbjct: 223 DGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQVQ 282
Query: 353 VEAALNTEDPQYQDKRMVAQR 373
++ + DP K + Q+
Sbjct: 283 IQYDIPKYDPLRHTKLELLQQ 303
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 104/254 (40%), Gaps = 43/254 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ AG+ SVP SLVV + R+L E N L
Sbjct: 45 VVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENAL------ 98
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 99 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHYFARQR 148
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV K L
Sbjct: 149 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKSGL 202
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKL 335
+ + P + + K + AVD + P K GE I V G N++L
Sbjct: 203 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQL 258
Query: 336 LINYGFVDEDNPYD 349
L++YGFV E+NP+D
Sbjct: 259 LMDYGFVLENNPHD 272
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 163/388 (42%), Gaps = 40/388 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI-AELLTTNKLSELACLALYLMYEKKQG 171
+ A+ DL G+ VP L + + V ++ A L +AL L+ E +G
Sbjct: 83 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGGLRPWVAVALLLLSEVARG 142
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--N 229
S W PY+ L RQ S A L S + ++L G+K EY +
Sbjct: 143 ADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGVK-EYVQS 189
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVP 286
E D+V S + D+ + TF+ F AF ++S V + K++L LV
Sbjct: 190 EFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRGDKLALVPFADLVN 245
Query: 287 LGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
P + + S K K + + L K+G+ I + + + N++L ++YGF
Sbjct: 246 HSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDLDKSNAELALDYGF 304
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
V+ + D V ++ DP Y DK +A+ N F + ++ MLPYLR
Sbjct: 305 VESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL--DEPLPPQMLPYLR 362
Query: 402 LGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
L + T SV L P+SP E ++ + D K+ LA Y T+ ED
Sbjct: 363 LLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACKSALAAYHTTIEED 420
Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLN 482
E + NL P+ +A + EKK+L
Sbjct: 421 EELSERENLQPRLTIAIGVRAGEKKVLQ 448
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 126/289 (43%), Gaps = 51/289 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++D++ GD FS+P S++++ + ++EL ++LS + L L +MYE ++
Sbjct: 41 VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW PY L R+ +P+ W++ +L L G+ ++I + E + +NEL+
Sbjct: 97 DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR--------- 281
+ ++YP + T E+F + + S LQK
Sbjct: 149 PI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEE 201
Query: 282 ---------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
++VP+ L + A L D++Q+ + K GE I
Sbjct: 202 EEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIKEGEQIYNT 261
Query: 327 CGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 369
G N+ LL YGFVDE N +D L+VE +D +++++
Sbjct: 262 YGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 106/447 (23%), Positives = 178/447 (39%), Gaps = 93/447 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
+ A +D+ A F++P ++ +E + + ++ + +L + L L
Sbjct: 610 IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------A 215
LMYE QG+KS W PY L + ++P+ WSE+EL L S + A
Sbjct: 670 VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADA 722
Query: 216 EILERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
E + R G R ++L + GS Y +D+ + E
Sbjct: 723 ESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDG 782
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+V + + +VP+ +L ++ A + ++++ + RP KAGE
Sbjct: 783 WVEDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEE 832
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
I + GP PNS+LL YG+V E + YD VVE + + V + N +S QV
Sbjct: 833 IFNYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VMRLNFGISGQV 883
Query: 382 FHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-----------------IC 422
H E+E D R +T E+ S + GP +
Sbjct: 884 LEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPEDLQEQLKTFLK 938
Query: 423 PVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
V A+ D+ LA +A + YP + SED +L +L + R+A +
Sbjct: 939 GVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQDLSQRTRMAIE 998
Query: 473 LVRMEKKMLNACLQVTA--DMIMLLPD 497
+ EKK+L + T+ D+ M + D
Sbjct: 999 VRLGEKKLLQEAIASTSSVDVEMTVDD 1025
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 111/262 (42%), Gaps = 38/262 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
+ A+E ++ G+ +P ++T+E L + E +L E + LA +L
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
E K + Y++ L R G S L W E+++ L GSP+ LER +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
E+ + Q+ +DI +F S ++ L+ SL AL
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLAL 272
Query: 285 VPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
VP +L + C+A L V L DR Y+ GE + G +P+S+LLI+YGF
Sbjct: 273 VPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFA 331
Query: 343 DE--DNPYDRLVVEAALNTEDP 362
DNP D + ++ EDP
Sbjct: 332 PAVGDNPDDEYALNLQIDEEDP 353
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 106/233 (45%), Gaps = 27/233 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA L+ EKK G+KS W+PYI L + + S + W E EL+ + S E ++
Sbjct: 2 LAAVLIREKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVK 56
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ I+++++ F+A + Q P I TE E F A+ V S + +
Sbjct: 57 QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 333
+R +L+P + +L D+ +Q+ DR Y G+ + + G N+
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNA 163
Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 383
L++++GF N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 164 TLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 132/292 (45%), Gaps = 35/292 (11%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 93 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 200
Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--V 309
F + +C +V + + +L+P + + ++L +D
Sbjct: 201 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 248
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
+++ DR Y GE +++ G PN+ LL+++GF N YD++ ++ + D
Sbjct: 249 EVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 300
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 15/209 (7%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 273
ER + +++Y+ +LD + F +QY +++ A T I +AF A V S V
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+ AL+PL L + K A D+ + L+V + AG+ I GP+ N
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNE 274
Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDP 362
+LLINYGF NP D +V + + P
Sbjct: 275 QLLINYGFCIAGNPTDYRIVLLGVKPDSP 303
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 134/298 (44%), Gaps = 40/298 (13%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 94 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 201
Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---- 307
F + +C +V + + +L+P + + ++L +D
Sbjct: 202 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 249
Query: 308 ----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
++++ DR Y GE +++ G PN+ LL+++GF N YD++ ++ + D
Sbjct: 250 ESSSTLEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 307
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 106/460 (23%), Positives = 186/460 (40%), Gaps = 50/460 (10%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V+ R +SK++ +L W+ NG K+ L+E + + R +H + + L
Sbjct: 18 TPVSPPRNGMSKEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVP 71
Query: 122 GDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFW 176
G+ +P ++T+E +G +T I L + +A L ++L+ + + + SF+
Sbjct: 72 GERILVIPKKCLITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFF 129
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y L P+ WSE EL++L GS +I ER I+++Y+ + V
Sbjct: 130 RNYYSTLPSTLS------NMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-- 181
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
D F+ + F A + V S L + + ALVP L Y
Sbjct: 182 -----------DPSFARFSLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRP 229
Query: 297 K-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------D 349
+ DA + G + G + N + L+NYGF EDN +
Sbjct: 230 RETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPN 289
Query: 350 RLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 408
++++ L+ D Q + DKR +G ++ + + + RL + T
Sbjct: 290 EVLIDFQLSPADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVAT 346
Query: 409 SEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNL 463
E S + P P+S E L+ L + +L+ Y T+ ED +L Y L
Sbjct: 347 EEEFSTMKMKSPAHSSPPISFDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPL 406
Query: 464 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 503
+ A +R EK++ Q AD ++ L + ++ C
Sbjct: 407 FSNRIQALFFIRGEKQVCRY-FQELADKVIPLFSLPLAEC 445
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 122/304 (40%), Gaps = 50/304 (16%)
Query: 81 LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L W H +G P ++ EK + + +P +A+E L + A + P S+ ++
Sbjct: 12 LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67
Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
L + I LT N L YL+ + +GK S W PYI
Sbjct: 68 NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
L + A+ P W+E ++ L G+ I E + +K EY + + GS
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185
Query: 241 --LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--- 295
+ Q Y+ FT F+ + + +S ++++ L+P G + +S
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LLPEGAKIDDFSILQ 237
Query: 296 -----------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
++ L + A +L+ Y+ G+ + G + NS+LL+ YGFV E
Sbjct: 238 PLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNYGKKTNSELLLGYGFVTE 297
Query: 345 DNPY 348
+N Y
Sbjct: 298 NNDY 301
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 96/427 (22%), Positives = 172/427 (40%), Gaps = 68/427 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ E + + +L + L + L L +M
Sbjct: 43 ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 155
Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
R +L + GS Y +D+ E + +V
Sbjct: 156 FRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGWV 215
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + V++A +L ++ A + D+ + + R KAGE I+
Sbjct: 216 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 265
Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQD------KRMV 370
+ GP PNS+LL YG++ E + P+D V+ +L +E QD +RM
Sbjct: 266 NYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQDIMAETMERM- 322
Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPVS 425
++ + V +G + P + G D E +++ I L G +
Sbjct: 323 -DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKLDGNLISDK 381
Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ + + + + + Y T++EDE +L +L ++R+A Q+ EKK+L
Sbjct: 382 RKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVRLGEKKLLQEAC 441
Query: 486 QVTADMI 492
++M+
Sbjct: 442 DHFSEMV 448
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 95/209 (45%), Gaps = 30/209 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
R E I E+ ++ + LF + + E FK + V S + +++
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214
Query: 280 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
LV + P+L + +S K + + + DR Y + I + G N++
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAE 272
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQ 363
L ++YGF +NPYD E L T+ P+
Sbjct: 273 LALDYGFTVPENPYD----ETDLLTQFPE 297
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 142/352 (40%), Gaps = 58/352 (16%)
Query: 160 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L +YL++++K G SF+ PY L P+ WS EL L GS ++I
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+R + I+ +Y + V G+L T + FK A + V S LQ +
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216
Query: 279 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
R ALVP L Y + K V + + +AG + G + N + L+
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLL 276
Query: 338 NYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 391
NYGF EDN + + +E ++ D +QDK R E
Sbjct: 277 NYGFAVEDNRELDGFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETN 322
Query: 392 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 451
IS G V+ Q+V S+G P S E + + RLA YP T+
Sbjct: 323 QIS--------GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTI 371
Query: 452 SEDEAMLTDYNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 497
S+D A L D +P+ +R A VR EK++L+ + DM+ + D
Sbjct: 372 SQDMADLQDEASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 102/260 (39%), Gaps = 39/260 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 63 ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI+ L + + + +LW EL + S E +E E K+E+ L
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEFLALKPA 174
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
LF + A + V + QK
Sbjct: 175 LDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK------------------- 215
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
D +++ DR Y GE +++ G N+ L +N+GF N YD+ ++
Sbjct: 216 ------------DVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFTLARNIYDQALIR 263
Query: 355 AALNTEDPQYQDKRMVAQRN 374
+ +DP Y+ K + Q++
Sbjct: 264 IDMPVQDPLYKKKLDIWQKH 283
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 101/436 (23%), Positives = 175/436 (40%), Gaps = 61/436 (13%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + ++ LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + K +S L L +L+ EK G++S W PY+ L + P
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCP 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + Y + SLF + I F +
Sbjct: 140 VC-LEQDVVQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYS 194
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAV 309
+ A+ + + +++ K S F+L P L P LL +S + KA
Sbjct: 195 ALEWAWCTINTRTIYM-KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNY 253
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 369
++ + K E + + GP N +LL+ YGFV DNP+ + V + + DK
Sbjct: 254 EIQTNSQCKKYEEVFICYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDK-- 311
Query: 370 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 429
Q+N KLS+ H D+L L G+ + + + + L C
Sbjct: 312 --QKNAKLSILKEH-----------DLLENLTFGWDGPSWRLLTALKVLSLGGDEFTCWR 358
Query: 430 RAVLDQLADYFKARLAGYPATLSEDEAM-----LTDYNLHPKKRVATQLVRMEKKMLNAC 484
RA+ L D AR +E +A+ + + + + V Q+ +M++ N
Sbjct: 359 RAL---LGDVISAR--------NEQQALNITTKICHFLIEETQHVLLQISQMKRDKENLK 407
Query: 485 LQVTADMIMLLPDVTV 500
Q+T + L D+ +
Sbjct: 408 EQLTLVEALRLEDLKI 423
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 100/442 (22%), Positives = 176/442 (39%), Gaps = 50/442 (11%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ NG K+ L+E + + R +H + + L G+ +P ++T+E
Sbjct: 44 ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95
Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+G +T I L + +A L ++L+ + ++ + SF+ Y L
Sbjct: 96 MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
P+ WS+ EL +L GS +I ER I+++Y+ + V D F
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARF 196
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVV 313
+ + F A + V S L + + ALVP L Y + DA +
Sbjct: 197 SLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITS 255
Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQD 366
G + G + N + L+NYGF EDN + ++++ L+ D Q + D
Sbjct: 256 LGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYD 315
Query: 367 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---P 423
KR +G ++ + + + RL + T + S + P P
Sbjct: 316 KRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEDEFSSMKMKSPAHSSPP 372
Query: 424 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKML 481
+S E L L D +L+ Y T+ ED +L Y L + A +R EK++
Sbjct: 373 ISFDNEIRALQYLRDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVC 432
Query: 482 NACLQVTADMIMLLPDVTVSPC 503
Q AD ++ L + ++ C
Sbjct: 433 RY-FQELADKVIQLFSLPLAEC 453
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 131/317 (41%), Gaps = 63/317 (19%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
G L W ++G P ++ L ++ + P +A+E LQ G AA S P + +
Sbjct: 10 FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65
Query: 135 TLERVLGNETIAELLTTNKLSELACL------------------------ALYLMYEKKQ 170
+ L + + +++ ++ A L +L+ E +
Sbjct: 66 SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
GK SFW PYI L Q+AV + P W + ++AYL G+ I E E +KRE+
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182
Query: 230 ELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLARRFAL 284
+ A L ++ + D+P A+T ++K AF S L + R AL
Sbjct: 183 Q-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRRLSAL 233
Query: 285 VPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+P G PLL ++ +V D +L+ PY+ G + G +
Sbjct: 234 LPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQVYNNYGLK 293
Query: 331 PNSKLLINYGFVDEDNP 347
NS+LL+ YGF+ + P
Sbjct: 294 TNSELLLAYGFILPETP 310
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 142/342 (41%), Gaps = 60/342 (17%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+H++G V + E + + A++D G +VP ++++ E+
Sbjct: 88 SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140
Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
++E + + L + LAL+L+ EK SFW PYI L + + L
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT---- 251
++ ELA L SP L+ I R+Y N++ T+ D+P
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNL 239
Query: 252 -EAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
+ FTF+ ++ A V Q+ +V +L A +PL C +
Sbjct: 240 QDIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITT 290
Query: 308 AVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
L ++R Y+ E I ++ G +PNS L ++ GFV DN YD L + ++
Sbjct: 291 DFNLELNRGECYALQDYRRDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPN 350
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
D K + + G V F ++ G ++ ++L ++R+
Sbjct: 351 DALRNGKVNLLNKLGLSGVTNFSLYKGASPISV-ELLAFIRI 391
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 91/196 (46%), Gaps = 23/196 (11%)
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 368
+ R + AGE +++ GP+ N LL YGFV++DNP D + ++ D +D R
Sbjct: 40 VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99
Query: 369 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 413
++ + GKL V V GR EKE ++P RL V D E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158
Query: 414 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN--LHPKKRVA 470
SL +SP E A D L G+ TL+EDEA L+ L +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218
Query: 471 TQLVRMEKK-MLNACL 485
RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 114/248 (45%), Gaps = 11/248 (4%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E+FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 12 ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
V + ++AGE I ++ G + N++ +++ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 72 CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 131
Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI- 421
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 132 LARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFTLGNSE 190
Query: 422 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
PVS E + L D L Y T+ ED+A L L P+ +A +L EK++L
Sbjct: 191 YPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLGEKEIL 250
Query: 482 NACLQVTA 489
++ A
Sbjct: 251 EKAVRSAA 258
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/425 (20%), Positives = 161/425 (37%), Gaps = 65/425 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ A F++P ++ +E GN+ E + L L L
Sbjct: 42 IIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 101
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
++YE QG S W PY L + + + W ++L L GS ++I E A
Sbjct: 102 VMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEA 154
Query: 222 EGIKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ + R EL + GS+ Y +D+ E +
Sbjct: 155 DEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEE 214
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
++ + L V +A +L ++ A + DD + + RP A
Sbjct: 215 DDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVNHGDDELTVTALRPIPA 264
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALNTEDPQYQDKRM 369
GE I+ + GP PNS+LL YG+V + P+D + V L D ++ +
Sbjct: 265 GEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLKIGDDVWKQVQE 324
Query: 370 VAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISSL----GPICPV 424
V V +G + E + +R ++++V+ ++ G + P
Sbjct: 325 YVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKAIKKINGDLIPD 384
Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 484
+ + + L+ YP + EDEA+L +L ++R+A + EKK+L
Sbjct: 385 KRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLKEA 444
Query: 485 LQVTA 489
L+
Sbjct: 445 LEFAG 449
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 122/282 (43%), Gaps = 24/282 (8%)
Query: 79 GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
G L +WM NG K+ L++ P V A E L G+ +P +L+++
Sbjct: 30 GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83
Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L + + ++ + KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 84 TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 136
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ L + E+ I+R + +L + + G + ++ + FT+
Sbjct: 137 PVFLRRKVFESLP-KDLREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTW 193
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F A+ AV + + Q + + + AL P L + K A V + ++
Sbjct: 194 NNFVWAWTAVNTRCIFAQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEI 252
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ + + A E + + GP N +L ++YGFV DNP D +VV
Sbjct: 253 LSHKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 294
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 116/248 (46%), Gaps = 11/248 (4%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 23 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 82
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 83 CVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 142
Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPI 421
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 143 LARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSE 201
Query: 422 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
PVS E + L D L Y T+ ED+++L +++L + ++A +L EK++L
Sbjct: 202 FPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEIL 261
Query: 482 NACLQVTA 489
++ A
Sbjct: 262 EKAVKSAA 269
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 95/443 (21%), Positives = 179/443 (40%), Gaps = 79/443 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A +D+ A F++P ++ E + I ++ +K L + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE QG S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690
Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R EL + GS Y +D+ + E ++
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799
Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 360
+ GP PNS+LL YG+V E + P+D +VE+ L E
Sbjct: 800 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILEQIRGEFEE 857
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 420
+ +++D ++ + G+++ + + D+ L+ ++ ++Q S P
Sbjct: 858 EEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIKKLQ---SDTIP 913
Query: 421 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
+ +AVL + + AR YP ++SED+ +L NL + R+AT + EKK+
Sbjct: 914 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARMATVVRLGEKKL 970
Query: 481 LNACLQVTADMIMLLPDVTVSPC 503
L + ++ + + D P
Sbjct: 971 LQEAIATFSEDVEMTMDDESGPA 993
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 170/410 (41%), Gaps = 53/410 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
+ A +D+ A F+VP S +++ E + ET E+ + + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYE +G +S W PYI L + E+P+ WS+ EL L S T++++ +A
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKV-GKASAE 648
Query: 225 KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 271
+ +++ V LF Q Y D + +++F+ +
Sbjct: 649 EMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEE 708
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
+++ +VP+ +L ++ A + DDA+ + R KAGE I + GP P
Sbjct: 709 WVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIFNYYGPHP 767
Query: 332 NSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
NS+LL YG+V + + YD + V A+L Q DK + +L
Sbjct: 768 NSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARECLDSDELEDTF 826
Query: 382 FHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCMERA-------V 432
E + R + + +++S++ ++ P S +R +
Sbjct: 827 VLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVDKRKRDEIQHNI 886
Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
L + D +R YP ++SEDE +L ++ ++R A + EK+++
Sbjct: 887 LIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLIQ 933
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 101/226 (44%), Gaps = 26/226 (11%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 334
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 14 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 66
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
L ++ GFV E+N YD + ++ DP Q + + GKLS+ + R+ +
Sbjct: 67 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 123
Query: 395 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 446
D +L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 124 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 182
Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 183 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 102/437 (23%), Positives = 172/437 (39%), Gaps = 59/437 (13%)
Query: 83 SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
+W+ NG K+ L + N + A EDL + FSVP S ++T E
Sbjct: 16 NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E + + + LS + +AL + G +S W PY L ++ ++ + WS
Sbjct: 72 GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120
Query: 202 ETELAYLTGSPTKAEI----------------LERAEGIKREYNE-LDTVWFMAGSLFQQ 244
+ EL +L GS +I +ER K NE L + GS
Sbjct: 121 DRELRHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMA 180
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
Y +D+ T + + + + L K +VPL L A + + A L
Sbjct: 181 YGFDLETSSDQAKNDGEEWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFY 233
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN---------PYDRLVVEA 355
DD V + +P KAGE + G P + LL YG++ DN P D + A
Sbjct: 234 EDDKVVMKTIKPVKAGEELYNDFGSLPRADLLRRYGYL-TDNYAQYDVVEIPADLIKERA 292
Query: 356 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
L T+D ++ A+ G L A E+ + L L +E + V
Sbjct: 293 GLRTQD--VDERWQYAEEQGVLDDGYDVSRASSEEGQFPEELCVLLNLLALPRAEFEKVK 350
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-VATQLV 474
+ P + +L + Y R A YP + + M +D +L+ ++R +A ++
Sbjct: 351 NKDKIPKPDLTTNAKKLLRTILVY---RYAAYPGNVDQ---MHSDVSLNDRRRKMAIVVI 404
Query: 475 RMEKKMLNACLQVTADM 491
+ EK++L + +++
Sbjct: 405 QGEKQVLQEAVDAISEI 421
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 101/226 (44%), Gaps = 26/226 (11%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 334
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 503
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
L ++ GFV E+N YD + ++ DP Q + + GKLS+ + R+ +
Sbjct: 504 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 560
Query: 395 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 446
D +L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 561 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 619
Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 620 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 665
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 39/249 (15%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 77 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+P L + +AML DD ++V DR + GE +++ G N+ L++++GF
Sbjct: 228 IPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFA 286
Query: 343 DEDNPYDRL 351
N +D++
Sbjct: 287 LPYNIHDQV 295
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 102/450 (22%), Positives = 182/450 (40%), Gaps = 75/450 (16%)
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LS 155
N ++ A ED+ A F++P ++ +E + I + +K L
Sbjct: 38 NAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLD 97
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
+ L L ++YE QG+ S W PY L + ++P+ WS+ EL L S +
Sbjct: 98 SWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRH 150
Query: 216 EILE-RAEGIKRE------------YN-------ELDTVWFMAGSLFQQYPYDIPTEAFT 255
+I + AE + ++ +N EL + GS Y +D+ +
Sbjct: 151 KIGKADAENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLENDE-- 208
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ V S+ +VP+ +L ++ A + ++++ + R
Sbjct: 209 -----EEEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLR 259
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL--NTEDPQYQ 365
P KAGE I+ + GP PNS+LL YG+V E + P+D +VE+AL N P
Sbjct: 260 PIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIP--- 314
Query: 366 DKRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISS 417
+++ Q G L V G + P D E +++ +
Sbjct: 315 -GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKG 373
Query: 418 LGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
+ P + +R + Q LA +A +A YP ++SEDE +L +L+ + R+A +
Sbjct: 374 IKKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLKQ-DLNQRTRMAIAV 432
Query: 474 VRMEKKMLNACLQVTADMIMLLPDVTVSPC 503
EKK+L + ++ + + D P
Sbjct: 433 RLGEKKLLQEAITASSGDVEMTMDDESGPA 462
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 101/437 (23%), Positives = 180/437 (41%), Gaps = 59/437 (13%)
Query: 68 REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
RE V+ E D+ +L W+ G K++L++ + + R +H +L G+
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158
Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
+P + ++T+E +G +T I + + + + +A L LYL+ + ++ +F+ Y
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P+ WS+ EL++L GS +I ER I+++Y+ +
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA------ 264
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
D F+ E F A + V S L + + ALVP L Y + +
Sbjct: 265 -------DPSFSRFSLERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETS 316
Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVV 353
D + +G + G + N + L+NYGF EDN + ++V
Sbjct: 317 WTFDQKLDGFTITSLESICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMV 376
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDT 408
+ L+ D Q ++ + L + R + SD P R G+ ++ T
Sbjct: 377 DFQLDPGDGQ-----LLYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAAT 429
Query: 409 SEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNL 463
+ S + P P+S E A L+ L +L YP +L E EA+L +Y L
Sbjct: 430 EDEFSSMKMRSPAHASPPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPL 489
Query: 464 HPKKRVATQLVRMEKKM 480
+ + A +R EK++
Sbjct: 490 YSNRIQALFFIRGEKQV 506
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
M +F +YP P E F E FK +F + S +V L S+ + ALVP ++ +S
Sbjct: 73 MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129
Query: 297 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLV 352
+ + L + +V DRPY+ GE + + G + N +LL++YGFV ++ NP D +
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVE 189
Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
+ +L D Y++K + ++ G Q F
Sbjct: 190 LSLSLKKSDESYKEKLELLKKYGLSGSQCF 219
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 98/213 (46%), Gaps = 23/213 (10%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
ER + + +Y D+ + G+ +P E +T+E++ A + S + +S
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211
Query: 279 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
A + AL+PL L + K A D + L+V + AG+ I GP
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGP 270
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
+ N +LLINYGF NP D +V + + P
Sbjct: 271 RNNEQLLINYGFCIAGNPTDYRIVHLGVKPDSP 303
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 108/451 (23%), Positives = 187/451 (41%), Gaps = 57/451 (12%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
VA V S+ + D L +W+ G+ KV ++ P R VAA ED+
Sbjct: 44 VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
GDA ++P SL +T++ L + A + +AL L++E+ G+KS W Y+
Sbjct: 99 GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L Q +++PL WS E+A LTG+ ++L+ A G + T + S
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY--DSYARGTWARLKES 202
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKCK 299
F P P++AF F AF ++S C V ALVP +A S
Sbjct: 203 AFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVDQGADIALVP--GLDMANHSGLS 257
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVV-------------------WCGPQPNSKLLINYG 340
+ +++ V K+G S+++ + + +++L ++YG
Sbjct: 258 SQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYG 317
Query: 341 FVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
F D V+ A+ DP DK V + G F + A + E ++ +
Sbjct: 318 FTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFVLRAFEDPE--PELRVF 375
Query: 400 LRLGYV--SDTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSED 454
+RL + D ++++ + G I PVS E+ + + + L GY + +D
Sbjct: 376 MRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMINGCEEALRGYATRVEDD 435
Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ D + + R+A ++ EK+ L L
Sbjct: 436 RRVAEDPGVGHRLRLAARVRMGEKQALADAL 466
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 116/251 (46%), Gaps = 25/251 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V D+ + S P++L + + + A+ +T ++ A L ++L E +GK
Sbjct: 43 VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW PY+ L R+ ++PL +S+ +L +L G +A +E + I RE E
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA----LVPLG 288
A S+ Q+ YD TE +T+E++ A S + + R +P+
Sbjct: 151 ----AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPIL 204
Query: 289 PPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL+ + A + D +++++ AG + GP+ N +LL+ YGF
Sbjct: 205 FPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEVYNNYGPKANEELLMGYGFTLLQ 264
Query: 346 NPYDRLVVEAA 356
NP+D +++++
Sbjct: 265 NPFDSFLLKSS 275
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 160/392 (40%), Gaps = 45/392 (11%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 98 DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L V+ PL W E+E+ + L GSPT + + RA + EY +
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREEYASIRAAIA 210
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--------- 287
+ Y+ TE F A V + V L ++ +ALVPL
Sbjct: 211 ADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANV---YALVPLVDLLPVVGA 263
Query: 288 -----GPPLLAYSSKCKAMLAAV-----DDAVQLV-VDRPYKAGESIVVWCGP---QPNS 333
P A + + + AAV D A + V V A ++ V C +
Sbjct: 264 PPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPVVCADALGRNAG 323
Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEA 392
L ++ G V+ + D L + D Y K+ + + G Q F V A R
Sbjct: 324 DLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQAFPVFADRMP-- 381
Query: 393 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
+ Y+R V + SE+ +V I VSP E +L L + LA Y +
Sbjct: 382 -LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAREMLAEYENSSE 438
Query: 453 EDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 483
E E + L + N+ ++ A +L EK+++NA
Sbjct: 439 EFELLQLKETNISERQMTAAKLRLGEKRLINA 470
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 125/302 (41%), Gaps = 50/302 (16%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+D L W+ +NG ++L + P V ++ +L+ GD S+P +L++T
Sbjct: 4 DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57
Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
V+ N + + + T K L+ L+LYL+ EK +GK SFW PYI+ L +
Sbjct: 58 TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTS 109
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+P +S E+ L +A + R + ++ Y L T LF +
Sbjct: 110 YTTPSYFSTAEVDALPALVREATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTV 163
Query: 254 FTFEIFKQAFVAVQSCVVH-------LQKVSLARRFALVPL-----GPPLLA------YS 295
FT + ++ A+ V + V+ S +AL P PL+ S
Sbjct: 164 FTLKSYRWAWATVYTRSVYKRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVS 223
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 355
SKC ++ + + + + P N +LL+ YGFV NP+ + A
Sbjct: 224 SKC----------YEVKTEGACRKYRQVFINYDPYDNGRLLMEYGFVMPRNPHSVVTFTA 273
Query: 356 AL 357
A+
Sbjct: 274 AV 275
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 66/279 (23%), Positives = 118/279 (42%), Gaps = 41/279 (14%)
Query: 78 LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+G+L SW+ + G P V+ L P+ + +++AGD +P++L++T
Sbjct: 21 IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74
Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ V EL ++ LS L ++++ E+ G+KS W PYI +
Sbjct: 75 DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI------------ 117
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P ++ + P + A+ I R E V+ F+ D+
Sbjct: 118 -PDVFDGLQCRKSVRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----N 166
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
FE F A+ AV + ++++ L P LL + K + V++ + +
Sbjct: 167 FETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNV 221
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
Y+AG + + G N L +NYGFV ++NP D + VE
Sbjct: 222 GYEAGSEVFIGYGSHDNRTLFLNYGFVLDENPNDCITVE 260
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 122/302 (40%), Gaps = 51/302 (16%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G I E + + R V AS+D+ + F +P + ++++E +L
Sbjct: 13 AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S ++ ERA+ +E L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
M GSL Y +D+ T E+ ++ A + L K +VPL L
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
A + +C A L D +++ +P +AGE I GP P S LL YG+V DN V
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDV 291
Query: 353 VE 354
VE
Sbjct: 292 VE 293
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 94/443 (21%), Positives = 176/443 (39%), Gaps = 79/443 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A D+ A F++P + +E + I ++ +K L + L L +
Sbjct: 43 IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE +G S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155
Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R +EL + GS Y +D+ + E ++
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264
Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 360
+ GP PNS+LL YG+V E + P+D +VE+ L
Sbjct: 265 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILKQIRGEFEG 322
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 420
+ +++D ++ + G+++ + + D+ L+ S ++ V S P
Sbjct: 323 EEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLKGIKKVQSDTIP 378
Query: 421 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
+ +AVL + + AR YP ++SED+ +L NL + R+AT + EKK+
Sbjct: 379 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARMATVVRLGEKKL 435
Query: 481 LNACLQVTADMIMLLPDVTVSPC 503
L + ++ + + D P
Sbjct: 436 LQEAIATFSEDVEMTMDDESGPA 458
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 119/292 (40%), Gaps = 59/292 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + I L L + LMYE+ +GK
Sbjct: 34 VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L RQ P LWSE E+ L G+ + E +K ++ E
Sbjct: 90 GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 141
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
L ++ P + P + FTFE +++A +S V ++ + +VPL
Sbjct: 142 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 191
Query: 291 LLAYSSKCKA-----MLAA-----------VDDA---------------VQLVVDRPYKA 319
++ K A ML A +DD +++V+ + A
Sbjct: 192 --LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAA 249
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 369
G I G N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 250 GSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 96/432 (22%), Positives = 176/432 (40%), Gaps = 40/432 (9%)
Query: 88 NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
+ LPP L+ +H+ K P+ V A++ + + FS+P L+++ E
Sbjct: 75 DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131
Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ TI + + LS++ +AL + + + KSFW PY+ L + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPYDIPT 251
+ ++ E+ L GSP + L I R+Y+ V +L + Y+
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYR 243
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
A + + +Q + Q + K L AL+PL +
Sbjct: 244 WAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPLWDFCNHQDGQFSTEFQLESRRTVC 303
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 371
R + GE + ++ G + ++ I+ GFVD +N +D L ++ L+ DP + +
Sbjct: 304 QAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNAHDALTLKVGLSKSDPLAGQRATLL 363
Query: 372 QRNGKLSVQ------VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 425
+ LS + F + AG + +L +LRL ++ S + + S
Sbjct: 364 CKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLRLFCMTKDSLDRWLQSDNASNLMHE 422
Query: 426 PC-MERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
C +E V D+ + KAR L YP T D ML + +L +R+ L EK++
Sbjct: 423 ECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLKMLEE-DLSSHRRMCVLLRLAEKRI 481
Query: 481 LNACLQVTADMI 492
L + ++ A I
Sbjct: 482 LLSAIECAAQRI 493
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 121/249 (48%), Gaps = 27/249 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP S + + +L +++LL +++ +A LA+ L+ ++K G++S
Sbjct: 51 ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q G + S + WS++EL + S E +++ I++++ + V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ +P + + TF+ F A+ V+S + + +L+P L +
Sbjct: 163 -------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNH 207
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+A++ +D ++ DR Y E +++ G N+ LL+++GF N +++
Sbjct: 208 DGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQ-- 265
Query: 353 VEAALNTED 361
VE +N D
Sbjct: 266 VEIQINIPD 274
>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
Length = 206
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G
Sbjct: 10 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67
Query: 390 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 443
E++ MLPYLRL + T ++++ + G + PVS E A+ + + K+
Sbjct: 68 -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126
Query: 444 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 112/451 (24%), Positives = 176/451 (39%), Gaps = 65/451 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
++ + D L +W+ + G V ++ P + A+ D+ GDAA V
Sbjct: 73 ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P +L +T E ++ I L LAL L++EK G S W YIR L R
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPY 247
A+++PL WS ELA L G+ A ++ + L F +LF
Sbjct: 186 ----ALDAPLFWSSEELAELAGTQLLANAAGYDSYVRGTHAALKETTFKEHPALFGDAGD 241
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-------KA 300
D AF+ F AF ++S L V AL+P G + + C
Sbjct: 242 DDGGGAFSEREFLWAFGVLRSRA--LPPVDQGESIALIP-GIDMANHDGLCSQTWQLNNG 298
Query: 301 MLAAV---------DDAVQLVVDRP----YKAGESIVVWCGP-QPNSKLLINYGFVDEDN 346
+AAV +V L V++ K GE I GP +S+ ++YGFVD
Sbjct: 299 GIAAVFGGRGGADGGGSVLLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFC 358
Query: 347 PYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
V+ ++ +D DK V G F + A ++ +M+ ++RL +
Sbjct: 359 SRPGYVLGPLSIPEDDVNAFDKMDVLSVAGLKESPAFTIRA--FEDPPPEMVVFMRLLNL 416
Query: 406 SDTS----------EMQSVISSLGPICPVSPCMERAVLDQLADY-FKARLAGYPATLSED 454
+ E ++IS PVSP D AD + L Y + +D
Sbjct: 417 KNDDAFLLEAIFRQECWALISD-----PVSP-------DNEADAGCEEALGAYATKIEDD 464
Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
+ D + P+ R+A ++ EK+ L L
Sbjct: 465 RGVADDADASPRLRLAARVRMGEKQALEEVL 495
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/394 (24%), Positives = 158/394 (40%), Gaps = 59/394 (14%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL---LTT-----NKLSELA 158
HR + + A+ED++ G+ F+VP L++ + E L T N S
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKWLDTHGKSLNDSSGWL 184
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEI 217
L + LM+E Q K SFW Y+ + G PL W E E G P +I
Sbjct: 185 PLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDI 238
Query: 218 LERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 270
+ E I+ EY E ++ + GSL E ++ E FK+ AFV S
Sbjct: 239 IVDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFT 288
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+ S+ VP+ +L + S A L +Q++ R K GE + G
Sbjct: 289 EDEESPSM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKL 341
Query: 331 PNSKLLINYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLS 378
N++LL YG+V+ N YD L++ + +DP K + R G
Sbjct: 342 GNTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAE 401
Query: 379 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQ 435
V F + D++ +L++ + SD E++ ++ + P S + + L +
Sbjct: 402 VDAFFMFDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSK 460
Query: 436 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
+ K L ++ED+ + N + +K V
Sbjct: 461 KTE--KNSLGMTVIDITEDDTEMDIENFNKRKNV 492
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 92/411 (22%), Positives = 171/411 (41%), Gaps = 38/411 (9%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A + L +G +P L + +E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SF+ P+I L + P+ WSE+EL L G+ + ++R++
Sbjct: 95 RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQLQRDFEN 147
Query: 231 LDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ ++ + +P +PT T E + A + S + + R L P
Sbjct: 148 IHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR-VLCP- 197
Query: 288 GPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ + + +L + D+ Q++ V + AG ++ + G N+KLL +YGF
Sbjct: 198 AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKLLFSYGF 257
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYL 400
V ++N + + DP + K+ V N Q + E + +L L
Sbjct: 258 VAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFENDVDERLLATL 317
Query: 401 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
R+ +++ E++ + +S E AV + L + + +LA + TL EDEA+L +
Sbjct: 318 RVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFATTLEEDEAILAE 375
Query: 461 YNLHPKKRVATQL-VRMEKKM--------LNACLQVTADMIMLLPDVTVSP 502
R++ + VR+E K L QV A + + P T P
Sbjct: 376 MATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTTRP 426
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 129/301 (42%), Gaps = 30/301 (9%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ +P+ + +L L GSP L+ I R+Y ++ ++ P
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSND-----PA 240
Query: 248 D-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
I + FT+E ++ A ++ Q+ V S AL+PL S +
Sbjct: 241 SVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSALIPLWDMFNHRSGRLSTDFV 299
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 363
+ D Y A E + ++ G + N+ L++ GFV DN +D + + ++ DP
Sbjct: 300 KSSNVCVCYADGDYAADEQVYIFYGVRTNADFLVHNGFVYPDNEHDAVKIRLGVSRSDPL 359
Query: 364 Y 364
Y
Sbjct: 360 Y 360
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 131/309 (42%), Gaps = 32/309 (10%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK+ K LKEK + K RP + + ++ LQ G+ S+P ++T + V
Sbjct: 29 HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + T +S L L +L+ EK +KS W PY+ L + S
Sbjct: 89 L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + ++ + +F LF P D+ + F ++
Sbjct: 140 PVCLEQKIVNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQ 195
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQ 310
FK A+ + + V H Q+ +R L P LL + + + KA +
Sbjct: 196 AFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYE 255
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
+ + + GP N +LL+ YGFV DNP+ + V ++ +DK
Sbjct: 256 ITTVTQCHHYNEVFICYGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDK--- 312
Query: 371 AQRNGKLSV 379
QR KLS+
Sbjct: 313 -QRPKKLSI 320
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 122/290 (42%), Gaps = 26/290 (8%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+E L WM ++G CK + ++ R + ++L+ GD+ +P L+V
Sbjct: 40 DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95
Query: 135 TLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + +L E + ++ K + + +L+ E+ +GK SFW PYI L +
Sbjct: 96 TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148
Query: 193 AVESPLLWSETELAYLTGSPT--KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+P S T+ A PT ++ + + + I+ + ++ F QY
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIFFS 206
Query: 251 TEAFTFEIF----KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--SKCKAMLAA 304
++F + F + ++ C K S FAL P LL +S ++ A
Sbjct: 207 LDSFVWAWFVINSRSVYIEPSGCEAFDPKAS--DDFALAPFLD-LLNHSPGAEVTAGFDP 263
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
V + ++ Y A + + + GP N LL+ YGFV NP+D + E
Sbjct: 264 VSNCYRIKTLDSYHAYDQVFIHYGPHDNVNLLLEYGFVIPSNPHDAVSFE 313
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 116/275 (42%), Gaps = 43/275 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ASED++ + F +P S +++E + I ++ KL+ L LY+M K G
Sbjct: 42 VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
+ W PY L Q ++S ++W++ EL L GS +I G + +Y E
Sbjct: 97 MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQEKL 147
Query: 231 -------------LDT---VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
DT + G L Y +D P ++F+ + L
Sbjct: 148 KPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDEDIEHDDLY 202
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWCGPQPNS 333
L + A+VPL L A++ C A L A DD + +P K GE + G PN
Sbjct: 203 NEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYNTYGELPNC 260
Query: 334 KLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 367
L YG+V +E +D +VE +++ Y +K
Sbjct: 261 DFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 155/379 (40%), Gaps = 55/379 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
VAA +D+ + FS+P S ++++E + + I T L L L ++YE G
Sbjct: 40 VAAKQDIAEHELLFSIPRSSILSVENSILSTEIPPT-TFALLGPWLSLILVMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E EL L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
ERA+ + +E + + M GSL Y +D+ A + + +
Sbjct: 152 VIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
A + L K +VPL L A + +C A L D +++ +P +AGE I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEI 263
Query: 324 VVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKLS 378
GP P S LL YG++ E+ YD + + A L ++ D + +KR+ ++
Sbjct: 264 FNDYGPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEKRIEYLDEQEIV 323
Query: 379 VQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 435
+ + A +E++S L L + + E + + S G + P + M +
Sbjct: 324 DTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PKAEKMTGKAAEI 381
Query: 436 LADYFKARLAGYPATLSED 454
L +AR+A YP TL +D
Sbjct: 382 LYKIVQARIAQYPTTLEQD 400
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/369 (21%), Positives = 156/369 (42%), Gaps = 51/369 (13%)
Query: 81 LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L+ W+ + +G+ KV L+ P V A+ L+ G+ F +P S + E V
Sbjct: 29 LRIWLEEEHGVDMSKVDLQRSPLEG------LGVFANRRLEPGETLFMIPKSCCIYPELV 82
Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + + + KL+ E+ LA +L EK +G +S + P+I L
Sbjct: 83 FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PLLW++ E+ L G+ EIL E ++ + V G ++Q+ I TE
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKG--WKQFFQTIETE 191
Query: 253 AFTFEIF----KQAFVAVQS----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
T E F + AF +V S + L R + P L ++
Sbjct: 192 KMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSIT 247
Query: 305 VDDAVQLVVDRPY-----------KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
D A++ ++ + + GE + + G +PN +L YGFV + +
Sbjct: 248 FDTALERDNEKGFPVRVADKGKSIEEGEELFGFYGDKPNWNMLTTYGFVSPNPKCQETTL 307
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 412
+++ +DP + K + + G ++V Q+F + + + + ++ Y R+ +S+ +++
Sbjct: 308 SVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR--HDTDPMGPLINYFRIREISNEADLT 365
Query: 413 SVISSLGPI 421
V ++ G +
Sbjct: 366 KVQTNYGEM 374
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 9 DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 417
K V R G + VF +HA E + +L +LR+ +++ + +I +
Sbjct: 69 MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127
Query: 418 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
LG PVS E +L + +AR L Y T+SED+A L +L +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183
Query: 473 LVRMEKKMLNACLQVTA 489
L EK++L ++ A
Sbjct: 184 LRLGEKEILEKAIKSAA 200
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 304
Query: 341 FVDEDNPYDRLVV 353
F +NP D +V
Sbjct: 305 FCLPNNPCDYRIV 317
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 294
Query: 341 FVDEDNPYDRLVV 353
F +NP D +V
Sbjct: 295 FCLPNNPCDYRIV 307
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 122/278 (43%), Gaps = 37/278 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
+ A ED++ G+ +P++ ++T+ER + + +L E + LA +L + +
Sbjct: 25 LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82
Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
G +S + Y++ L R+ G L W E ++ L GSP++ ER +
Sbjct: 83 NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
E+ ++P P + AF + S ++ L + ALVP
Sbjct: 136 IEEIRA----------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPW 177
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 345
+L + C A + V L DR YK GE + G +P+++LLI+YGF E +
Sbjct: 178 AD-MLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGE 236
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVA-QRNGKLSVQVF 382
NP D + ++ D +Y D + A ++ G V+ F
Sbjct: 237 NPDDEYEITLGIDPND-RYADAKAAALEKIGLRPVESF 273
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 115/277 (41%), Gaps = 24/277 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L +P + T++ F A S + + S LVP+ L
Sbjct: 265 ------LCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 318
Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDR 350
S VD+A ++ + RP AGE + G P S LL YGF+ DNPYD
Sbjct: 319 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLLTFYGFLPRGDNPYDV 378
Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
+ ++ D D+ + AQ + S Q H+ G
Sbjct: 379 IPLDL-----DTSADDEDITAQSSATTS-QTTHMVRG 409
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 108/250 (43%), Gaps = 18/250 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
+ AS +++ G+ VP+ V+ E VL E + + ++ E+ L + +M+E
Sbjct: 68 LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ +G +S W PY+ L PL W E L G+ ++L RA+
Sbjct: 128 RWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDA 181
Query: 228 YNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ +W + G ++P +P +E+++ A AV S L + A+V
Sbjct: 182 PTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQAMV 238
Query: 286 PLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P+ L + L + +Q++ R AG +V G N++LL YGFV+
Sbjct: 239 PVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYGFVE 298
Query: 344 EDNPYDRLVV 353
N Y+ + V
Sbjct: 299 RANRYNHIPV 308
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 163/397 (41%), Gaps = 53/397 (13%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
FS+P S V L + L + +L+E + LAL ++ + Q S W PY+
Sbjct: 55 FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-----------EGIKREYNEL 231
L Q + S + WS++ELA L S +I ++ +G++ E+
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
S+ Y +DIP + + A V +K L+ ++PL
Sbjct: 162 ---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEKTILS----MIPLA 214
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNP 347
L A + + A L ++ +++ +P GE I G P S LL YG+V D +
Sbjct: 215 DMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDLLRRYGYVTDGYSA 274
Query: 348 YDRLVVEAAL----------NTEDPQY-QDKRMV----AQRNGKLSVQVFHVHAGREKEA 392
YD + A L + P+ QDK V A+R G VH+ ++ +
Sbjct: 275 YDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDESFDLVHSSPDEPS 334
Query: 393 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
I D L + D S +++++ S + S LA +AR Y T+
Sbjct: 335 IPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAILLQARENEYSTTVE 394
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
EDE +L + +L + +A Q+ EKK+L A ++ A
Sbjct: 395 EDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREAA 431
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 105/252 (41%), Gaps = 34/252 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
+ AS+D+ + F +P S ++ N T ++L T KL EL+ L + +
Sbjct: 90 IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142
Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 215
MYE K + +S W Y L L + W++ EL++LT G
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197
Query: 216 EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+ R E+NE D + GS+ F P I +F EI
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
++ +++PL L A + KC A L D+++++ +P K GE + G P
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTYGELP 316
Query: 332 NSKLLINYGFVD 343
NS+LL YG+V+
Sbjct: 317 NSELLRKYGYVE 328
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 96/429 (22%), Positives = 173/429 (40%), Gaps = 73/429 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A ++Q G+ F +P+ +V+ ++ N+ +A+ L L L + ++YE G+
Sbjct: 50 VVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKN--LGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
+S W Y + L + ++ + WSE E + L S + +I E+
Sbjct: 108 QSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLP 160
Query: 224 IKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R + +L DT + GSL Y +DI + ++
Sbjct: 161 LVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYL 220
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
L K +VPL L A + + A L + A+ + +P KAG+ I
Sbjct: 221 TDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIF 271
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRN 374
G P S LL YG+V DN VVE L N ED +Y +++ Q
Sbjct: 272 NDYGELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQLKLLDQL- 329
Query: 375 GKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
++ + + ++ + D+LP L D+ E+Q ++S P+ E
Sbjct: 330 -EILEDGYCILRPSPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGAREA 388
Query: 431 AVLDQLADYFKARLAGYPATLSEDEAMLTDY-------NLHPKKRVATQLVRMEKKMLNA 483
+ L D ++++ Y T+ ED+ +L + ++ +A Q+ EK++L A
Sbjct: 389 RI---LLDAAQSKMGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQA 445
Query: 484 CLQVTADMI 492
L + D +
Sbjct: 446 LLMMLQDFL 454
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 164/395 (41%), Gaps = 38/395 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+++ + + +L + LS LALYL++ K
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R L + R +A + + +++ EL GS A + + +
Sbjct: 87 ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L +L Q+ P + FT E +K A ++ S + A +
Sbjct: 137 DDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSETASVRLVA 190
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
PL +L +S +C A D + ++ + Y+ G+ I ++ G PN++LL YGFV
Sbjct: 191 PLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFV 248
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 401
DNP D + + P Y+ K + G S + K+ + +++L YLR
Sbjct: 249 LPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLR 305
Query: 402 LGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
+ + D S + + L G V+ E VL L D + L G+ L + EA L
Sbjct: 306 IQRL-DESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFGIPLEKLEAQLV 364
Query: 460 --DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
DY A + E+++L + D++
Sbjct: 365 AGDYPAGGNAWAAAHVSAGEQRVLTRAKKTAEDLL 399
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 92/405 (22%), Positives = 165/405 (40%), Gaps = 65/405 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A D+ + +P L + + V +GN + L +AL+L+ EK +
Sbjct: 84 AERDIARNEVVLEIPKKLWINPDAVAASDIGN-------VCSGLKPWISVALFLIREKLK 136
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------SETELAYLTGSPT 213
+ S W PY+ L S + W SE ELA L G+
Sbjct: 137 KEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRSEEELAELQGTQL 189
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
L E ++RE+ +++ + + +P I T + F AF ++S
Sbjct: 190 LRTTLGVKEYMQREFAKVEEEILLPHK--ELFPSPI-----TLDDFLWAFGILRSRAFSR 242
Query: 274 QKVSLARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
+ + L+PL + AY K + + + L K+GE +
Sbjct: 243 LR---GQNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPISVKSGEQV 298
Query: 324 VV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
++ + + N++L ++YGF+++ + + ++ DP + DK +A+ NG F
Sbjct: 299 LIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGSGETADF 358
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLA 437
+ G MLPYLRL + T ++S+ + G + P+S E + +
Sbjct: 359 DIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEELICRVVR 416
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
D K+ L+GY T+ EDE L +L+P+ +A + EKK+L
Sbjct: 417 DACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQ 460
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 168/394 (42%), Gaps = 43/394 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGK 172
A+ DLQ +VP L++++ + + + L + LAL+++ E+ +
Sbjct: 26 ATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLCERLH-E 84
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L R + + L +S ++ L GSP+ E L++ GI ++Y
Sbjct: 85 DSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY---- 133
Query: 233 TVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
V+F +P + +FTF+ F+ A V + ++ S AL+P+
Sbjct: 134 -VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALIPM---- 188
Query: 292 LAYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ C DD+ V+ + +P +AG+ + ++ G + N+ L + GFV +
Sbjct: 189 WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNADRLFHNGFVYTEAE 248
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 406
D + ++ ++ D Y K + G L R E IS ++ +LR+ +
Sbjct: 249 EDWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPISPELRIFLRV-FSM 306
Query: 407 DTSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSED 454
+T E++ ++ L +C + +L +F R L Y T ED
Sbjct: 307 NTGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLILGQYKTTKQED 366
Query: 455 EAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 486
EA+L+ D LH R +L E+ +L + L+
Sbjct: 367 EALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/295 (23%), Positives = 116/295 (39%), Gaps = 57/295 (19%)
Query: 70 VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
VV + D D SWM NG S +EK H +A L G
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562
Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
VV L+ + ET+ L + ++ A L+ EK +G S W
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
YI L + + P+L + EL + P E+++ + I+ ++ L +V +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670
Query: 238 AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
AG+ F++Y + + + AFT +F A + ++ ++ + + P+
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG- 729
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
V++V R K GE + GP+ N L + YGFV +DNP+D
Sbjct: 730 ------------GKVEIVARRDIKKGEELFASFGPRANDNLFLYYGFVLKDNPFD 772
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 164/395 (41%), Gaps = 38/395 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+E+ + + +L + + LS LA+YL++ +
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS A + + ++
Sbjct: 87 ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L L Q+ P + FT E +K A ++ S + VS LV
Sbjct: 137 DDYRQLLV------PLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDF-AVSGTTSVRLV 189
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+L +S +C A D + ++ + Y+ G+ + ++ G PN++LL YGFV
Sbjct: 190 APLADMLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQVFIYYGSVPNNRLLRLYGFV 248
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 401
DNP D + + P Y+ K + G S + K+ + +++L YLR
Sbjct: 249 LPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLR 305
Query: 402 LGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
+ + D S + + L G VS E VL L D + L G+ L + EA L
Sbjct: 306 IQRL-DESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGSLLEGFGIPLEKLEAQLA 364
Query: 460 --DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
DY A + E+++L + D++
Sbjct: 365 AGDYPAGGNAWAAAHVSAGEQRVLTRAKRTAEDLL 399
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 96/460 (20%), Positives = 174/460 (37%), Gaps = 58/460 (12%)
Query: 65 AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
A S ++V E L + W+ G CKV + S + A++D++
Sbjct: 56 ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
G+ +P++L +T +L ++L + L+++L+ E S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELSN-PTSEWGP 168
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ L +Q + W E GSP + G R+Y ++
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY------CYLY 215
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLGP--PL 291
+ + +P +FT++ F A VQS Q A AL+P
Sbjct: 216 SMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFWDFCNH 271
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ SK + + + + +K GE + ++ GP+ N++LL++ GF + N +D
Sbjct: 272 SSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTNLHDSY 331
Query: 352 VVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
E L + ++ ++ +R + V V E +++P+ R+ Y
Sbjct: 332 PFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-YALSEQ 390
Query: 410 EMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLS 452
E +++ L P+ ++ E L K +LA YP TL
Sbjct: 391 ETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASYPTTLE 450
Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
EDE L N +R EKK+L+ ++ +I
Sbjct: 451 EDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 110/247 (44%), Gaps = 20/247 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G++
Sbjct: 64 ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGER 123
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PY+ + P+ W E E+ +L +P + + LE+ K E EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KTEVQELHT 171
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA---LVPLGP- 289
+ Q D + +T++ + A+ V + V+++ R A + L P
Sbjct: 172 ESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALAPY 231
Query: 290 -PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
LL +S + + D ++ + + + + GP N +LL+ YGFV +N
Sbjct: 232 LDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANN 291
Query: 347 PYDRLVV 353
P+ + V
Sbjct: 292 PHRSVYV 298
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 75/382 (19%), Positives = 166/382 (43%), Gaps = 55/382 (14%)
Query: 133 VVTLERVLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ +L NE I E+ + +N + LA+ L+ E + KKSFW P
Sbjct: 102 LIASRNILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFP 161
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI L + L W EL ++ GS + +K +Y ++
Sbjct: 162 YIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNK----- 211
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
+FQ+ P++ F ++ ++ + + S + LQ+ ++ L+P LL ++
Sbjct: 212 -EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQE---TKKVVLIPY-IDLLNHNPFS 266
Query: 299 KAMLAA----VDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+ ++ + D+ ++VV D+ + + + G + N +LL YGF+ E NPYD ++
Sbjct: 267 SSFISYRKIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIAERNPYDSVI 326
Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----GYVSDT 408
+ +++ +D +++K+ N K + + + + +M+ ++++ ++D
Sbjct: 327 IRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIKICLFNTNINDK 383
Query: 409 SEMQSVISSLGPICPVSPC----MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
+ + I + + C +E+++ DY R L E+ ++D
Sbjct: 384 NFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDYENLR----NIMLKENLLHISD---- 435
Query: 465 PKKRVATQLVRMEKKMLNACLQ 486
++++ + +EKK+LN L+
Sbjct: 436 -NQKISIKYNALEKKILNRFLE 456
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 117/249 (46%), Gaps = 20/249 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DLQ G+ S+P+S ++T E VL + + T + +S L L +L+ E+
Sbjct: 62 LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
++S W PY+ L + P+ W E+E+ L +P + + LE+ +K + E
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTE- 172
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
W SL + +I T+ +T+ + A+ V + V H ++ L+ + + +
Sbjct: 173 --SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMA 229
Query: 289 PPL-LAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P L L S + AA ++ ++ + + + + GP N +LL+ YGF+
Sbjct: 230 PYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFIAA 289
Query: 345 DNPYDRLVV 353
+NP+ + V
Sbjct: 290 NNPHRSVYV 298
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 123/302 (40%), Gaps = 40/302 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARRFALV 285
+ G + Q Y+ FT F+ + V +S +++++ S F+++
Sbjct: 177 ILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDDFSIL 236
Query: 286 PLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
PL ++ + L + A +L+ + Y G+ + G + NS+LL+ YG
Sbjct: 237 ---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELLLGYG 293
Query: 341 FV 342
F+
Sbjct: 294 FI 295
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 121/307 (39%), Gaps = 50/307 (16%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
+ G + Q Y+ FT F+ + V +S ++++ L+P G
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER--------LLPEGT 228
Query: 290 ---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
PL ++ + L + A +L+ + Y G+ + G + NS+L
Sbjct: 229 KIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSEL 288
Query: 336 LINYGFV 342
L+ YGF+
Sbjct: 289 LLGYGFI 295
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+VP+ +L ++ A + +D + +V RP KAGE I+ + GP PNS+LL YG+
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129
Query: 342 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 377
V + P+D +V++ L + DP+ ++D ++ + +G+
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187
Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
+ + +E +++ L+ + ++++ G + P + +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240
Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
+ LA YP T EDEA+L NL ++R+A ++ EK++L LQ+
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQMDG 292
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/323 (23%), Positives = 130/323 (40%), Gaps = 34/323 (10%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + LQ G
Sbjct: 22 SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
S+P S +++ + V+ + +T K S L L +L+ EK G +S W PY+ L
Sbjct: 76 SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P+ W E E+ L P +A+ E+ ++ + + LF +
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--S 296
+I FT+ A+ V + V+L+ L R F+ P L P LL +S
Sbjct: 187 AVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEPDTCALAPYLDLLNHSPDV 241
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
+ KA ++V + E + + GP N +LL+ YGFV NP+ + V
Sbjct: 242 QVKAAFNEKTRCYEIVAVSSCRKHEEVFICYGPHDNHRLLLEYGFVSTRNPHACVYVSRD 301
Query: 357 LNTEDPQYQDKRMVAQRNGKLSV 379
+ + DK+M N K+S+
Sbjct: 302 ILVKYLPSTDKQM----NKKISI 320
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 116/251 (46%), Gaps = 14/251 (5%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 465
Query: 371 AQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG 419
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 466 LARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLG 524
Query: 420 -PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 478
PVS E + L D L Y T ED+A+L + +L + ++A +L EK
Sbjct: 525 NSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKLRLGEK 584
Query: 479 KMLNACLQVTA 489
++L +Q A
Sbjct: 585 EILEKAVQSAA 595
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 14/182 (7%)
Query: 315 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
R A + +VVW G +PN +LL+ G + ++N D L A L D Y
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 425
K V + G + + F V+A R +L YLRL V+D + + + +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280
Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
E +L L + RLA Y + ED + +L PK+R+A +L EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340
Query: 486 QV 487
+
Sbjct: 341 EA 342
>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 492
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 55/280 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
D++ M + + P P+ EA + + + A + + + ++KV
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209
Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+VPL L A + + A L DD++ + +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
GE I G P + LL YG+V DN VVE +L+
Sbjct: 270 VGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD 308
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 55/280 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
D++ M + + P P+ EA + + + A + + + ++KV
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209
Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+VPL L A + + A L DD++ + +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
GE I G P + LL YG+V DN VVE +L+
Sbjct: 270 VGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD 308
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 113/281 (40%), Gaps = 52/281 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
V A+++++ + F +P S V+++E +L E T + L L L ++YE G
Sbjct: 40 VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------------- 217
S W PY L + + + WSE ELA L S A+I
Sbjct: 98 DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANEAFLGQLV 150
Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
++AE ++ E N T+ GSL Y +DI
Sbjct: 151 PVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT------P 202
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
+ V + + +L + ++PL L A + +C A L +++ +P KAG
Sbjct: 203 RKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKALKPIKAG 260
Query: 321 ESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE 360
E I GP P S LL YG+V E+ YD + V L +E
Sbjct: 261 EEIFNDYGPLPRSDLLRRYGYVTENYAQYDVVEVPMELVSE 301
>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
Length = 1233
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 146/375 (38%), Gaps = 82/375 (21%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H A+ D+ G+ VP SL +T V G + E+L + SEL LAL+LM E+ +
Sbjct: 871 HGFVAARDVGQGEVLLQVPGSLAITAVDV-GKDAQLEVLARGR-SELVGLALWLMQERAK 928
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
A +P+LW + E L GSP E R + +++E+
Sbjct: 929 ----------------------ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQ 966
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLG 288
++ + + YP + E QAF+ S V+ H + A+ FAL+PL
Sbjct: 967 DIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVLAHAAYLPKAQCFALLPLV 1018
Query: 289 PPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
L S A+L +AV +V R G+ + ++C
Sbjct: 1019 GGLCRTGSSSGALLDYDLEREAVTVVAQR--TPGQEVALYC------------------- 1057
Query: 347 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
L + A+L D Y KR + + G F + ++ A ++ +
Sbjct: 1058 ----LFMAASLVAADRLYTTKREILEELGLGVKAEFPIF--EDRLATQQLINF------- 1104
Query: 407 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 466
E ++I SP E +L L + R+ Y +D L +L P+
Sbjct: 1105 ---EQDTII---------SPENEYEILQLLMGDLRDRIQAYATEFDDDIKDLQRTDLTPR 1152
Query: 467 KRVATQLVRMEKKML 481
+R+A QL EK++L
Sbjct: 1153 QRLAAQLRLGEKRIL 1167
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 120/302 (39%), Gaps = 51/302 (16%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ K+G I E + + R V AS+++ + F +P + ++++E +L
Sbjct: 13 AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S +I E+A+ +E +
Sbjct: 121 EDELAELQASAVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
M GSL Y +D+ T E+ ++ A + L K +VPL L
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
A + +C A L D +++ +P +AGE I GP P S LL YG+V DN V
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDV 291
Query: 353 VE 354
VE
Sbjct: 292 VE 293
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 58/231 (25%), Positives = 112/231 (48%), Gaps = 35/231 (15%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
T++ + L+L+L+ EK +GK SFW PYIR L + +P+ ++E+EL L+
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282
Query: 211 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S + E+A +K+E +N+L+ F+ L + FTF+ F+ A+ +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329
Query: 267 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 317
++ ++ + L+ + LVP+ L+ +S KA ++ V PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 367
+ + + + G + N++L++ +GF +NP D + + + E PQ D+
Sbjct: 389 RKYDQVFISYGFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 108/247 (43%), Gaps = 20/247 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 64 ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGDC 123
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PY+ + P+ W E E+ +L +P + + LE+ K E EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KTEVQELHT 171
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA---LVPLGP- 289
S Q D + +T++ + A+ V + V+++ R A + L P
Sbjct: 172 ESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALAPY 231
Query: 290 -PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
LL +S + + D ++ + + + + GP N +LL+ YGFV +N
Sbjct: 232 LDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANN 291
Query: 347 PYDRLVV 353
P+ + V
Sbjct: 292 PHRSVYV 298
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 92/395 (23%), Positives = 153/395 (38%), Gaps = 45/395 (11%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
A + VP SL++ L E + + K + A LAL++++E ++ SF
Sbjct: 4 ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY L + V P+ W++ ++ L GSP A +L + + + + E
Sbjct: 63 WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTE----- 110
Query: 236 FMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ ++YP + + + +++ F+ + S + +VP+
Sbjct: 111 -HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPM 169
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+ + DD Q V +PY GE + + N +LL Y + E
Sbjct: 170 ADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVE 229
Query: 345 DN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
DN D + E + R + QR L+ Q + V + +E D++
Sbjct: 230 DNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV---KSEEFPLDLV 284
Query: 398 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
LRL ++ T S+ PVS E V D +A K L+ YP T ED AM
Sbjct: 285 QALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQYPITAQEDAAM 342
Query: 458 LT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 491
L D L R+A R +K L V A+M
Sbjct: 343 LAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 111/255 (43%), Gaps = 24/255 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EYN + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
P+ + F+ F A+ V V +L+ ++ P +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRPF-VDFCNH 172
Query: 295 SSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
A ++ DA ++ +R Y GE + + G + N+ L ++YGFV +N D+
Sbjct: 173 DPVSHATVSHDTCKDAT-IIAERDYTKGEEVFISYGKRSNAVLAVDYGFVLPNNLSDQAE 231
Query: 353 VEAALNTEDPQYQDK 367
+ + DP + K
Sbjct: 232 LWMEIPWNDPLREKK 246
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 99/223 (44%), Gaps = 24/223 (10%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
A +LM + + FW PYIR L G+ + +PL + E E L +L G + A
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161
Query: 219 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 271
ER I +R Y L + F + Y +D+ A T I +AF A + S +
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
L+ + R L+PL + A + K K A D++ LVV AGE + GP
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGP 274
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 372
+ N +L++NYGF DNP + VV + P Q K Q
Sbjct: 275 RNNEQLMMNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQ 317
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 102/464 (21%), Positives = 187/464 (40%), Gaps = 97/464 (20%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
D + G ++ +++D ++++W + P + EK + +S D++
Sbjct: 12 DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
G+ S+P +++ +V + + L NK+ +L A LY Y +
Sbjct: 61 EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W + E+ + S L +S+ E L GSP +++ + K +
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPL 287
L SLF++ + A T++ K A+ V S ++ + +L F V L
Sbjct: 167 L-------KSLFKEQQFS--KCAMTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVL 217
Query: 288 GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGF 341
P L Y + + AA D A+++V +P K GE I + G Q NS LLI+YGF
Sbjct: 218 AP-FLDYFNHAEDAQAAYDFDYDESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGF 276
Query: 342 VDEDNPYDRLV---VEAALNT---EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAI 393
+D+ + V VE LNT DPQ +K + + + + +++F E I
Sbjct: 277 IDQSSTAKHCVNVLVEELLNTIPASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKI 336
Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
S L Y ++L L ++ YP T+ E
Sbjct: 337 SKYLSYKNF----------------------------SLLPYLKSLIDMKMKAYPTTMEE 368
Query: 454 DEAML---TDYNLHPKKRVATQLVRMEK----KMLNACLQVTAD 490
D A++ T++ ++ + ++R+++ K + A +QV D
Sbjct: 369 DRAIIEATTEFEKLSQRSKMSIIMRLQEKETLKEIGALIQVKID 412
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274
Query: 418 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334
Query: 477 EKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 335 EKEILEKAVKSAA 347
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 401
DN D + ++ D +++ + + SV F + G E ISD +L +LR
Sbjct: 78 CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 402 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
+L + L K++A QL EK++L+ L+ I
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALEYVEQWI 230
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 103/242 (42%), Gaps = 29/242 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK-- 172
AS ++AG+ + L++ + G + T ++LA + L Y K Q K
Sbjct: 25 ASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERY-KGQAKLQ 83
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 84 SSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQ 138
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ LF + + E FK + V S + +++ LV + P+L
Sbjct: 139 NALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLVMI--PML 182
Query: 293 AY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ +S K + + + DR Y + I + G N++L ++YGF +NP
Sbjct: 183 DFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYGFAVPENP 242
Query: 348 YD 349
YD
Sbjct: 243 YD 244
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 97/410 (23%), Positives = 165/410 (40%), Gaps = 59/410 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-LALY-LMYEKKQ 170
+ A D+ + F++P V+ LG+ +L E+ C LAL ++ + Q
Sbjct: 42 LVAQSDIGEDEVLFTIPRDAVLNTTTALGSADNPAIL------EMPCWLALTAIILTEGQ 95
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY- 228
+ S W PY+ L + ++S + WSE+EL L S +I AE + E+
Sbjct: 96 QEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEHI 148
Query: 229 ------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-----CVVHLQKVS 277
N + S+ Y +DIP K+ +S +V +
Sbjct: 149 SPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPESPEDGDDLVSDNEEE 200
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
+++PL L A + A L ++ +++ +P GE I+ G P S LL
Sbjct: 201 ENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDYGQLPRSDLLR 260
Query: 338 NYGFV-DEDNPYDRLVVE-------AALNTEDP-------------QYQDKRMVAQRNGK 376
YG++ D+ YD V E A+L+TE P + + + +AQR G
Sbjct: 261 RYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLEQRVELAQREGV 318
Query: 377 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 436
H G + +I D L L + D + ++ +S + S V L
Sbjct: 319 YEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRSKLATSLVGQIL 378
Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
++R Y T+ D+A+L NL +KR+A ++ EK +L +Q
Sbjct: 379 TKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAIQ 428
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 418 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412
Query: 477 EKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 78/191 (40%), Gaps = 12/191 (6%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + + P W E ++ +L G+ I E
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168
Query: 222 EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 276
IKREY + V F + Q Y FT F+ + + QS ++ +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP 331
AR + PL ++ D + QL+ Y+ G+ + G +
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKT 288
Query: 332 NSKLLINYGFV 342
NS+LL+ YGF+
Sbjct: 289 NSELLLAYGFI 299
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 96/406 (23%), Positives = 164/406 (40%), Gaps = 52/406 (12%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+K+ E L + K W+ N P + L +K + + A + ++ D S+
Sbjct: 34 TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85
Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
P ++++ + G + I E + + LS A+++MY K +KSFW PY+ L
Sbjct: 86 PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
+ + L +S+ EL L S K + R +GI+R Y S F
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHYE----------STFS 185
Query: 244 QYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYS-SKCKA 300
+ +P E ++ Q F SCV + SLA +VPL A SK K
Sbjct: 186 RLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDGGMVPLADMFNAEDRSKSKV 243
Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA- 355
+ D + GE I G P +S++L++YGF+ DE D + +
Sbjct: 244 LPKVTDTTLDYYASDDIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGTVSDNVAITVP 303
Query: 356 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
+ ++P K+ + + N ++ +VF + A D+L Y R+ + Q+
Sbjct: 304 VFHNDEPNLSTKQEILEENDIIN-EVFLLQKTDPLPA--DLLLYARVKNLIAKECDQAKK 360
Query: 416 SSLGP---ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
L P P++ E+ L L + L Y L D+ +L
Sbjct: 361 HFLSPNTRNTPLNTRNEKVSLRFLENLIHRYLDSYGTNLESDKNLL 406
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
K V R G + VF +H+ E + +L +LR+ +++ + + I +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224
Query: 418 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
LG PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284
Query: 477 EKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 285 EKEILEKAVKSAA 297
>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
Length = 489
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 78/308 (25%), Positives = 123/308 (39%), Gaps = 54/308 (17%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
S +EL L GS + ILE RA +G+ + T +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEESILETIAPIVRANPALFPPIDGVASYDGDAGTQALL 188
Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
GSL Y +DI P + + ++ + +VPL
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEE--------EEQSSKGMVPLADL 240
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L A + + A L ++ + + +P KAGE I G P S LL YG+V DN
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDYGEIPRSDLLRRYGYV-TDNYAQY 299
Query: 351 LVVEAALN 358
VVE +L+
Sbjct: 300 DVVELSLD 307
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 76/171 (44%), Gaps = 18/171 (10%)
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E E++ L G+P +E + I+ +Y V +L YP DI + T + F
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPV---LQALTAAYPDDITPDLVTEDKFI 121
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 312
A S + ++ V A R LVP+ P ++ Y L A D+++L
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRYGR-----LDAATDSLRLR 176
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 361
R AGE + GP PN KLL+ YGF DNP+D + EA N D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 401
+N D + ++ D +++ + + G SV F + G E ISD +L +LR
Sbjct: 78 YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 402 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
+L + L K++A QL EK++L L+ I
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALEYVEQWI 230
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 106/247 (42%), Gaps = 31/247 (12%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L E +G +S W YI L + +PL ++E + A+L + + E
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSL 278
R K EY E A SL + PTE FTF+++ A V C +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179
Query: 279 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 332
+ P+ PL+ + + K K + D +VQL+ G + GP+ N
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGN 239
Query: 333 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREK 390
+LL+ YGF DNP+D + ++ A++ + P K + + + +LS VF + +K
Sbjct: 240 EELLMGYGFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDK 299
Query: 391 EAISDML 397
E +L
Sbjct: 300 EIFQKIL 306
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L + +++V +P AGE I G PNS LL YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319
Query: 343 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 390
D E NP D + + A L R + G L V+V F + +
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRVDFWLEEADDDT 374
Query: 391 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
E ++L ++RL +S T + + + + G + +E +L + D K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430
Query: 444 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 486
L YP T+ EDE++L + NL KR A + EK++L LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 150/375 (40%), Gaps = 62/375 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 276
R + + +EL + + P +I A F A AV S + V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 326
+ A++ PL+ + A +++ + ++V R + G +I +
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LS 378
GP N LL++YGFV DNP+DR+ L + ++ RM+A R G S
Sbjct: 266 YGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPAS 321
Query: 379 VQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGP 420
QV V G +E +L LR+ + E + ++S + G
Sbjct: 322 WQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGV 381
Query: 421 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
VS E VL L + T+ EDEA L+D +L R+A Q +K++
Sbjct: 382 ESMVSSDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRL 441
Query: 481 LNACLQVTADMIMLL 495
+ L+ +M L
Sbjct: 442 VVRVLESLKKRLMDL 456
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 416
K V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
PVS E + L D L Y T+ ED+ +L + +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412
Query: 477 EKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 413 EKEILEKAVKSAA 425
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 93/466 (19%), Positives = 183/466 (39%), Gaps = 63/466 (13%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE 117
S L S+ + + + +L W+ KV ++ K +E +R + AS+
Sbjct: 20 DSESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQ 73
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSF 175
++ G+ VP + ++LE V + I +++ N + + + + + ++ + + SF
Sbjct: 74 FIRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSF 133
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L + P + + A L GSPT ++ + + + EY+ L
Sbjct: 134 WKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNLKE-- 185
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----------AL 284
A FQ+Y Y T+ F V + ++++R F L
Sbjct: 186 --AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPVYIGENEQQQLL 225
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFV 342
VPL + + + DA + R + GE + G N +NYGF
Sbjct: 226 VPLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFA 284
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
NP ++ + L+ D + +M + G + + + L +R
Sbjct: 285 SLTNPMNQFDFDICLDRNDRMF---KMKVELTGGNICWGNRLVNETDHDTFRQSLATVRF 341
Query: 403 GYVS---DTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYFKARLAGYPATLSE 453
+S D +++ + + P +P +E+A L D + LA + +T+ +
Sbjct: 342 AQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLLVSELANFASTIED 401
Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 499
D+ L D + +R L EK+++ ++V DM++ + D T
Sbjct: 402 DQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 124/318 (38%), Gaps = 81/318 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
V A D+ AG +VP +L++T E+ +ET +L+T+ L +EL+ L
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
++L+ E+++G S+W PYI +L QR R PL ++E EL L SP E +
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+ R+Y + ++QY E+ + A + S H + + RR
Sbjct: 282 RNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQRRR 323
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------------------------- 315
+P+ L Y C A A Q+VV
Sbjct: 324 AQAMPVRYNELTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGG 383
Query: 316 ------------------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
P GE +++ G + + N FV D+P D+ ++ A+
Sbjct: 384 ASSFYDTQTRSLECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILLAV 443
Query: 358 NTEDPQYQDKRMVAQRNG 375
+DP ++ K + Q G
Sbjct: 444 GKQDPLFKSKSTILQALG 461
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 95/421 (22%), Positives = 173/421 (41%), Gaps = 44/421 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++ LQ G+ +P L ++ + ++ L N+L + +AL+LM E+ +
Sbjct: 39 VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SF+ P+I+ L Q + P+ W++++ A L G T +L + I R+ E D
Sbjct: 99 DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEG--TNVALLAK---IMRKQIEAD 146
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
F A + Y+ T EI + S ++ + + R + + P L
Sbjct: 147 ---FQAIHIPLLRAYEERLNLRTSEISISDYEWALS-IIWTRAFGITRYGEYLRVLCPAL 202
Query: 293 AYSSKCKAMLAAVD---------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 342
+ + +D D + V A + + G ++KLL +YGFV
Sbjct: 203 DMFNHSVLVQEPLDEFIKYDHMKDVLAHCVVMETSANDPFYISYGSYSDAKLLYSYGFVS 262
Query: 343 -DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYL 400
+E N ++ + + + DP ++ K+ + + N Q + + + + L
Sbjct: 263 LNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAATRDQTYDFRGTIHLDDVDERFLASF 322
Query: 401 RLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
R+ +S + E + S VS E AV + D + RLA +P +L +D L
Sbjct: 323 RIILLSQEEFREYEKAFDS----TIVSVRNELAVYAAIHDVCEKRLARFPTSLEDDLKKL 378
Query: 459 TDYNLHPKKRVATQL-VRME-KKMLNACLQVTADMIMLL--------PDVTVSPCPAPYA 508
+ ++ R + VRME KK+L + ++ + LL PDVT P
Sbjct: 379 AELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEWRNLLENDSNIYPPDVTRQQQPQLSM 438
Query: 509 P 509
P
Sbjct: 439 P 439
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 144/346 (41%), Gaps = 36/346 (10%)
Query: 80 DLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
DL WM NG ++ ++E E R + A + + AG+ VP L++T
Sbjct: 28 DLLEWMIANGFELHVQLCVRE---FTETGRGL---ATLQKVTAGETFLRVPTCLLITTTT 81
Query: 139 VLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L + L+ ++ L+ + L L+L+ EK +G S W +I L ++ +P
Sbjct: 82 ALSSSLHGFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTP 134
Query: 198 LLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+ LA L + KAE + I+R + L + + +L +E FT+
Sbjct: 135 VFLGSKLLARLPETMCRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTW 190
Query: 257 EIFKQAFVAVQS-CVVHLQKVSLARRFA---LVPLGPPLLAYSSKCKAMLAAVDDA--VQ 310
+F A+ AV + C+ K F L P L + KA + + +
Sbjct: 191 HLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFE 248
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
+V + Y+ + + + G N KLL+ YGFV DNP D + + T+ Y ++
Sbjct: 249 IVTNNNYEPNDQVFISYGSHDNKKLLLEYGFVLADNPNDVVAI-----TKGHLY---KLN 300
Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 416
+Q+N + + EK+ ISD + G + + V+S
Sbjct: 301 SQQNDTVLYFATKLSFLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 123/295 (41%), Gaps = 36/295 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
AA+ + AGD A ++P + T+ L G A + L E AL+L+ E+
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAYLTGSPTKAEILERAEG 223
G+ S W I L G G+ + +P+ W E A L G+P A+ + +E
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
+ R++ L F A ++ D+ P A+T + F++A A S + +Q
Sbjct: 306 LARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASPGEPAA 358
Query: 283 ALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+P L P ++ YS D ++L V R AGE + V G + N+
Sbjct: 359 TCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSYGAKSNA 411
Query: 334 KLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
+LL+ YGF NPYD + + L E R A L++ V AG
Sbjct: 412 ELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLSPHAVRAG 466
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 130/316 (41%), Gaps = 52/316 (16%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
L + W+ N + P ++ + EK + + A + ++ + SVP +++
Sbjct: 35 LNEFNKWLINNKVYKNPKIEIKVLEKYGRS--------IVAKQSIKKNEKLISVPKLIIM 86
Query: 135 T----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+ L NE I E + +S A++LMY K KSFW PY+ L ++
Sbjct: 87 SNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKE---- 140
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ + +SE EL L S K + R +GI+R YN T G + + P
Sbjct: 141 ---FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHYNSTFTRLSNRG-IAEFSPTSTQ 196
Query: 251 T---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL-------VPLGPPLLAYS-SKCK 299
T + +T E+F A SCV +R F+L VPL A SK K
Sbjct: 197 TLQQKGYTLELFTWAL----SCV-------WSRAFSLSDSDGGMVPLADMFNAEEISKSK 245
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA 355
D + + GE I G P +S++L++YGFV D P D + +
Sbjct: 246 VQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSSQMLMDYGFVFDHGTPSDNVAISV 305
Query: 356 -ALNTEDPQYQDKRMV 370
+ ++P Q K+ +
Sbjct: 306 PIFHPDEPNIQVKQSI 321
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 106/241 (43%), Gaps = 22/241 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +DL+ GD A +P S++++ E V + L + +S L L+ M EK
Sbjct: 178 ARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNCDSK 237
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L +S + L G+ EI++ + + +Y+EL
Sbjct: 238 FKI-YFDTLPEKFNTG-------LSFSIQAITMLDGTLLLEEIMQARQHLHAQYDEL--- 286
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F A L +P P E +T+E F A S + + R L+PL L
Sbjct: 287 -FPA--LCNNFPDIFPPELYTWEKFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFL--N 341
Query: 295 SSKCKAML--AAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPY 348
S C ++ VD ++++ + RP ++GE + G +S L+ YGF+ + DN Y
Sbjct: 342 HSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECCLSYGNFSSSHLITFYGFLPQGDNSY 401
Query: 349 D 349
D
Sbjct: 402 D 402
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 109/258 (42%), Gaps = 34/258 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
V A++D+ G+ VP+ V+ E +E + + TN E L L LM EK
Sbjct: 32 VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR-- 226
K GK S W Y+ L + PL W +L L G+ ++E+ G K
Sbjct: 92 KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGCKAMP 141
Query: 227 --------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
++N + + F+ + + P++ + + ++ A V+ S + +
Sbjct: 142 DRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGEDRFQ- 198
Query: 279 ARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
A+VP+ L + L A A++++ GE ++ G PNS+LL
Sbjct: 199 ----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSYGDLPNSELL 254
Query: 337 INYGFVDED-NPYDRLVV 353
YGFV+ D NP+D L V
Sbjct: 255 RRYGFVETDPNPHDCLEV 272
>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 489
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 124/308 (40%), Gaps = 54/308 (17%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
S +EL L GS + I+E RA +G+ + T +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEGSIIESIAPIVRANPALFPPIDGVASYDGDAGTQALL 188
Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
GSL Y +DI P + + ++ + + +VPL
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGDRDGEDGYLTDEEEEQSSK--------GMVPLADL 240
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L A + + A L ++ + + +P K+GE I G P S LL YG+V DN
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKSGEEIFNDYGEIPRSDLLRRYGYV-TDNYAQY 299
Query: 351 LVVEAALN 358
VVE +L+
Sbjct: 300 DVVELSLD 307
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 147/364 (40%), Gaps = 49/364 (13%)
Query: 97 LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
LK++ + RP + + ++ LQ D S+P ++T + VL + + E +
Sbjct: 39 LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97
Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
K +S L L +L+ EK G+KS W PY+ L + P+ E ++
Sbjct: 98 MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVS 149
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L P + + E+ + Y + LF + I F + + A+ +
Sbjct: 150 LLPEPLRKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTIN 205
Query: 268 SCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKA 319
+ +++ K S + F+L P L P LL +S + KA + ++ + K
Sbjct: 206 TRTIYM-KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKK 264
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
E + + GP N +LL+ YGFV DNP+ + V +A + DK QRN K+S+
Sbjct: 265 YEEVFICYGPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSI 320
Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 439
H D+L L G+ + + + + L C R + L D
Sbjct: 321 LKDH-----------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDV 366
Query: 440 FKAR 443
AR
Sbjct: 367 ISAR 370
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 92/217 (42%), Gaps = 11/217 (5%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
AL+P+ + + A D + R ++ GE I + GP+ NS ++ GFV
Sbjct: 31 ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM-LPYLR 401
DN D + ++ D +++ + + SV F + G E ISDM L +LR
Sbjct: 91 YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148
Query: 402 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 149 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 207
Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
+L + L K++ QL EKK+L L+ I
Sbjct: 208 ELL-ETTLPQVKKMTVQLRVTEKKILLGALEYVEQWI 243
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 114/277 (41%), Gaps = 24/277 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P L+++ E + +E L N ++ L L+ M E+
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L +P + T++ F A S + + S LVP+ L
Sbjct: 296 ------LCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 349
Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDR 350
S VD+A ++ + RP AGE + G P S L+ YGF+ DNPYD
Sbjct: 350 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYDV 409
Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
+ ++ D D+ + AQ + S Q H+ G
Sbjct: 410 IPLDL-----DTSVDDEDIAAQSSATTS-QTTHMVRG 440
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 77/189 (40%), Gaps = 28/189 (14%)
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 244
+ +PL WS+ E L GS YN LD W M A L Q
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205
Query: 245 YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 300
+ D+P ++ + FK A+ + + + S R+ + P+ K
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265
Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
A +L + + GE I + + N++ L+ YGFV E NP+D + + A++ +
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGIAASIGND 325
Query: 361 DPQYQDKRM 369
P Y+DK +
Sbjct: 326 QPFYRDKSL 334
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
Y+ GE I ++ G + N+ L++ GFV DN + + +LN + Q++ ++ + ++ G
Sbjct: 74 YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133
Query: 377 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 427
S F+V G IS ++L + R LG+ +QS + L P C P
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188
Query: 428 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 477
+E ++ +++ Y RL TL +DEA+L + L K + Q +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248
Query: 478 KKMLNACLQ 486
KK+L+ L+
Sbjct: 249 KKILSEALE 257
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 80/194 (41%), Gaps = 18/194 (9%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + A+ P W E ++AYL G+ I E
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHL 273
+K E+ + L F A + Q Y FT F+ + V A Q L
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCG 328
F+++ PL ++ A D A QLV Y+ GE + G
Sbjct: 225 PPGCQLDDFSILQ---PLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYG 281
Query: 329 PQPNSKLLINYGFV 342
+ NS+LL+ YGF+
Sbjct: 282 LKTNSELLLAYGFI 295
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/287 (21%), Positives = 118/287 (41%), Gaps = 31/287 (10%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE W+H+NG C+ + + R + A++ L+ + +P L++
Sbjct: 18 EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71
Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
T++ ++ + + + +L+ LA++LM EK + +KSFW PYI L +
Sbjct: 72 TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
P ++E + L S + + + +EY EL + M LF +A
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLFPD-----QEDA 178
Query: 254 FTFEIFKQAFVAVQSCVV----------HLQKVSLARRFALVPLGPPL-LAYSSKCKAML 302
F F+ FK A+ A+++ + HL+ + PL + A +K +
Sbjct: 179 FNFKDFKWAWSAIKTRALDVPIGRESCRHLRDAEDTPTPTMFPLVDSINHAAQAKIRHRY 238
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
++ + Y+ ++ G N LL+ +GFV NP D
Sbjct: 239 NEKSRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPED 285
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 99/418 (23%), Positives = 170/418 (40%), Gaps = 54/418 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQG 171
V A DL AG+ VP SL++ +E + + +L +LS+ +A +L+YE +
Sbjct: 79 VFALRDLAAGETVLRVPLSLLLNVEHASAS-PLGGILDDFRLSDAEAMAFWLIYELTRPE 137
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S WLPY+ L QL + + E+ L SP R ++ ++ +
Sbjct: 138 RASPWLPYLESL--PASIKQLT----MFYDPFEMKRLQASPVAEFTSRRTVKMRNKFGKY 191
Query: 232 DTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQAFVAVQSC------VVHLQKVSLAR 280
+ + P + P E T + F A +AVQ V H R
Sbjct: 192 RE------QISKHRPAHLAEIEFPVELITVDDFLWA-MAVQFTRLITVQVKHPADGEWER 244
Query: 281 RFALVPLG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQ--PN 332
LVPL P + +C L + + RP G+ ++ + G + N
Sbjct: 245 TKCLVPLADLLNTAPADQINVECATNLDSTH--FECATIRPVAEGQELLTPYGGAEQLSN 302
Query: 333 SKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 386
+L+++YG +NP D + + E A+ + M R +L + V
Sbjct: 303 GQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQLPVLDHFE 362
Query: 387 GREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
KE +L + R+ YVS D S+++ V+ + ++P ER L+ L
Sbjct: 363 SIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHRAINPSNERRALELLLQLTNEM 417
Query: 444 LAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EKKMLNACLQVTADMIMLLPD 497
+ Y T+ EDE ML + + P +V R+ EK++L++ Q+ I LP+
Sbjct: 418 ILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEKRILSSLWQLLDSAIEALPE 475
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 86/387 (22%), Positives = 149/387 (38%), Gaps = 51/387 (13%)
Query: 49 RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
R +R S+ V G + E L W+ + G +V+ + N+
Sbjct: 12 RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
R V A L+ G A +P + + R + + L + + +AL
Sbjct: 72 RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180
Query: 224 ----IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHLQKVS 277
++ Y EL +F +Y + AF F+ AF ++S + L +
Sbjct: 181 YDAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGAN 234
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGES 322
+A LVP G L+ +SS A A + DR Y G
Sbjct: 235 VA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAE 289
Query: 323 IVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
I V P+ +SK ++YGF+D NP + ++ +D DK V + G
Sbjct: 290 IFVNYDPEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPT 349
Query: 382 FHVHAGREKEAISDMLPYLRLGYVSDT 408
F + + + ++ +LRL + DT
Sbjct: 350 FTLRPYSDPD--RELRTFLRLLHCKDT 374
>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
Length = 658
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 26/143 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
V+A ++ A ++PN L+++ +VL +E ++++ T+K +E CLALY
Sbjct: 48 VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
+ Y K QG KSFW PY+ +++ + W +L L E +
Sbjct: 107 IFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLIDEFMYIQ-- 155
Query: 224 IKREYNELDTVWFMAGSLFQQYP 246
+E+D W+ L +YP
Sbjct: 156 -----SEMDKSWYKFKGLMNKYP 173
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 96/229 (41%), Gaps = 7/229 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D V VV AG+ I GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPR 250
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
N +L++NYGF NP D +V P Y K Q L+V
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 160/377 (42%), Gaps = 24/377 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E I + + +L + L+L+ E+ + +
Sbjct: 84 LVALKDISRNDVILQVPKRLWINPDAVAASE-IGRVCS--ELKPWLSVILFLIRERSR-E 139
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 140 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 192
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + L A V
Sbjct: 193 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 252
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 253 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRH 310
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 311 AYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTD 368
Query: 410 E--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL
Sbjct: 369 AFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLD 427
Query: 465 PKKRVATQLVRMEKKML 481
+ +A + EK +L
Sbjct: 428 SRLAIAVGIREGEKMVL 444
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 116/251 (46%), Gaps = 38/251 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ED++AG+ SVP + ++T NE++ +L T+ LS LAL+L+ + K
Sbjct: 1 MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S+W PY L + S LL ++L S K E +++ + I +Y
Sbjct: 54 QSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY---- 102
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
V + +Q P DI T E FK A++ V + +H+ + + L P L
Sbjct: 103 -VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLD 156
Query: 292 -LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVD 343
L ++++ K ++ + R YK GE + + GP N +L YGFV
Sbjct: 157 FLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYGFVL 211
Query: 344 EDNPYDRLVVE 354
+N Y+ ++++
Sbjct: 212 NENIYNFVLLD 222
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 100/244 (40%), Gaps = 29/244 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L + ++ + L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G + + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDELI-- 282
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
L + + P E +T+E + A S + ++ + L+P+
Sbjct: 283 -----PLLSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 337
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y C +++ V RP GE + G +S LL YGF+ + D
Sbjct: 338 IYPHIVKYGKVCVET-----SSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGD 392
Query: 346 NPYD 349
NPYD
Sbjct: 393 NPYD 396
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/252 (22%), Positives = 105/252 (41%), Gaps = 18/252 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQG 171
+ ++ ++ + VP ++ ++ + + + + L++ L L+++YEK +
Sbjct: 767 IVTTKKVEENEVIIKVPRKFLINVQVAREHPILGRIFEEFSGLNDDTILFLFVIYEK-EN 825
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW P+ L + + ++ TEL L G+ AE L+ +K +
Sbjct: 826 PNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----VKEHLQSI 874
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ F L +QYP P F++E F A S + L K+ LVP+ +
Sbjct: 875 RDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCLVPMADMI 931
Query: 292 LAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ + + D ++V I + G N +L + YGFV +DNPYD
Sbjct: 932 NHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRELALYYGFVIQDNPYD 991
Query: 350 RLVVEAALNTED 361
+++ L ED
Sbjct: 992 SMLIGFDLPDED 1003
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 146/376 (38%), Gaps = 92/376 (24%)
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
A +P +L VT V +E +A L EL LAL+LM E+++G++S W P++ L
Sbjct: 2 ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59
Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
A SP+LW E + L SPT E R +++E++
Sbjct: 60 PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD------------- 99
Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL--ARRFA-------LVPLGPPLLA 293
V Q+++ ARRF+ + LG P
Sbjct: 100 ----------------------------VIAQRIATGDARRFSGGDELKLWITLGSP--G 129
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ +L A+ D +PN +L + G V++DN D L V
Sbjct: 130 WGGTSDKLLMAIYDG---------------------RPNGELAMATGRVEDDNASDCLTV 168
Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
L D + K+ + + G VQ F + R + +L YLRL ++D + +
Sbjct: 169 RVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR---MPTQLLAYLRLARLTDPALLAK 225
Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYP----ATLSEDEAMLTDYNLHPKKRV 469
V S ++P E VL L + RL Y + ED +L L ++R+
Sbjct: 226 V--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGMHMGSAEEDVKLLQRPGLTAQERL 283
Query: 470 ATQLVRMEKKMLNACL 485
A +L + EK +L L
Sbjct: 284 AARLRKAEKAILQGTL 299
>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
Length = 566
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 104/249 (41%), Gaps = 28/249 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 302
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
+L + +P P E +T E F A S + + R L+P+
Sbjct: 303 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 359
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y + + ++++ V +P GE + G +S L+ YGF+ + D
Sbjct: 360 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGD 414
Query: 346 NPYDRLVVE 354
N YD + +E
Sbjct: 415 NLYDTIPLE 423
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + ++ GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVIS 416
K V R G + VF +H + + +L +LR+ +++ S + + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352
Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
PVS E + L D L Y T+ ED+++L +L + +A +L
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412
Query: 477 EKKMLNACLQVTA 489
EK++L ++ A
Sbjct: 413 EKEILERAVRSAA 425
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 73/160 (45%), Gaps = 18/160 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
Length = 583
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 104/249 (41%), Gaps = 28/249 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 319
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
+L + +P P E +T E F A S + + R L+P+
Sbjct: 320 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 376
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y + + ++++ V +P GE + G +S L+ YGF+ + D
Sbjct: 377 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGD 431
Query: 346 NPYDRLVVE 354
N YD + +E
Sbjct: 432 NLYDTIPLE 440
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 75/333 (22%), Positives = 135/333 (40%), Gaps = 65/333 (19%)
Query: 97 LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
L++ SH EK + +Y A A+ D++ G+ VP ++TLE + G +
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213
Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
+ L +S + + L+ Y+M EK++ + S W YI L + P+ ++E E
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRKPE-SQWQIYIDILPKNFSN------FPIFFTEEER 266
Query: 206 AYLTGSPTKAEILERAEGIKREYN---------------ELDTVWFMAGSLFQQYPYDIP 250
+L GSP +ILE+ E IK +Y+ E + M S + I
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYDLICKEVPEYVQFPIREYSEIRMMVSSRI----FGIQ 322
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E + FVA + H + P K ++ A++D
Sbjct: 323 IEG----VKTDGFVAYADMLNHKR-----------PRQTSWTYTDEKQGFIIEAMEDI-- 365
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR-M 369
+ GE + G + NS+ +NYGF++ +N + + ++ +T+D Q K+ M
Sbjct: 366 -------QRGEQVYDSYGKKCNSRFFLNYGFINLNNDANEVPIKVYYHTDDQLKQVKQDM 418
Query: 370 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
+ + + F V E + + +LR
Sbjct: 419 IVDHS---EFKKFRVVENLEDRVMQEFFSWLRF 448
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 88/386 (22%), Positives = 156/386 (40%), Gaps = 56/386 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A D+Q G+ F +P +V+ ++ NE +A+ L L L + ++YE G+
Sbjct: 50 VVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADEL--KNLGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
+S W Y + L + ++ + WS EL+ L S + +I E+
Sbjct: 108 RSNWNQYFQVLPTK-------FDTLMFWSGEELSQLQASAVIHKIGKKDAEEDIFEKIIP 160
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEA--------------FTFEIFKQAFVAVQSC 269
+ R + +L F + Y D +A + F+I K +
Sbjct: 161 LVRSHPDL----FPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEGE 216
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
+L +VPL L A + + A L D A+ + +P K G+ I G
Sbjct: 217 DGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGE 276
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSV 379
P S LL YG+V DN VVE L N E +Y +++ + ++
Sbjct: 277 LPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILE 333
Query: 380 QVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 435
+ + +++++D+LP L + E+Q + S P P+ E +
Sbjct: 334 DGYCILRPSAEDSLTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI--- 390
Query: 436 LADYFKARLAGYPATLSEDEAMLTDY 461
L D K++L+ Y T+ +D+A+L +
Sbjct: 391 LLDSVKSKLSQYGTTVEQDKAILQQF 416
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 162/391 (41%), Gaps = 55/391 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
A+ +L++ FS+P SLV+++ +++ +E+ T + + CL MYE+
Sbjct: 39 ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNE 230
S W Y+ + +S + WS+ EL L GS +I E AEG Y+
Sbjct: 94 PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSI 144
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------------ 278
L +F+ P EA++ ++ + + S H+
Sbjct: 145 LVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAA 198
Query: 279 ---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
+VP+ L A S A L DA+ + + AGE I PN+ L
Sbjct: 199 YHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQIFNTYNDPPNADL 258
Query: 336 LINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI- 393
L YG VDE N D + + A L +D +R L ++ V+ + E +
Sbjct: 259 LRRYGHVDEVNLNDNVEISADLIGCKD---------LERVDWLLDRLDDVYTLTQAEDLP 309
Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
D + +++ + SE + + + P ++ A ++ + + RLA Y +T+ E
Sbjct: 310 EDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQMRLAQYSSTIEE 365
Query: 454 DEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 483
DE++L + + A LVR+ EK++L A
Sbjct: 366 DESLLASSTMLTSRSRAALLVRLGEKRILAA 396
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 106/470 (22%), Positives = 192/470 (40%), Gaps = 66/470 (14%)
Query: 83 SWMHKNGLPPC-KVILKEKPSHNEKHRP--IH---YVAASEDLQAGDAAFSVPNSLVVTL 136
+W+ NG K+ L+ + N R +H +A + FS+P L+++
Sbjct: 17 AWLRANGATVSPKLTLQATAAFNADSRTQVLHRRVIASAEAGFDKEEELFSIPRKLLLSA 76
Query: 137 ERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
+IAELL NK A L + +MYE K SFW PY+ L
Sbjct: 77 ----STSSIAELLLENKKEACALVGWMPLVVAMMYEI-TNKDSFWRPYLDLLPE------ 125
Query: 192 LAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKRE----YNELDTVWF----MAGSLF 242
+++P+ W++ +L L G+ T + + E AE I E + +L F +L+
Sbjct: 126 -TLDTPMFWNDDDLELLEGTSTLSHLGKEDAETIFTEQIVPFMKLHPTHFDLKVHNMALY 184
Query: 243 QQYPYDIPTEAFTFEIFKQAFV-------------AVQSCVVHLQKVSLARRFALVPLGP 289
+ I +F+ + + A C ++ + + A+VPL
Sbjct: 185 HRVASVIMAYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRM-EKIAMVPLAD 243
Query: 290 PLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+L + + C A L + + P AG + G NS+LL YGF+D+ N +
Sbjct: 244 -MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGHELYNTYGDLSNSELLRKYGFIDDVNEH 302
Query: 349 DRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----REKEAISDML---- 397
+ + + + E + ++ M A + FH+ A +E EA +L
Sbjct: 303 NSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQELEASIALLFQSP 362
Query: 398 -PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
L + D E++S +++L V+ C R V + L + + R Y T EDE
Sbjct: 363 KQVRALRALDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRAEEYTTTREEDEE 416
Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
L + +L ++++A ++ E+ +L+ + + + P + PAP
Sbjct: 417 RLKESDLTHRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEPAP 466
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 158/377 (41%), Gaps = 24/377 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E ++L + L+L+ E+ + +
Sbjct: 40 LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 95
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 96 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 148
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + L A V
Sbjct: 149 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 208
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 209 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRH 266
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 267 AYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTD 324
Query: 410 E--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL
Sbjct: 325 AFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLD 383
Query: 465 PKKRVATQLVRMEKKML 481
+ +A + EK +L
Sbjct: 384 SRLAIAVGIREGEKMVL 400
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 158/377 (41%), Gaps = 24/377 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E ++L + L+L+ E+ + +
Sbjct: 36 LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 92 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 144
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + L A V
Sbjct: 145 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 204
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 205 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRH 262
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 263 AYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTD 320
Query: 410 E--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL
Sbjct: 321 AFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLD 379
Query: 465 PKKRVATQLVRMEKKML 481
+ +A + EK +L
Sbjct: 380 SRLAIAVGIREGEKMVL 396
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/367 (23%), Positives = 153/367 (41%), Gaps = 46/367 (12%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 277
R + + EL + + + + + A + + A V+ ++ +H K+
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWCGPQPNSK 334
A L+ + ++ + L+ A D + +++V R + G +I + GP N
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDL 273
Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH--- 383
LL++YGFV DNP+DR+ L + ++ RM+A R G S QV
Sbjct: 274 LLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLKQ 329
Query: 384 -----------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPCM 428
V G +E +L LR+ + E + ++S + G VS
Sbjct: 330 LGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVSSDN 389
Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 488
E VL L + T+ EDEA L+D +L R+A Q +K+++ L+
Sbjct: 390 EERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLESL 449
Query: 489 ADMIMLL 495
+M L
Sbjct: 450 KKRLMDL 456
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 88/401 (21%), Positives = 163/401 (40%), Gaps = 50/401 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + + V +E + +AL+L+ EK + +
Sbjct: 80 LVAQRDISRNEVVLEIPKKLWINPDVVAASEIGN---VCGGVKPWVSVALFLIREKLK-E 135
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L + S + WSE ELA L G+ + L ++RE+ +++
Sbjct: 136 DSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYLRREFLKVE 188
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ Q +P + T + F AF ++S + + L+PL
Sbjct: 189 EEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR---GQNLVLIPLADLCN 238
Query: 289 -------------PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ- 330
P + Y K + + D L KAGE +++
Sbjct: 239 FLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSR-DLIFSLRSPISLKAGEQVLIQYNLNL 297
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH-VHAGRE 389
N++L ++YGF++ + + + ++ DP + DK +A+ NG + F V
Sbjct: 298 SNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIADFDIVLGNPL 357
Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAG 446
+ L + LG +D+ ++S+ + G + PVS E + + D K+ L+G
Sbjct: 358 PPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVVRDACKSALSG 416
Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 487
Y T+ EDE L L+P+ +A + EKK+L ++
Sbjct: 417 YHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 81/396 (20%), Positives = 158/396 (39%), Gaps = 56/396 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
+ A+ +L+ G+ VP ++T E ++ + +++ + N LS L++ L+YE +
Sbjct: 51 LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110
Query: 171 GKKSFWLPYI----RELDRQRGRGQ-----LAVESPLLWSETELAYLTGSPTKAEILERA 221
KKSFW PY+ R+ D G L VE + +E A +A L +
Sbjct: 111 EKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKE 170
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+K ++ W A + +P ++ V L
Sbjct: 171 LELKPKFRSFQ-AWLWASATISSRTLHVPWDS----------AGCLCPVGDLFNYDAPGD 219
Query: 282 FALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGP 329
++ P GP L+ + + ++ V L R Y+ GE +++ G
Sbjct: 220 YSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGT 279
Query: 330 QPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
N +LL +YGF+ E+N D++ + E +L + + + ++GKLS
Sbjct: 280 YTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFA------- 332
Query: 388 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 447
++ LRL + + +SV+ + +S E V+ +++ + L
Sbjct: 333 --------LISTLRLWLIPQSQRDKSVMRLVYAGSQISVKNEILVMKWMSEKCGSVLRDL 384
Query: 448 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
P +++ED + LH ++ +R+E+K A
Sbjct: 385 PTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415
>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
Length = 689
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
+ A +D+ A +PNS ++++ RV + + ++L+ ++ ++ CLA++LM
Sbjct: 74 IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
E QG++SFW PYI ++ ES LL W + E+ L + E +
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183
Query: 224 IKREYNELDTV 234
I+ E+N+L +
Sbjct: 184 IEDEWNKLSKI 194
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/254 (23%), Positives = 109/254 (42%), Gaps = 32/254 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYE-KK 169
V A ED++ + F VP + ++ +E ++ E+ + + L + L++E K
Sbjct: 41 VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
G+KS W PY++ L ++ QL + W++ EL L G+ E+ E
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENV 155
Query: 222 EGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
I K E D+ W + S+ Y +D+ + + E K+ +
Sbjct: 156 VDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDN 211
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
+ V + ++PL L + + KC A L + +++ + K GE I G P
Sbjct: 212 ESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQIYNIYGDHP 269
Query: 332 NSKLLINYGFVDED 345
NS++L YG+++ D
Sbjct: 270 NSEILRRYGYIEPD 283
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 119/288 (41%), Gaps = 33/288 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT-------LERVLGNETIAELLTTNKLSE--LACLALY 163
V SE+L S P SL +T L+R+LG A+L N LSE L C L
Sbjct: 3 VHTSEELPPDAPVISAPFSLAITPTVAADALQRILGPG--ADL---NSLSERELVCTYLA 57
Query: 164 LMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
+ + K+ G + L + +D R QL +PL ++ ELA L G+ A +R
Sbjct: 58 MHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDR 115
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+ E V G Y + + F ++ + ++ + +
Sbjct: 116 EADWRSECERCRAVLGHWGEHLTWEHYLTASTHLSSRAFPSTLLSPEPALI----PTPSS 171
Query: 281 RFALVPLGPPL-LAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLL 336
LVPL L A + ++ D+ +V P AG ++ GP+PN++L+
Sbjct: 172 HPVLVPLIDSLNHARAHPVSWSVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELV 231
Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 384
+ YGF DNP D LV++ + D+R + + G L +F V
Sbjct: 232 LGYGFALPDNPDDTLVLKVS------GAADRREIWRAGGGLQRILFDV 273
>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 488
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 90/429 (20%), Positives = 168/429 (39%), Gaps = 67/429 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKRE- 227
G S W PY + L ++ + W++ EL L GS I + A E R+
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158
Query: 228 -------------------YNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVKNDEAEEVEGEGG 218
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 219 YVTDD------EERQLPK--GMIPLADLLNADADRNNACLFQEDGYLAMKSIKSIRKGEE 270
Query: 323 IVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRN 374
I G P ++LL YG+V ++ P + A L + P + R+ +
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDNYAQYDEAEVPIQTICKVAGLKSSTPGPDEPRLEFLDD 330
Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV-SPCMERAVL 433
++ + + +++ LP L ++ + L V P + A
Sbjct: 331 LEVLDDGYGIPRPDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSKVPKPALGIAEA 390
Query: 434 DQLADYFKARLAGYPATLSEDEAML-----TDYNLHPKK----RVATQLVRMEKKMLNAC 484
L + + L YP T+++D+ +L + PK ++A Q+ + EK++LNA
Sbjct: 391 TLLDEVVRLILGEYPTTVAQDKELLASCANNQGSTSPKSAGRLKMALQVRKGEKEILNAV 450
Query: 485 LQVTADMIM 493
L D I+
Sbjct: 451 LSELEDFIV 459
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 119/296 (40%), Gaps = 62/296 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FS+P ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------------ 217
QG+KS W PY R L + ++ + WS EL L S +I
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQASTIVEKIGRSNAEESIRDS 158
Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFE 257
L EGI + L V + GSL Y +DI + E
Sbjct: 159 IAPILAKRPDLFPPPPGLASWEGIAGDA-ALIQVGHVMGSLIMAYAFDIEKAEDDDDEGE 217
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
+ ++++ L K +VPL L A + + A L + A+ + +P
Sbjct: 218 VNDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRL------VVEAA-LNTEDPQYQ 365
+ G+ I G P + LL YG+V ++ YD L + EAA L DP+ Q
Sbjct: 271 QKGDEIFNDYGEIPRADLLRRYGYVTDNYAVYDVLELSLETICEAAGLANADPESQ 326
>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
Length = 660
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 91/428 (21%), Positives = 175/428 (40%), Gaps = 78/428 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
+ A+ D+ A F++P + ++ +E R+L G AE L A L L
Sbjct: 41 IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++ E G++S+W PYI L + ++P+ W++ EL L G+ AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153
Query: 223 ------------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
I +L + GS Y +D+ + +
Sbjct: 154 SDEMLRTHVLPIVTQNPTAFCPKGAIPLNEEDLLALAHRIGSTIMSYAFDLDDDKEESDA 213
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD---R 315
++ +V + + L +VP+ L A + A V+ +L V
Sbjct: 214 EEEGWVEDRDGLTML---------GMVPMADVLNANAD----FNAHVNHGEKLEVTSLRS 260
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYD---------RLVVEAALNTEDPQYQ 365
+AG I+ + GP P+S+LL YG+V E + YD R + A L D Q
Sbjct: 261 DIRAGTEILNYYGPLPSSELLRRYGYVTPEHHRYDVAEVSWELVRSTLVAHLELSDGILQ 320
Query: 366 DKRMVAQRNGKLSVQVFHV---HAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSLGPI 421
V + G+ ++ + V +G + + P ++ SE +++++ +L
Sbjct: 321 ---AVETQLGEDELEDYFVLERDSGEPSDEGRLVQPPQTCEVPTELSEQLKTILKALKKQ 377
Query: 422 CP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
P + + R + + + +L+ Y ++ EDE +L + +L + R+A ++ E
Sbjct: 378 QPELIGSAVRRDEILHAVIGEALNRKLSEYATSVEEDEELLENSSLTKRHRLAIEVRLGE 437
Query: 478 KKMLNACL 485
K++L+ L
Sbjct: 438 KRLLHELL 445
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 103/243 (42%), Gaps = 22/243 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE + G+ A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ L +P + T++ F A S +V L L+ LVP+
Sbjct: 294 PL------LCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLST--CLVPVAGL 345
Query: 291 LLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DN 346
L S VD+A ++ + RP AGE + G P S L+ YGF+ DN
Sbjct: 346 LNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDN 405
Query: 347 PYD 349
PYD
Sbjct: 406 PYD 408
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 81/190 (42%), Gaps = 25/190 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
W NG + L+ P + AA +D+Q G VP +++T +G
Sbjct: 11 WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61
Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+ LL + K E L+++L+ E G SFW PYI L R + + L
Sbjct: 62 KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+S EL LTGS E L+ I R+Y + F L + PY + FT+++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDL 170
Query: 259 FKQAFVAVQS 268
++ A AV +
Sbjct: 171 YRWAVSAVMT 180
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 119/294 (40%), Gaps = 61/294 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + IA L L + +MYE+ +GK
Sbjct: 10 VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L Q P LWSE E+ L G+ + E +K ++ E
Sbjct: 66 GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 117
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
L ++ P + P + FTFE +++A +S V ++ + +VPL
Sbjct: 118 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 167
Query: 291 LLAYSSKCKA-----MLAA-------------VDDA---------------VQLVVDRPY 317
++ K A ML A +DD +++V+ +
Sbjct: 168 --LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKDV 225
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 369
AG I G N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 226 AAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKKRV 279
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 81/408 (19%), Positives = 159/408 (38%), Gaps = 41/408 (10%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELA 158
+E HR + A++ ++ G+ +P + ++LE V L N + ++ K + +
Sbjct: 63 QSEGHRTLR---ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQI--NYKPNNIQ 117
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
+ + ++ + K SFW PYI L + P + + A L GSPT ++
Sbjct: 118 TYFVNHLLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVI 171
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+ + K EY L A FQ+Y Y T++ F + + S +Q
Sbjct: 172 NQRKVFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEK 220
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLL 336
++ LVPL + + + DA + R + GE + G N
Sbjct: 221 EQQQLLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFF 279
Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAI 393
+NYGF NP ++ ++ LN D + K + + N G V R+ A
Sbjct: 280 MNYGFASLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLA- 338
Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGY 447
+ + ++ + D +++ + + P + +E+A L L +
Sbjct: 339 --TVRFTQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNF 396
Query: 448 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
+T+ +DE L D ++ L EK+++ + + M+ ++
Sbjct: 397 ASTIEDDERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 106/244 (43%), Gaps = 28/244 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S++++ E V ++ L + +S L L+ M E+
Sbjct: 190 ATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERHNCNSK 249
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
+ Y L ++ G ++ S+ L + EI++ E ++ +Y+EL
Sbjct: 250 SKI-YFDTLPKEFNTGLSFGVDAIMASDGTLLF-------DEIMQAKEHLRVQYDEL--- 298
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
+L YP P E +T+E F A S + ++ + R L+P
Sbjct: 299 ---VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKFLDGKLRTCLIPIAGFLNHS 355
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
L P ++ Y + ++ + ++ + RP + GE + G + L+ YGF+ + D
Sbjct: 356 LHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSYGNFSGAHLITFYGFLPQGD 410
Query: 346 NPYD 349
N YD
Sbjct: 411 NRYD 414
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 155/386 (40%), Gaps = 55/386 (14%)
Query: 48 RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
RR+ R+ +S + D L W+ NG V + + +E
Sbjct: 24 RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83
Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
A+ L G A VP SL +T E + ++ + + L L+ LA
Sbjct: 84 GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E+++G++S + Y++ L + SPL W+ EL+ + G+ ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192
Query: 222 EG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
G ++ Y L T +F ++ +D+ AF+ + F+ AF ++S
Sbjct: 193 AGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMA 242
Query: 276 VSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPY 317
ALVP G L+ +SS +A + AV DD V V DR
Sbjct: 243 PCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRAL 301
Query: 318 KAGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
E + V P+ ++ +++GFVD P + ++ +DP DK V G
Sbjct: 302 NVNEPLYVNYNPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKLDVLDVCGL 361
Query: 377 LSVQVFHVHAGREKEAISDMLPYLRL 402
F + A + + D+ +LRL
Sbjct: 362 GETPTFTLRAYSDPD--PDLRTFLRL 385
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 143/339 (42%), Gaps = 55/339 (16%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASED 118
D +V+ + +VV+ K+W+ +NG P E+ ++ V AS+D
Sbjct: 8 DGIVSANGDVVA--------FKNWLAENGAEFHPHAAFRTERSGYS--------VIASQD 51
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSE--LAC--LALYLMYEKKQ 170
L++ S P SL +T E + + LL T SE L C + ++ + +
Sbjct: 52 LRSDTTVVSCPFSLAITPE--VSKNALTTLLGPTFTGQSWSERQLICSYICMHWILDPSA 109
Query: 171 GKKSFWLPYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ PYIR L D+ R +PL +S+TEL L GS L+R + E
Sbjct: 110 SSELAHWPYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSE 161
Query: 228 YNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
+ + + TV G F Y + + F ++ +V S +
Sbjct: 162 WEQCQKTIATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYP 216
Query: 284 LVPLGPPLLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
++ G L +S S ++ + D + + LV+ + AG ++ GP+PN++L++
Sbjct: 217 VLLPGIDSLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELIL 276
Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
YGF +NP D +V++ N+ Q K V RN +
Sbjct: 277 GYGFSLPENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 103/429 (24%), Positives = 166/429 (38%), Gaps = 72/429 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +D+ + FS+P S ++ +E + + I T L L L ++YE G
Sbjct: 40 VVAKQDIAEHELLFSIPRSSILGVENSILSTEIPPA-TFAHLGPWLSLILIMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E ELA L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
ERA+ +E + + M GSL Y +D+ A + + +
Sbjct: 152 VIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
A + L K +VPL L A +C A L D +++ +P +AG+ I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEI 263
Query: 324 VVWCGPQPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTEDPQYQDKRMVA 371
GP P S LL YG++ D YD L + + E +Y D++ +
Sbjct: 264 FNDYGPLPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEERIEYLDEQEIV 323
Query: 372 QRNGKLSVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
++ + F + +E++S L L + E + + S G + P + M
Sbjct: 324 DTGYDIAASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKGRL-PKAEKMTG 376
Query: 431 AVLDQLADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQLVRM-EKKMLNA 483
L +AR+A YP TL +D E ++RVA + VR+ EKK+L
Sbjct: 377 KAAKFLYKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARAVRIGEKKLLVQ 436
Query: 484 CLQVTADMI 492
+ AD I
Sbjct: 437 TEERLADKI 445
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 21/159 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ +P S ++T +I+ L K+ + ++ L+YE G
Sbjct: 59 VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ L L+V+ P+LW++ +L L G+ + + E E + YN+
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK-- 165
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
++ L +P F+ + FK+A SC+V
Sbjct: 166 ---YIKSKLIANHPDVFNEHVFSLDNFKRA-----SCLV 196
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 117/283 (41%), Gaps = 57/283 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FSVP ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA---EGIKR 226
QG+KS W PY R L + ++ + WS EL L S T E + R+ E I+
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157
Query: 227 ----------------------EYNELDT----VWFMAGSLFQQYPYDI---PTEAFTFE 257
E + D V + GSL Y +DI + E
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
++++ L K +VPL L A + + A L + A+ + +P
Sbjct: 218 ANDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNT 359
+ GE I G P + LL YG+V ++ YD V+E +L T
Sbjct: 271 QQGEEIFNDYGEIPRADLLRRYGYVTDNYAVYD--VLELSLET 311
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 107/270 (39%), Gaps = 55/270 (20%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
+ A D+ A F++P +S++ T L NE + E T++
Sbjct: 49 IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L L L+YE QG S W PY+ L A ++P+ WS TELA L S
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161
Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
++ E A+ + R D V+F G Q+ D FE+ + A+ +
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFELAHRMGSAIMAY 214
Query: 270 VVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
L+K A +VP+ +L ++ A + D++
Sbjct: 215 AFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAYINHGADSLTAT 273
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
R KAGE I+ + GP PN +LL YG+V
Sbjct: 274 ALRTIKAGEEILNYYGPLPNGELLRRYGYV 303
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 155/392 (39%), Gaps = 62/392 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ + F++P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILER 220
G +S W Y + L R+ ++ + W+ +EL L GS ILE
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADESILEM 157
Query: 221 AEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAFTFEIF 259
I R + L + +A GSL Y +DI P +
Sbjct: 158 IAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEDEEGDG 217
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ ++ + ++ L++ +VPL L A + + A L ++A+ + +P
Sbjct: 218 EGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAIKPIAK 269
Query: 320 GESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQDKRMVA 371
GE I G P + LL YG+V D PYD + V A L+ DP+ Q
Sbjct: 270 GEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGLSDSDPEKQPPLEFL 329
Query: 372 QRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
L +E + ++D+LP L + + T + + P P
Sbjct: 330 DELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQQRSKNKPPKPAFAE 389
Query: 431 AVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
A LA + + A Y T+++D+ +L+ N
Sbjct: 390 AEATILAKAIQLKQAQYATTIAQDQEILSQLN 421
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 91/200 (45%), Gaps = 23/200 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVS 277
+K EY+ +A S+ + Y + E++T++IF A+ +A ++ +
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIASRAFTSRVLDAY 214
Query: 278 LARRFAL-----VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
L+ +L + PL+ +S+ K A ++L V P GE + GP
Sbjct: 215 LSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEVHNNYGP 274
Query: 330 QPNSKLLINYGFVDEDNPYD 349
N +L+ YGF DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 94/423 (22%), Positives = 166/423 (39%), Gaps = 68/423 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ F++P ++ T+E + + L + + LS LA+Y+++ +
Sbjct: 52 FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + +++ EL GS A I+ +Y L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
L Q+ P + FT E +K A V S + + + L FA
Sbjct: 161 VV------RLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA--- 211
Query: 287 LGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES-----IVVWCGPQPNSKLLINY 339
+L +SS+ + A + + ++ + Y+AG+ + ++ G PN++LL Y
Sbjct: 212 ---DMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLY 268
Query: 340 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
GFV NP D + + P ++ KR + G S + ++D LP
Sbjct: 269 GFVMPGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISI-------TLTDPLPK 321
Query: 400 LRLGYV----SDTSEMQSVISSLGPICP----VSPCMERAVLDQLADYFKARLAGYPATL 451
LGY+ SD S++ S+ I P +S E VL L + F L + L
Sbjct: 322 NVLGYLRIQRSDESDLASIARQ--RIDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQL 379
Query: 452 SEDEAMLTDYNLHPKKR---VATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-----PC 503
E L + ++P + A + E+++L + DM+ + + + P
Sbjct: 380 ESLEKQLAE-GVYPSRGNAWAAAHVSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPA 438
Query: 504 PAP 506
PAP
Sbjct: 439 PAP 441
>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
Length = 543
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 543
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 83/203 (40%), Gaps = 32/203 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE QG+ S W PY+ L ++P+ WS TEL+ L S A++
Sbjct: 109 LILVLIYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-G 160
Query: 220 RAEG----------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIF 259
RAE + R + E V+F G L + I AF E
Sbjct: 161 RAEADRMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLEND 217
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
++ +VP+ +L ++ A + DDA+ RP +A
Sbjct: 218 DSDNDEADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRA 276
Query: 320 GESIVVWCGPQPNSKLLINYGFV 342
G+ I+ + GP PN +LL YG+V
Sbjct: 277 GDEILNYYGPLPNGELLRRYGYV 299
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 88/198 (44%), Gaps = 22/198 (11%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+ L + +M+E +G +S W PY+ L + E+P+ WS EL L S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595
Query: 218 LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 264
+A+ + + ++ V +F Q Y D + ++ F+ +
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ ++ +VP+ +L ++ A + DDA+ + R KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713
Query: 325 VWCGPQPNSKLLINYGFV 342
+ GP PNS+LL YG+V
Sbjct: 714 NYYGPHPNSELLRRYGYV 731
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 14/193 (7%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 90 LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147
Query: 222 EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
+ +KRE+ E + + L + Y+ F+ F+ + V ++ L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207
Query: 274 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
+ VS+ L+PL G +A + + A A QL V R ++ G+ I P
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSP 267
Query: 330 QPNSKLLINYGFV 342
+ N++LL+ YGF+
Sbjct: 268 KTNAELLLGYGFM 280
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 93/392 (23%), Positives = 147/392 (37%), Gaps = 67/392 (17%)
Query: 81 LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
LKS+M HK K LKE+ + RP + + ++ LQAGD S+
Sbjct: 17 LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76
Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
P ++T L LG + L L L C L + EK
Sbjct: 77 PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G+KS W PY+ L + A P E ++ L P + + E+ I+ +
Sbjct: 137 AGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQELFQ 188
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--- 286
+ LF + +I F F + A+ V + +++ K F+L P
Sbjct: 189 SSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEPDVY 243
Query: 287 -LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
L P LL +S + KA + ++ D K + +++ GP N +LL+ YGF
Sbjct: 244 ALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICYGPHDNQRLLLEYGF 303
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
V DNP+ + V A + DK QR K+S+ H D L L
Sbjct: 304 VATDNPHSSVYVSADTLLKYFSSLDK----QREAKVSILKDH-----------DFLENLT 348
Query: 402 LGYVSDTSEMQSVISSLGPICPVSPCMERAVL 433
G+ + + + + L C R +L
Sbjct: 349 FGWEGPSWRLLTALKVLSLAADEFACWRRILL 380
>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 61/283 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I + EG +
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKIGK--EGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT----EAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
D++ M + + P P+ ++ E A + + + L +A F
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGLAHIMGSL---IMAYAFDIE 206
Query: 283 ---------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+VPL L A + + A L +D++ + +
Sbjct: 207 KAEDEDDEDNDEEEGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEEDSLVMKAIK 266
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
P AGE I G P + LL YG+V DN VVE +L+
Sbjct: 267 PIHAGEEIFNDYGELPRADLLRRYGYV-TDNYAHYDVVELSLD 308
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 110/255 (43%), Gaps = 34/255 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
V L GD ++P SL++T VL + I + L +LS L ++L+ E+
Sbjct: 8 VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G SFW PY+ L + L W+ E+ L TK + + +N
Sbjct: 67 KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA----RRF 282
L + L +Q P AFT+++FK A+ +V + V++ Q L+ +
Sbjct: 119 RLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKS 174
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQPNSKLL 336
AL P LL ++ + A DD+ L +PY + + + GP N KLL
Sbjct: 175 ALAPFL-DLLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHSNEKLL 229
Query: 337 INYGFVDEDNPYDRL 351
+ YGF NP++ +
Sbjct: 230 LEYGFTLPCNPHNNI 244
>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 536
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 118/273 (43%), Gaps = 29/273 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
+Q G S+P S ++T VL + + L + K S L L ++L+ E+ +G+ S W
Sbjct: 68 IQPGGMLVSLPESCLLTTSTVL-HSYLGPFLKSWKPRPSSLVALCVFLVCERHRGEASDW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDTVW 235
PYI L + P +++T +A L +G +AE E+ EG++ Y + +
Sbjct: 127 FPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQHLY-AVHQDF 176
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLARRFALVPLGPP 290
FM+ +P P E T+E + A+ ++ + V + + +S +AL P
Sbjct: 177 FMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPDNYALAPF-LD 232
Query: 291 LLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
LL + + KA ++ + + G N +LL+ YGFV NP+
Sbjct: 233 LLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINYGSHDNQRLLLEYGFVSSCNPH 292
Query: 349 DRLVVEAALNTE----DPQYQDKRMVAQRNGKL 377
+ VE L E D +K + NG L
Sbjct: 293 SVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 29/230 (12%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEIL 218
+LM + + ++ FW PYI+ L G + +PLL+ E +LA+L + A
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 268
R + K Y + A S+ Q Y +D+ A T I +AF A + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ LQ R ++ L P + A + K K A D++ LVV +AG+ +
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNN 302
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
GP+ N +L++NYGF DNP + VV + P Q K Q K
Sbjct: 303 YGPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 27/203 (13%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + QG+ FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 97 FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151
Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 270
E +K Y + V + + FQ +Y +D I + AF+ ++ +AF +
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+ VS+ L+P L+ + K A V +V AG+ I GP+
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPR 260
Query: 331 PNSKLLINYGFVDEDNPYDRLVV 353
N +L++NYGF DNP D +V
Sbjct: 261 NNEQLMMNYGFCLPDNPCDYRIV 283
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 7/133 (5%)
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 314
E FK +F + S +V L S+ + ALVP +L +S + L + D
Sbjct: 2 ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQ 372
RPY+ GE + + G + N +LL++YGFV + NP D + + +L D Y++K + +
Sbjct: 59 RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLK 118
Query: 373 RNGKLSVQVFHVH 385
+ G Q F +
Sbjct: 119 KYGLSGSQCFPIR 131
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 94/477 (19%), Positives = 179/477 (37%), Gaps = 132/477 (27%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ +++DL+ + +P +++++ +I+ +LT + A+ L+YE G+
Sbjct: 82 IISNKDLKVNNIVAKIPKDIILSIHT----SSISNILTKYTMERNIATAIALIYEASIGE 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W YI L L V+ P+LW + L G+ + I + I Y ++
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADI- 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--ALVPLG-- 288
+ L + +P E F+ EIF + + +V + + +LVPL
Sbjct: 190 ----VESLLIKNHP-----EYFSKEIFSFENFKIANSIVSSRAFCIDSYHGDSLVPLADI 240
Query: 289 ---------------------------------PPLLAYSSK------------------ 297
PL+ S+K
Sbjct: 241 FNHKTGRENVHIESNGDVCNKCGSIKTCKHRKVTPLITKSAKSYKKLTNKKKMELIEKQK 300
Query: 298 ---------CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
C + D+ + + V + KA + + G N+ LL YGF++ DNP
Sbjct: 301 QQQINDEENCGDIAEEDDEHLYIKVVKAVKANQEVYNTYGDHSNATLLSKYGFIEMDNPC 360
Query: 349 DRLVVEAAL-----------NTEDPQYQDKRM--------VAQRNGKLSVQVFHVHAGRE 389
D L VE +L N D KR+ + RN S+++ +GR
Sbjct: 361 DNLPVEKSLVDTNLISLCKENGFDSNELSKRISFYASLFDIDSRNTH-SIEI----SGRL 415
Query: 390 KEA-----------ISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCMERAVLDQL 436
+A +S+ +L++ +++ L I + +++A++ L
Sbjct: 416 DDALVCSVGIALAPLSEFEGWLKMS----EHKLEKYFEKLEAEDIVKQNAQVKKAIVQIL 471
Query: 437 ADYFKARLAGYPATLSEDEAMLTDY--NLHPKKRVATQLVRMEKKMLNACLQVTADM 491
+ +L+ YP TL +D+ L + N +K ++T L EKK++ ++ D+
Sbjct: 472 NN----KLSNYPTTLEQDQNKLKELKENEENRKIISTSLNICEKKLIYKSIKYYEDL 524
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 21/200 (10%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V G+R + + +L W+H G+ I + S + V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427
Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
G +P L++ + L + I +L+ ++ + L LY+++EK G S W
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
P+ L SPLL+ TEL L G+ E E +G++ + L
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHESL------ 534
Query: 238 AGSLFQQYPYDIPTEAFTFE 257
G L + YP PT+AFT+E
Sbjct: 535 -GPLAEAYPALFPTDAFTYE 553
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 20/176 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +DL+ + +P V++ + +IA +L +L E ++ LMYE +G
Sbjct: 44 IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ + ++ P+LW + + YL G+ + ++E E ++ +Y E
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + +P FT E FK A V S ++ + +LVPL
Sbjct: 151 ----DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SLVPLA 199
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 117/271 (43%), Gaps = 37/271 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L T+ LS LA+Y+++ + + K +
Sbjct: 34 FKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFVRSR-KSGYD 92
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
P R +A S + + E +L G+ + + I+ +Y L
Sbjct: 93 GP----------RNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRIEDDYRGL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
V M G QYP P + FT E +K A V S + + + L FA
Sbjct: 143 --VVRMLG----QYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193
Query: 287 LGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+L +SS+ K + A + ++ + Y+AG+ + + GP PN++LL YGFV
Sbjct: 194 ---DMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVP 250
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
NP D + A + P ++ K+ + G
Sbjct: 251 GNPNDSYDLVLATHPMAPFFKQKQKLWASAG 281
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 92/428 (21%), Positives = 165/428 (38%), Gaps = 67/428 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--------TKAEILERA- 221
G S W PY + L ++ + W++ EL L GS T E+ R
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSTAEEVFLRDL 158
Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
G YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLSMKSIKSIRKGEE 270
Query: 323 IVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQDKRMVAQRN 374
I G P ++LL YG+V D YD V A L + P + R+ +
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPGPDEPRLEFLDD 330
Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV-SPCMERAVL 433
++ + + +++ LP L ++ + L V P + A
Sbjct: 331 LEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSKVPKPALGIAEA 390
Query: 434 DQLADYFKARLAGYPATLSEDEAML---TDYNLHP------KKRVATQLVRMEKKMLNAC 484
L + + L YP T+++D+ +L +Y + ++A Q+ + EK++LNA
Sbjct: 391 TLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQVRKGEKEILNAV 450
Query: 485 LQVTADMI 492
L D I
Sbjct: 451 LSELEDFI 458
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/311 (20%), Positives = 124/311 (39%), Gaps = 35/311 (11%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE+ +L W+ + G KV + + + A+ + G+ +P + ++
Sbjct: 24 EEEYDELVDWLKQCGATVDKVAVDHFNGMGQG------LKATAEAAPGETLLRIPEACML 77
Query: 135 --------TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
TL + ++T+ +L+ L+ + + + SFW PYI L
Sbjct: 78 SEESARRSTLGAYMDSDTMLKLMPNVTLA-------FHLLLELHDLDSFWRPYIACL--- 127
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQY 245
++ PL W +L L GS E + + + R+Y L + + A +
Sbjct: 128 ----PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCF 183
Query: 246 PY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVPLGPPLLAYSSKCK 299
P + EAFTFE ++ A V + + + + AL+PL + +
Sbjct: 184 PLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS 243
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
+ + ++ V P K G I +W G + N + L++ GF + D + V +L+
Sbjct: 244 TQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGFFFAGHANDYVNVPFSLDE 303
Query: 360 EDPQYQDKRMV 370
D Y+ K ++
Sbjct: 304 TDSLYKIKALL 314
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/399 (22%), Positives = 160/399 (40%), Gaps = 58/399 (14%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
AG+ +VP S+ + + EL+ +SE LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168
Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
Y+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVPL----- 287
F Q D + FT+++F+ A V + V+L + + AL+PL
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMAN 272
Query: 288 -GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQPNSKLLINYGFV 342
P A ++C A A ++ ++ + R A I + G + +++ L++ GFV
Sbjct: 273 HAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRTDAEFLVHNGFV 332
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP---- 398
NP+ + L P Y+++ + + G + F RE A + P
Sbjct: 333 CPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAATTTTPISQE 392
Query: 399 YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVLDQLADYFKARL 444
+ L VS + + + + + P C ER LA K L
Sbjct: 393 LISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER----WLATVMKIML 448
Query: 445 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 449 LRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486
>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
1015]
Length = 668
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 250
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
N +L++NYGF NP D +V P Y K
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287
>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
[Ostreococcus tauri]
Length = 446
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 99/406 (24%), Positives = 162/406 (39%), Gaps = 53/406 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL-ACLALYLMYEK-KQ 170
VA + D+ G+ +VP V+ + T+ L+ + L LA +++ E
Sbjct: 25 VATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLDGILAAHVLREAFGL 84
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G++S + P++R L + ++ + W E EL L GS A + + EY+
Sbjct: 85 GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 282
LD F D P EAF TFE F A V S + L+ S +
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185
Query: 283 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
++ + P+L A S K A +AV++ +K + +P+ L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245
Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
YGF+ E NP + + V L+ D + K + +R+G + R + D+L
Sbjct: 246 QYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKVRGLD--YDLL 303
Query: 398 PYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
R+ D SE+ S S + + +AVL + L GY TL
Sbjct: 304 AAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLITSLDGYGTTLG 359
Query: 453 EDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 493
ED + + +N PKKR L+RM +K L +AD +
Sbjct: 360 EDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402
>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 699
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 276
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
N +L++NYGF NP D +V P Y K
Sbjct: 277 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 146/370 (39%), Gaps = 53/370 (14%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ L L L ++YE QG+ S W Y+ L + A E+P+ W+ EL L G
Sbjct: 100 SEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEG 152
Query: 211 SPTKAEILERAEGIK--RE--------------------YNELD--TVWFMAGSLFQQYP 246
+ E + + E + RE NE D ++ GS Y
Sbjct: 153 TSLTTEKIGKKESDRMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
+D+ E E + ++ + SL +VP+ +L +++ A + D
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGD 262
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTE 360
+ AG I+ + GP P+S+LL YG+V E + YD +V AL E
Sbjct: 263 QLQVTSLRESIPAGSEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEE 322
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHV---HAGREKEAISDML--PYLRLGYVSDTSEMQSVI 415
+D +R + ++ F V AG E + + P LR + ++ +
Sbjct: 323 LKLSEDTIADIERKLESELEEFFVIERDAG-EPSSYGTLTQPPVLREISTELEEQTKAFL 381
Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
+L P E L + RL YP + +DE++L+ L + R+A ++
Sbjct: 382 KALKKRDPKRKRSETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRL 441
Query: 476 MEKKMLNACL 485
EK++L L
Sbjct: 442 GEKRLLQEAL 451
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/428 (21%), Positives = 164/428 (38%), Gaps = 67/428 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
G S W PY + L ++ + W++ EL L GS I + A
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158
Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
G YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLAMKSIKSIRKGEE 270
Query: 323 IVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQDKRMVAQRN 374
I G P ++LL YG+V D YD V A L + P + R+ +
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPGPDEPRLEFLDD 330
Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV-SPCMERAVL 433
++ + + +++ LP L ++ + L V P + A
Sbjct: 331 LEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSKVPKPALGIAEA 390
Query: 434 DQLADYFKARLAGYPATLSEDEAML---TDYNLHP------KKRVATQLVRMEKKMLNAC 484
L + + L YP T+++D+ +L +Y + ++A Q+ + EK++LNA
Sbjct: 391 TLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQVRKGEKEILNAV 450
Query: 485 LQVTADMI 492
L D I
Sbjct: 451 LSELEDFI 458
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 124/298 (41%), Gaps = 30/298 (10%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 255
P+ W+E E+ L GS ++ ER + I+ +Y G + YP P + T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVD 314
E FK A + V S L L R ALVP L Y + K +
Sbjct: 259 LEEFKWARMCVCSRNFGLDINGL-RTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTL 317
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKR 368
G + G + N + L+NYGF E+N + + + L+ DP Q K
Sbjct: 318 HRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKA 377
Query: 369 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVS 425
+ +G +V + G + +A+ L LR+ V+D +EM + + ++ + P+S
Sbjct: 378 RFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLS 434
Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 481
E A +++L L YP TL ED A L + L P +R A V EK +L
Sbjct: 435 VRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
LL +SS+ +++ +++V ++ + GE++V+ GP N LL++YGFV NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177
Query: 351 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 393
LV + +++ +DP ++ + N G S Q+ V G +
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235
Query: 394 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
+L +R+ + D E+ V + + P+ ER + L LA +P
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295
Query: 450 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
+ ED++ L ++ R+A Q ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327
>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
Length = 762
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 117/305 (38%), Gaps = 60/305 (19%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
GS VV+ + +W+ G K+ +K H R + AA E G+
Sbjct: 11 GSSAVVTS------EFVAWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETL 57
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPY 179
SVP +L++T+++ +A L + + LAL+L ++ + W PY
Sbjct: 58 LSVPEALLLTVDKASRRADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPY 113
Query: 180 IRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG 239
+ R P W + A L GSP +++ R + I+R+ L
Sbjct: 114 RNVISRS------VSHLPCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDR 167
Query: 240 SLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQK-VSLA-RRFALVPLGPPLLAYS 295
F + + AF F + F + + H ++ V A R A V
Sbjct: 168 QAFAFAEAQVLSRAFAFNGTRAMVPFADLMNTARHHERHVDFAFERGAFV---------- 217
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--PYDRLVV 353
+ AV R AGE + GP+ N++ L+NYGF DN RL+
Sbjct: 218 ------MRAV---------RRGAAGEPVTDSYGPKSNARYLLNYGFAMADNRDEAGRLLD 262
Query: 354 EAALN 358
+AAL+
Sbjct: 263 DAALD 267
>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
nidulans FGSC A4]
Length = 484
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 108/277 (38%), Gaps = 49/277 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
V A D+ + F++P LV++ N + +LL+ + +L L L +++E Q
Sbjct: 50 VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
G KS W PY + L + ++ + WS EL L GS +I ++
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGSAVVEKIGKQGAEESILKLI 158
Query: 222 --------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQA 262
G+ ++ + GSL Y +DI T E +
Sbjct: 159 IPVVRANPALFPPINGLASYDGDVGAQALLGLAHTMGSLIMAYAFDIETPENEDEREGED 218
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
K +VPL L A + + A L ++++ + +P +AGE
Sbjct: 219 GYLTDEEEEQSSK-------GMVPLADMLNADAYRNNARLFQEEESLVMKAIKPIRAGEE 271
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
I G P S LL YG+V DN V+E +L+T
Sbjct: 272 IFNDYGEIPRSDLLRRYGYV-TDNYASYDVIELSLDT 307
>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
Length = 625
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 29 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 84 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 202
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
N +L++NYGF NP D +V P Y K
Sbjct: 203 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 77/159 (48%), Gaps = 18/159 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVV 271
F+A + Q P I TE E F A+ + C+V
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALGEKVLCIV 217
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 98/221 (44%), Gaps = 35/221 (15%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A + L++ K+QG +S P+I +L G PL WS+ +LA L A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190
Query: 217 ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 258
+ E+ +RE+ L DT+ W+ G + + + P T
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 316
+ V + VV L + SL +++A+ PL L ++S ++ ++ D+ +V R
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
+K GE + + G Q N L+ YGF + DNP D V+ L
Sbjct: 303 FKKGEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)
Query: 317 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 371
++ GE +++ G + N +L ++YGFV+ + + D + ++ DP + DK +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195
Query: 372 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 424
+ NG + F + G + ML +LRL + T SV L PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251
Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
S E A+ + D ++ L+GY T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 90/236 (38%), Gaps = 75/236 (31%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L+++ L L L L+YE +G S W PY+ L E+P+ W+ EL L
Sbjct: 96 LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148
Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
T ++ RE E DT + + + +P ++F+ +
Sbjct: 149 QAGATAGKV-------GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDE 190
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYS---------------------SKCKAMLAAV-- 305
+V +LA R +G ++AY+ KAM+ V
Sbjct: 191 ALV-----ALAHR-----MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPM 240
Query: 306 -----------------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
D+ + + RP KAGE I+ + GP PNS+LL YG+V E
Sbjct: 241 ADILNADAEFNAHVNHGDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 109/443 (24%), Positives = 189/443 (42%), Gaps = 56/443 (12%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L L +W+ + GLP K+ ++ + + + S+ + G +VP+S +T
Sbjct: 50 ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
++V+ + +I L+ L +AL+L++E+ + + W ++ + A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
PL WSE EL+ L G+ + + + + +Y EL+ LF + P ++
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQ 207
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAV 309
+ F A V+S V ALVPL L+ + ++ + LA A
Sbjct: 208 LDDFLWAVATVRSRV---HSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQ 263
Query: 310 QLVVD--RPYKAGESIVVWCGP--------QPNSKLLINYGFVDEDNPY-DRLVVEA--- 355
LVV+ R Y GE + + G + +S++L++YG +D D P D VV+
Sbjct: 264 ALVVEAQRDYAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFI 323
Query: 356 ---ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 412
AL +D Y DK + + NG F + A E L D ++
Sbjct: 324 LSLALPEDDKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLE 383
Query: 413 SVISSLG---PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
+ + + PVS ERAV + + + +A L GY ++ +D L D P R+
Sbjct: 384 PLFRNEAWGHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRL 441
Query: 470 ATQ-LVRM-EKKMLNACLQVTAD 490
LVR+ EK+ L+A L D
Sbjct: 442 EKAILVRLGEKETLDATLAFFED 464
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 86/394 (21%), Positives = 156/394 (39%), Gaps = 36/394 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T E + + +L + + LS LA++L++ K
Sbjct: 912 VKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFVKS- 970
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS + + ++
Sbjct: 971 ----------RTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLHTLTTQLEQRVQ 1020
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ +L L Q+ P + FT E +K A + S + +
Sbjct: 1021 DDFRQLLV------QLLSQHRDLFPLDQFTIEDYKWALCTIWSRAMDFAVSDTTSVRLVA 1074
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
PL +L +S +C A D + ++ + Y+ G+ I ++ G PN++LL YGFV
Sbjct: 1075 PLAD-MLNHSLDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFV 1132
Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
DNP D + + P Y+ K + G S + A + ++L YLR
Sbjct: 1133 LLDNPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRT 1190
Query: 403 GYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
+ D +++ + L G V+ E VL L D + L G+ L + EA L
Sbjct: 1191 QRL-DAADVADMTLQLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAG 1249
Query: 461 --YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
Y A Q+ E+ +L + DM+
Sbjct: 1250 GFYPAGGNAWAAAQVSAGEQGILTRAKKTAEDML 1283
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
DD + V + + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 42 DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101
Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 417
K V R G + VF +H E + +L +LR+ +++ + +I +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160
Query: 418 LG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
LG PVS E +L + +AR L Y T+ ED++ L +L +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216
Query: 473 LVRMEKKMLNACLQVTA 489
L EK++L ++ A
Sbjct: 217 LRLGEKEILEKAVKSAA 233
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 131/335 (39%), Gaps = 71/335 (21%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+L L SW ++G K+ L++ + + L G+ S+P SL +T+
Sbjct: 27 ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ VL AL+LM E+++G SFW Y+R L V++
Sbjct: 82 DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114
Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL W +E E L G T +L R +++++ E L + +P +
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG 166
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLG-------------PPLLA 293
TFE + A ++ S Q+ S R A+VP+ ++
Sbjct: 167 --GVTFEDYLWAMSSIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIE 224
Query: 294 YSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ ++ + + + +++ R Y A E + G N+KLL +YGFV NP
Sbjct: 225 FQAQEGGLSMGIGEPGRARGTLRVSAGRDYAAREQFFILYGRYSNAKLLYSYGFVLASNP 284
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
Y L + DP + K+ + + + Q +
Sbjct: 285 YGGLDYWVRVPQTDPGFAWKQALLDEHPLTAAQAY 319
>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 485
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/345 (20%), Positives = 123/345 (35%), Gaps = 72/345 (20%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ L+++++ +
Sbjct: 24 LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
+ L GS +I + A DT+ L Q P+ P +
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPHHFPPRPNMPPLNSPDSQN 186
Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
C+ H +G ++AY+ + A +D +
Sbjct: 187 ALLCLAHR-------------MGSIIMAYAFDIEKADEADEDTAE--------------- 218
Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 385
Y DED P +V A + D Q + R+ + + + ++H
Sbjct: 219 -----------DGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKAIKNIH 267
Query: 386 AGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 425
+G E LP R GYV+D V+ SL IC V+
Sbjct: 268 SGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDGICKVA 312
>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
distachyon]
Length = 532
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/264 (23%), Positives = 114/264 (43%), Gaps = 30/264 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE++ G A +P SL+++ E + ++ L N ++ L L+ M E+
Sbjct: 178 MVASENIGVGHIALEIPESLIISEELLCQSDMFLALKDLNSITTETMLLLWSMRERHNPS 237
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F + + L G L + LA L G+ E+++ + + ++Y+EL
Sbjct: 238 SNFKM-FFETLPSNFNTG-------LNFGIGALAALEGTLLFDELMQARQHLHQQYDELF 289
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ L ++P + +T++ F A S + + S L+P+
Sbjct: 290 PM------LCTKFPEIFTQDIYTWDNFLWACELWYSNSMMVVLSSGKLTTCLIPVAGLLN 343
Query: 289 ----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-D 343
P +L Y +A +++ + RP KAG+ + G S L+ YGF+
Sbjct: 344 HSVYPHILNYGRVDQAT-----KSLKFPLSRPCKAGQQCFLSYGKHSGSHLITFYGFLPR 398
Query: 344 EDNPYDR--LVVEAALNTEDPQYQ 365
EDNPYD L ++ +++ ED Q
Sbjct: 399 EDNPYDVVPLDLDMSVDEEDGTAQ 422
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 118/296 (39%), Gaps = 39/296 (13%)
Query: 70 VVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
VS L L +W+ GL ++ +P N+ Y+ AS ++A +VP
Sbjct: 162 AVSTPRGALARLTAWIDNAGL---EINSNARPGLNDVDE--LYLFASNPIEAATLVATVP 216
Query: 130 NSLVV--TLERVLGNETIAELLTTNKLSELA------CLALYLMYEKKQGKKSFWLPYIR 181
LV+ T R L N I L + ++ LA+ L+YE + KS W +I
Sbjct: 217 APLVMFETYLRTLENPMI--LAIDRRFKTMSVPDPSYALAMALLYESYE-PKSMWREWIS 273
Query: 182 ELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAG 239
L + ++S + WS E L P K +ILER +++ YN
Sbjct: 274 SLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA------TTP 318
Query: 240 SLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
L +P+ +++E+FK A++ V S + + L PL L +
Sbjct: 319 RLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLHHDPVQT 378
Query: 299 KAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L + V L R K GE +V G PN +LL+ +G NPY+
Sbjct: 379 NIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGELPNHQLLLRFGLAMPRNPYE 434
>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
Length = 518
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 21/151 (13%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
R+ +S T++ + K+ ED G W G+ C L P
Sbjct: 9 RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68
Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
S H P +AA DL+ G+ VP S+++T + +L +E + + N S L
Sbjct: 69 HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128
Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDR 185
+ LA+ L+YE +GK S+W PY+ L R
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR 159
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 145/373 (38%), Gaps = 54/373 (14%)
Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
DA + LV + ER + L E LAL L+YE+++G KS W P+I
Sbjct: 64 DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHIAL 123
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTK--------------AEILERAEGIKREY 228
L + L WSE ELA L GS +EI++++ E
Sbjct: 124 LP------ATPPHALLRWSEAELAELAGSDALELANRWRSQVSSDFSEIVDKSRAAVEES 177
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + A ++P+ + E F++ + ++ + VS++R+ A
Sbjct: 178 DPGKQL-SAAVKASLRFPW-LDLEGFSWAV----------SMIWSRCVSVSRKGA----- 220
Query: 289 PPLLAY--------SSKCKAMLAAVDDAVQLVVDR---PYKAGESIVVWCGPQPNSKLLI 337
PP+ A+ DDA V R K G+ + + PN+ LL+
Sbjct: 221 PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYDGLPNAWLLL 280
Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
YGF + + + A L+ E P Y+ KR + KL + A + A D L
Sbjct: 281 LYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALE---KLGLGATADGAAPFRLAADDAL 337
Query: 398 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
P RL ++ ++ + L + S RA L A LA Y + ED A
Sbjct: 338 PE-RL--LTALMAQRATLDELPGLPATSEATARAAAGDLVAACDALLAAYRGSEDEDAAA 394
Query: 458 LTDYNLHPKKRVA 470
L D P+ R+A
Sbjct: 395 LADPATPPRLRLA 407
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 100/404 (24%), Positives = 173/404 (42%), Gaps = 54/404 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + N + + +L+ E LA+ L+YEK
Sbjct: 47 VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G KS W +I L R + L + EL L GS + E + +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159
Query: 229 NELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 282
L + LF+ P D+ E F+ E +K A + S V +A++
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVAKQSFK 214
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINY 339
A+VP+ +L + + + M D + Q LV + + AG + + GP N KLL Y
Sbjct: 215 AMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHKLLALY 272
Query: 340 GFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEAISD- 395
GFV NP+D VE L ++ +Q+K + NG HA E ++D
Sbjct: 273 GFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHATNPFELVADE 323
Query: 396 ----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
+L R+ + E + + + +S E+ L +L + L +P
Sbjct: 324 SNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYTLEKMLESFPT 383
Query: 450 TLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 487
++ ED+ +L TD NL+ +R+A + R +K +L+ + +
Sbjct: 384 SIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
E+ ++ + LF + + E F + V S + +++ LV
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603
Query: 287 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ P+L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661
Query: 342 VDEDNPYDRLV 352
+N YD ++
Sbjct: 662 TVPENRYDEVM 672
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 109/269 (40%), Gaps = 33/269 (12%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +NG + E S++++ R +H A+ DL+ + VP ++T+E +G
Sbjct: 37 WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89
Query: 144 TIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
T EL L L+++ +++ +F+ PY L P+
Sbjct: 90 TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDILP------STLSNMPI 142
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W EL +L GS +I ER IK +Y + +W P I + T E
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTLEE 190
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPY 317
FK A + V S + V+ AR A+VP L + + K A + +
Sbjct: 191 FKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKI 249
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDN 346
G I G + N + L+NYGF EDN
Sbjct: 250 SVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 21/211 (9%)
Query: 153 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 212
+L E L+L+L+ EK + ++S W +IR + + ++ WSE +A L P
Sbjct: 19 QLCERQLLSLHLLVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKP 71
Query: 213 TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
A R + E+++L + W + + + +F+ E + A
Sbjct: 72 FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131
Query: 265 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 318
V + H ++ S + R V P+L + + A +AA DA+ ++ R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
GE +++ G N+ LL +YGFV EDNP D
Sbjct: 192 EGEQVMIHYGNWSNAGLLEHYGFVLEDNPLD 222
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 119/283 (42%), Gaps = 28/283 (9%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LK++ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 30 HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + +T +S L L +L+ EK G KS W PY+ L + +
Sbjct: 90 I-RSYLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E E+ L P + + E+ +++ + + SLF + D+ F +
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYH 196
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--Q 310
F A+ + + V H QK L+ + L P LL +S + A ++ +
Sbjct: 197 AFLWAWCTINTRTVYMKHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYE 256
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ K E + + GP N +LL+ YGFV +NP+ + +
Sbjct: 257 IRTTSHCKKFEELFICYGPHDNHRLLLEYGFVASNNPHSAVYI 299
>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Galdieria sulphuraria]
Length = 546
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 86/387 (22%), Positives = 148/387 (38%), Gaps = 82/387 (21%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
E +L++W+ NG+P +K KP HN + A L+ G+ ++P
Sbjct: 71 EKTEELENWLFDNGVPS----IKGKPVLSPHNCRT-----FRAKIPLKLGEEVLAIPERF 121
Query: 133 VVTL---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T E++LG + LS+ +A L+ E + + SFW P+I L
Sbjct: 122 WLTKQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDEL 174
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-D 248
L+WS E YL S T +IL E + EL+T LF ++ Y
Sbjct: 175 HHF-----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT------ELFPKFLYPQ 223
Query: 249 IPTEAFTFEIFKQAFVAVQSCVVH-----------------------LQKVSLARRF--- 282
+ FT F A VQS ++ + + S R++
Sbjct: 224 YDVKYFTLPYFTWALSIVQSFGLYDIMDSCPLVIVPGLEWLTYKYSLITEESFFRQYFHI 283
Query: 283 ---ALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQPNSK 334
+L+ +GP ++ + + + A +D V LV + ++ W
Sbjct: 284 SNVSLIRVGP---FFTQERRLKITASEDLKVGEPVSLVYEGNVSLIDTFCRWGWK----- 335
Query: 335 LLINYGFVDEDN--PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 392
++ G +DE+ + A+ T D + DK + +Q F + KE
Sbjct: 336 --LDLGALDEEQLLKMGSYEISFAVTTTDQFFDDKEDILDAQRLELLQTFELRYDMSKEL 393
Query: 393 ISDMLPYLRLGYVSDTSE--MQSVISS 417
+ +LP+LRL + D ++SV S
Sbjct: 394 LQRILPFLRLICLKDKDSFILESVFRS 420
>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
Length = 417
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 91/402 (22%), Positives = 171/402 (42%), Gaps = 54/402 (13%)
Query: 76 EDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+DL K WM G+ P ++ E + + A+ ++ GD VP ++
Sbjct: 9 DDLVTFKQWMDDEGIYLNPSLDIVKLEDYGRS--------IIANTLIKEGDVLIRVPRNV 60
Query: 133 VVT---LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
+++ +E + E I ++ +N+ + A+YLMY K S+W Y L +Q
Sbjct: 61 MMSRTGIELHIPKE-IRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQ- 117
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ + + + E+ L S + R GI+R YN ++ SL ++
Sbjct: 118 ------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYN---VIFKKLSSLNDEFK- 167
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
+ +TFE+FK A + S L + +VPL A K K+ +
Sbjct: 168 ---KKEYTFELFKWALSCIWSRAFSLS----SDDGGMVPLADMFNAIE-KAKSKVRPDSR 219
Query: 308 AVQLV--VDRPYKAGESIVVWCGPQP---NSKLLINYGFVDEDNPYDRLVVEAALN--TE 360
A QL+ + + GE + G N+++L++YGF D+P + ++ L+ ++
Sbjct: 220 ADQLIYYASKDIERGEQVFTPYGVYKTIGNAQMLMDYGFA-FDDPSEGDTIQLTLDNFSD 278
Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ----SVIS 416
D Y D ++ + V+ F++ + + ++L Y R+ + + +E+Q +
Sbjct: 279 DELYIDTKIDLLEQLDI-VREFNL---KRNQLPQELLIYARVKNLKE-NELQLAKEHYRN 333
Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
PVS E+ L L++Y L Y TLS+D +L
Sbjct: 334 DDNRNKPVSRRNEKTALRYLSNYLSRYLDSYETTLSDDLELL 375
>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 692
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 82/206 (39%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E+++++IF A+ + S + +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P GE I
Sbjct: 209 RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
Length = 692
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 82/206 (39%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E+++++IF A+ + S + +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P GE I
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 477
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 72/269 (26%), Positives = 104/269 (38%), Gaps = 51/269 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A++DL + FS+P + ++T NET L N EL L L +++E
Sbjct: 45 VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----------------- 211
+G KS W PY L +S + WS+ EL L GS
Sbjct: 97 LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKESADSTFTE 149
Query: 212 ---PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAV 266
P A+ + R +EL ++ GS Y +D+ P +
Sbjct: 150 QLIPLIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEE 209
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ + L K ++PL L A + A L DD V + +AGE +
Sbjct: 210 EESAISLPK-------GMIPLADMLNANADHNNAKLFYQDDKVVMKTLHAVRAGEELFND 262
Query: 327 CGPQPNSKLLINYGFV-DEDNPYDRLVVE 354
GP P S LL YG+V D+ YD VVE
Sbjct: 263 FGPLPRSDLLRRYGYVTDQYAKYD--VVE 289
>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
Length = 1709
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 37/148 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
+AA +D+ ++PN L+++ ++V G + +++ N+ EL C
Sbjct: 954 IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013
Query: 160 -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
LALYLMYEK +G++SFW PY EL+++ + L WS ELA S
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLF 242
I +E N+ + ++F+ S+
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 154/386 (39%), Gaps = 70/386 (18%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A +A L++ K+QG +S P+I +L G P+LW E ++A L A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187
Query: 217 ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ E+ ++ Y +L D W M S + + P T + +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245
Query: 262 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 294
A +A + V+ L QK A +++A+ PL L +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304
Query: 295 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
SS ++ +A D+ +V R +K GE + + G Q N L+ YGF + +NP D V
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYV 364
Query: 353 VEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
+ L T R+ A + L+ + V R S+ L +R +D SE
Sbjct: 365 MTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQAVRFLLAAD-SE 422
Query: 411 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-- 468
+ +SS SP E + + +A+ + L ++L ED A+L+ R
Sbjct: 423 AGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLSSTGASAGGRKG 480
Query: 469 -------VATQLVRMEKK-MLNACLQ 486
VA R+EKK +L A LQ
Sbjct: 481 GTAAAAAVAAVAFRVEKKRLLTAVLQ 506
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 122/311 (39%), Gaps = 55/311 (17%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G+ I E + + R V A++D+ + F +P + ++++E +L
Sbjct: 13 AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A T L L L ++YE G S W PY L + + + WS
Sbjct: 70 TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120
Query: 202 ETELAYLTGSPTKAEI-------------------------------LERAEGIKREYNE 230
E ELA L S +I +RAE ++ E N
Sbjct: 121 EDELAELQASAVLNKIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNV 180
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
L + GSL Y +D+ A + + + A + L K ++PL
Sbjct: 181 L--LMHKMGSLIMAYAFDVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADM 230
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYD 349
L A + A L + +++ +P +AGE + GP P S LL YG+V D YD
Sbjct: 231 LNADADCNNARLFYEEKYLEMKALKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYD 290
Query: 350 RLVVEAALNTE 360
+ + L TE
Sbjct: 291 VVEINMDLVTE 301
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 133/312 (42%), Gaps = 36/312 (11%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +G + E PS +E + A D+ GD +P++L+++ +
Sbjct: 26 WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81
Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ L T L LALY+M E + ++SF+ PY+R L ES LL
Sbjct: 82 KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ +L T + ++ ++L + RE T+ F++ S YP + +TFE+F
Sbjct: 142 DHKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDF 187
Query: 262 AFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DR 315
A+ +Q+ + +R ALVP L + + K + + +
Sbjct: 188 AWRTIQA-------RAFGKRLKSSALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNN 240
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQ 372
Y ++ G + N LL++YGF DN +D V +L + + P + ++ +
Sbjct: 241 RYPRNSEVLNSYGRRANDNLLLDYGFAMLDNEWDAAEVICSLPPSHDQSPLDRRRKACLR 300
Query: 373 RNGKLSVQVFHV 384
+G+ +V++ V
Sbjct: 301 ASGQHTVRILRV 312
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 107/278 (38%), Gaps = 51/278 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A + G+ FS+P LV++ E N + LL+ + L EL L L ++YE
Sbjct: 50 VVAQSAIVEGEELFSIPRDLVLSTE----NSKLKSLLSQD-LGELGPWLSLMLVMIYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 221
++S W PY R ++ + WS EL L GS +I +
Sbjct: 105 LREQSAWAPYYRIFPEN-------FDTLMFWSPAELQELQGSAIVDKIGRQGAEESILQM 157
Query: 222 ---------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQ 261
+G+ E T + GSL Y +DI + +
Sbjct: 158 IAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHVMGSLIMAYAFDIEKVNDEDDEDNE 217
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
+ Q +VPL L A + + A L +D++ + +P AG+
Sbjct: 218 GEDGYMTDEEEDQSSK-----GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGD 272
Query: 322 SIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 358
I G P S LL YG+V D PYD V+EA+L+
Sbjct: 273 EIFNDYGELPRSDLLRRYGYVTDNYAPYD--VIEASLD 308
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 67/283 (23%), Positives = 117/283 (41%), Gaps = 28/283 (9%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LKE+ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 29 HKLEFIELRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T T +S L L +L+ E G KS W PY+ L + +
Sbjct: 89 IKS-YLGDYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E ++ L P K + E+ ++ + + SLF + D+ F +
Sbjct: 140 LVCLEPQVVRLLPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTE---DVK-HIFHYH 195
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYS--SKCKAMLAAVDDAVQ 310
F A+ + + V H QK L+ + L P LL +S + A +
Sbjct: 196 AFLWAWCTINTRTVYMKHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYE 255
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ K E + + GP N +LL+ YGFV +NP+ + V
Sbjct: 256 IRTTSSCKKYEELFICYGPHDNHRLLLEYGFVAINNPHSAVYV 298
>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
anatinus]
Length = 499
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 114/267 (42%), Gaps = 22/267 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGK 172
A++ L+AG+ S+P + ++T + VL + + + K +S L L +L+ EK+ G
Sbjct: 64 ATKSLKAGEMIISLPEACLLTTDTVL-KSPLGDYIWKWKPPVSPLLALCTFLIAEKQAGA 122
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W PY+ L + A P+ L+ L P E+ + RE
Sbjct: 123 RSLWQPYLGVLPQ-------AYTCPVGLDAAVLSLLP-QPLGRRAREQRTAV-RELFAAS 173
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP 289
+F SL + D+ FT + A+ V + V H Q+ + + L P
Sbjct: 174 RAFF--SSLQPLFSEDV-ERVFTLDALGWAWCTVNTRTVYMEHAQRDCFSAEADIYALAP 230
Query: 290 --PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
LL +S ++ +A ++ + E +++ GP N +LL+ YGFV +
Sbjct: 231 YLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLICYGPHDNRRLLLEYGFVCSN 290
Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQ 372
NP+ +VV + DK+M +
Sbjct: 291 NPHSNVVVSPDVLVRHLPSGDKQMTKK 317
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 21/186 (11%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + +SE EL G+ + + IK +Y +L A LF Q+P P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160
Query: 256 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 302
E +K A V S + + L P +L +SS +C A
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
D + + + Y+ G+ + ++ GP PN++LL YGFV DNP D + + P
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLTTHPMAP 278
Query: 363 QYQDKR 368
Y+ K+
Sbjct: 279 FYEQKQ 284
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 100/247 (40%), Gaps = 24/247 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A D G+ +P TL L ++ + + + + +AL++ E+ +G+K+
Sbjct: 26 ALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKA 84
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
++ L R + ++ WSE EL LTG+ E + E K +Y L
Sbjct: 85 TRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKK 137
Query: 235 W--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
G +++ D +E + A + S L R A+VP +
Sbjct: 138 MEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAMVPT-FDIF 189
Query: 293 AYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDN 346
+S+K L A + V + YKAGE I G NSKLL YGF +DN
Sbjct: 190 NHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAANSKLLTWYGFCIDDN 249
Query: 347 PYDRLVV 353
PY+ L V
Sbjct: 250 PYEELDV 256
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 75/362 (20%), Positives = 142/362 (39%), Gaps = 38/362 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++D+ A VP + + E++ E + T + LA L+ EK +G S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L + + L ++ ++ L G+ + L + I R+Y +L
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------FALVPL 287
+ S + +E+++ A V + +L +A + AL+P
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNSPISALIPC 285
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
K + + ++ KAG ++ G +PN+ LL++ GFVD +N
Sbjct: 286 WDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPNADLLVHNGFVDPNNN 345
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----EAIS-DMLPYLR 401
D + + L+ D +A++ +L ++ H G + E IS +L ++R
Sbjct: 346 KDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPEYISGQLLAFVR 398
Query: 402 LGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
+ +S S+++ + L C + +E FK L ATL E +
Sbjct: 399 VFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLLGVLEATLREAD 458
Query: 456 AM 457
+
Sbjct: 459 EL 460
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 87/400 (21%), Positives = 156/400 (39%), Gaps = 48/400 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D++AG+ SVP L+ + E L E +L N + L + L+ EK +G S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEE--LLPEKQRQLFR-NFPTHLK-VTYTLIMEKLRGADS 261
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 262 PWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARLYASMYKC 314
Query: 235 WFMA---------GSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
FM +LF Y Y++ A + +Q V Q + ++ A
Sbjct: 315 AFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDAANLPIS---A 371
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+P S K + ++ YK+GE ++ G + N+ L++ GFVD
Sbjct: 372 LIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSNADRLVHNGFVD 431
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGREKEAISDMLP 398
NP D + + L+ D + + ++ +R +L V H E +L
Sbjct: 432 MQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHISGE------LLA 485
Query: 399 YLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
++R+ +S S+++ + L C + +E L K L ATL
Sbjct: 486 FVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKLLLGVLEATLK 545
Query: 453 ED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
E EA+ + + Q R+E+++L+ LQ
Sbjct: 546 ETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 73/332 (21%), Positives = 136/332 (40%), Gaps = 40/332 (12%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
+R V EE W+H+NG C+ + + E R + A++ L+ +
Sbjct: 19 TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72
Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+P L++T++ ++ + +A + ++L+ LA++LM EK + +KSFW PYI L
Sbjct: 73 VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P ++E + L S + + + +E+ EL + M LF
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNS-LRGKAKAKKYECHKEFMELAPFFKMLADLFPD 183
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQS----------CVVHLQKVSLARRFALVPLGPPL-LA 293
+AF F+ FK A+ A+++ L+ + PL + A
Sbjct: 184 -----QEDAFNFKDFKWAWSAIKTRAFDVPLGGETCYRLRDSEDTSNPTMFPLVDSINHA 238
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-- 351
+K + ++ + Y+ ++ G N LL+ +GFV NP D +
Sbjct: 239 AQAKIRHRYNEKRRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPADTVTF 298
Query: 352 -VVEAALNTEDPQYQD----KRMVAQRNGKLS 378
+V+ L P+ + K M RN +S
Sbjct: 299 HLVQDVLEYLQPENNELLERKIMFLARNNLIS 330
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 85/377 (22%), Positives = 158/377 (41%), Gaps = 24/377 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ + L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCSG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ E+ L+
Sbjct: 138 DSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLE 190
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
+ + +P I + F F I + +AF +++ +V + L A V
Sbjct: 191 EEIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTD 248
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
AY K A L + D L KAG+ + + + + N++L ++YGF++ +
Sbjct: 249 DH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTD 306
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
+ + ++ DP + DK +A+ NG F + R L +D
Sbjct: 307 RNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPYLRLVALGGTD 366
Query: 408 TSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
++S+ S G + PVS E + + + K LAGY T+ ED+ L + L
Sbjct: 367 AFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLD 425
Query: 465 PKKRVATQLVRMEKKML 481
+ +A + EK +L
Sbjct: 426 SRHAIAVGIREGEKNLL 442
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 127/325 (39%), Gaps = 51/325 (15%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +G V +KE S +++ + AS+D+ +P++++++ V
Sbjct: 36 LVDWGRMHGANIENVEIKETASDDDR-KLTRGAYASKDIPPNSEICFIPSTILLSESDVR 94
Query: 141 GNETIAELLT--------TNKLSE---------LACLALYLMYEKKQ-GKKSFWLPYIRE 182
+E +LT K+S+ L +A +++++ S WLPY+
Sbjct: 95 ASEIGKAILTYIDEHQDAKQKISDKIKHPHAEILLAMAAFIVHQVSLPTADSHWLPYLAS 154
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYNELDTVWFMAGSL 241
L + PL+W+ + L G + ++ ER E I+ N V G
Sbjct: 155 LPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTKVVENACGHY 204
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR------------FALVPLGP 289
F PT A T + + A ++ S K SL + + + L P
Sbjct: 205 F-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWIGLSEICLFP 257
Query: 290 PLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L ++ K + + V + G ++ GP+ N LL NYGFV E+NP
Sbjct: 258 ILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSNYGFVIENNP 317
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQ 372
D V L EDP Y K+ V +
Sbjct: 318 EDYFKVFLGLQQEDPLYTAKKAVLE 342
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 70/319 (21%), Positives = 126/319 (39%), Gaps = 60/319 (18%)
Query: 71 VSKKEEDLGDLKS---WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
++K + L L+S W+H NG+ + +K+ + I ++ + G+
Sbjct: 548 TAEKLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIF---STRQIHKGECLVK 604
Query: 128 VPNSLVVTLERVLGNETIA-----ELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYI 180
+P L+ +L N+T A ++ +N L ++ + + + + ++ S W PY
Sbjct: 605 IP------LKLILSNDTSAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYF 658
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L R P+L S +LA TG+ E++ + R+Y L +
Sbjct: 659 DLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQHI------ 705
Query: 241 LFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL------- 287
F+ P IP FTF F A V + C + + + L+PL
Sbjct: 706 -FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHK 764
Query: 288 -GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
G + +K C +A+ D GE I + G + N ++L+ GFVD
Sbjct: 765 PGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLLYSGFVDP 814
Query: 345 DN-PYDRLVVEAALNTEDP 362
N YD + + ++ DP
Sbjct: 815 TNIEYDHIKLSVSIPQSDP 833
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 93/403 (23%), Positives = 150/403 (37%), Gaps = 53/403 (13%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +SW+ GL ++L+ R + AS L G+ +P+ LV+T ER
Sbjct: 24 EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
G ++ LL LA ++ + W PY+ L ++ G + L
Sbjct: 78 AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPG-------TLLD 124
Query: 200 WSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W E+ L GSP + + EL+ + G P +P E
Sbjct: 125 WPAKEVQQLLRGSPLQRLADSITSAASASWRELEPL-IAQGRADGLVPEHVPLSKGDLEW 183
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---LAYSSKC----------------K 299
AF + S + L S L P L ++ C
Sbjct: 184 ---AFGVLLSRCIRLP--SRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGD 238
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAAL 357
A A+ L DRPY AG+ + V GP+ + +LL++YGF NP+ + A+
Sbjct: 239 RAGGATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAV 298
Query: 358 NTE-DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSV 414
+ + DP K R+G S F + E + L +L R +T E+ SV
Sbjct: 299 DRQGDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASV 358
Query: 415 ISSLGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 455
+ G P+ + V L L++ A L YP ++ D+
Sbjct: 359 LFESGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/271 (22%), Positives = 109/271 (40%), Gaps = 37/271 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + L + + LS LA Y+++ +
Sbjct: 34 FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + ++E EL TG+ A + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
L Q+ P FT E +K A V S + + + L FA
Sbjct: 143 VV------RLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193
Query: 287 LGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+L +SS + A + + ++ + YKAG+ + ++ G PN++LL YGF+
Sbjct: 194 ---DMLNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIP 250
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
NP D + + P ++ K + + G
Sbjct: 251 SNPNDNYELVLETHPMAPFFEQKHKLWESAG 281
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 84/376 (22%), Positives = 159/376 (42%), Gaps = 22/376 (5%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCIG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ EY L+
Sbjct: 138 NSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVENEYRRLE 190
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
+ P + + F I + +AF +++ +V + A V
Sbjct: 191 EEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDH 250
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
AY K A L + D L KAG+ + + + + N++L ++YGF++ + +
Sbjct: 251 --AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNADRN 308
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDT 408
+ ++ DP + DK +A+ NG F + ++ + L + LG +D
Sbjct: 309 AYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLVALG-GTDA 367
Query: 409 SEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 465
++S+ S G + PVS E + + + K LAGY T+ ED+ L + L
Sbjct: 368 FLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDS 426
Query: 466 KKRVATQLVRMEKKML 481
+ +A + EK++L
Sbjct: 427 RHAIAVGIREGEKQLL 442
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R AL+P+ L ++ C + +A + DR Y+AGE + G N LL YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650
Query: 341 FVDEDNPYDRLVVEAAL 357
FV DNP+D+L ++ L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 4/107 (3%)
Query: 252 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 308
E F+ F+ A++ V S CV + L + F LVPL + +SS C D A
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 355
+L + RP K GE IV G + L YGF++ NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERIDLTA 227
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VPL L A S A L + + + + GE I PN+ LL YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319
Query: 343 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 384
D++NP+D RL E +L+ DPQ Q+ + K +++V +
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378
Query: 385 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
+ RE KE + D L + +S E Q+ S G + P M + R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435
Query: 444 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 479
L Y T+ D+ +L D +L ++++ + VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 69/306 (22%), Positives = 119/306 (38%), Gaps = 45/306 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P +LV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N ++ +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
+ W++ EL L+GS +L + E N L + + +P +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSP 181
Query: 251 TE----------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
A+ F+I K + ++ +VPL L A
Sbjct: 182 EGKAALLSLAHRMGSLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSKGMVPLADLLNAD 241
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVV 353
+ + A L D + + +P + GE I G P + LL YG+V D YD V
Sbjct: 242 TDRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDNYAQYDE--V 299
Query: 354 EAALNT 359
E ++ T
Sbjct: 300 EISMRT 305
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 81/206 (39%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R S L + + +L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
++ EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 82/392 (20%), Positives = 161/392 (41%), Gaps = 34/392 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P+ L T++ + + L + + LS LA+++++ +
Sbjct: 28 VKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
++S + ++R + S + +++ EL G+ + + I+ +Y +
Sbjct: 87 -RESGYDGLRSHVERLPA----SYSSSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRD 141
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
L + QYP P + FT +K A AV S + Q + L P
Sbjct: 142 LVV------RVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD- 194
Query: 291 LLAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+L +SS +C A+ D + ++ + Y+AG+ + + G PN +LL YGF+ NP
Sbjct: 195 MLNHSSESKQCHVYDASSGD-LSVLAGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNP 253
Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 402
D + A + P ++ K+ + G S + ++D LP YLR+
Sbjct: 254 NDSYDLVLATHPLAPFFELKQKLWALAGLDSTCTISL-------TLTDPLPKNVIRYLRI 306
Query: 403 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD-- 460
+ D S++ S+ +S E VL L + + L + L + E L
Sbjct: 307 QRL-DESDLASIALGQAADEKISNSNEVQVLQSLVESIASLLGSFGTRLEKLEEQLATGV 365
Query: 461 YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
Y + A + E+++L + D++
Sbjct: 366 YPVGGNAWAAAHVSLGEQRVLKLAKKKAEDLL 397
>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 704
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 90/203 (44%), Gaps = 27/203 (13%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 124 FFLIGQYLKGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRM 178
Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCV 270
K +Y T AG FQ QY +D+ + AF+ ++ +A V+
Sbjct: 179 GVWKEKYKNGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE--- 233
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+ VS+ L+P L+ + K A V VV +G+ I GP+
Sbjct: 234 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPR 287
Query: 331 PNSKLLINYGFVDEDNPYDRLVV 353
N +L++NYGF DNP D +V
Sbjct: 288 NNEQLMMNYGFCLPDNPCDYRIV 310
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 96/218 (44%), Gaps = 31/218 (14%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ +G K+ E PS+ + I A +D+ + + S+P L++T + L +
Sbjct: 74 AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
I ++ N S+ L + LM+E+ +G+ SF+ PY+ L R ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TE 252
+E L++L E+ R + Y L AG LF + P D +
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGAD 242
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPL 287
++ E F+ A++ +Q+ + RR AL+PL
Sbjct: 243 PYSLENFRFAWLTIQA-------RAFGRRLPYSALIPL 273
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 83/373 (22%), Positives = 143/373 (38%), Gaps = 69/373 (18%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ +VP+ L A +++ A + + + KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262
Query: 320 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 378
G+ I+ + GP P S+LL YG+V E + YD + V L E V + LS
Sbjct: 263 GDQILNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSSLSLS 313
Query: 379 VQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
+ + V + + E I D Y + S ++ + VSP + ++QL
Sbjct: 314 AEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSPEL----VEQLK 366
Query: 438 DYFKA-----------------------------RLAGYPATLSEDEAMLTDYNLHPKKR 468
++ KA RLA YP ++ DE +L + +L ++R
Sbjct: 367 EFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRR 426
Query: 469 VATQLVRMEKKML 481
+A + EKK+L
Sbjct: 427 MAVVVRLGEKKLL 439
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 103/463 (22%), Positives = 173/463 (37%), Gaps = 116/463 (25%)
Query: 118 DLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSEL----ACLALYLMYEK 168
D+ G F++P +L ++ L +LG L K EL A L L +M+E+
Sbjct: 37 DIPEGHTLFTLPRNLTLSTRTSALPGLLG-------LDEWKQHELHIGWAGLILCMMWEE 89
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI----------- 217
QG S W Y+ L + ++P+ WS +L L G+ +I
Sbjct: 90 AQGASSRWSTYLASLPS-------SFDTPMFWSPDDLEELKGTSVVDKIGRDGAEEDYRS 142
Query: 218 -----------LERAEGIKREYNELDTVWFMAGSLF------------------------ 242
L E + R Y+ L+ M +
Sbjct: 143 KVVPTLQSRPDLFAPEALSRHYS-LENYHLMGSRILSRSFSVERWEGHAADKQEDSASSP 201
Query: 243 -----QQYPYDIPTEAFTFE-------IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ D+ TEA T + + +FV + A+VP+
Sbjct: 202 VADTGRDEAMDVDTEAVTATAPEAEDGVDEPSFVVDDENDSDDEDEEDPANVAMVPMADM 261
Query: 291 LLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE----- 344
L A Y S+ + +D ++++ +P GE I G PNS LL YG VD
Sbjct: 262 LNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRRYGHVDLVPLPN 320
Query: 345 ---DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSVQVFHVHAGREK 390
NP D +VE A + + Q A+R G V + +
Sbjct: 321 GDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDVFILETN----H 374
Query: 391 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 450
E +++P+ RL S + ++ S P V + ++L +A+ + RLA YP +
Sbjct: 375 ELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANALERRLAEYPTS 430
Query: 451 LSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
+ ED+ +LT+ +L+ K V +L EK++L+ L + +
Sbjct: 431 VEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 109/265 (41%), Gaps = 27/265 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++L T++ + + L + + L+ LA+Y+++ +
Sbjct: 28 VKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLAVYVLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
RE R +A S + ++E EL G+ + + I+
Sbjct: 87 ----------RESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRIE 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L + P P FT +K A V S + + + L
Sbjct: 137 DDYKDL------IARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMDFELYDGSSMRLLA 190
Query: 286 PLGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P +L +SS+ K + A + ++ + Y+AG+ + + G PNS+LL YGFV
Sbjct: 191 PFAD-MLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHYGSIPNSRLLRLYGFVI 249
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKR 368
DNP D + A + P ++ K+
Sbjct: 250 PDNPNDSYDLVLATHPMAPFFEQKQ 274
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 112/261 (42%), Gaps = 31/261 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
LF Q+P P + FT E +K A V S + LA ++ L P
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193
Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+L ++S+ K + + ++ + Y+AG+ + + GP PNS+LL YGFV NP
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNP 253
Query: 348 YDRLVVEAALNTEDPQYQDKR 368
D + + + + P ++ K+
Sbjct: 254 NDSYDLVLSTHPQAPFFEQKQ 274
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 108/263 (41%), Gaps = 46/263 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV------------------LGNETIAELLTTNKL 154
V A E ++AG+ VP ++ +E+ +G++ I + T L
Sbjct: 4 VRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTAAL 63
Query: 155 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
C YE+ W P +R L + S L W+ +L + G
Sbjct: 64 DAFEC-GDASAYEE-------WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLA 108
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHL 273
+ E +K +Y+ L F A L +Q P P AF + F++A+ S + +
Sbjct: 109 NRLREYRRSVKVQYDAL----FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKV 162
Query: 274 QK-VSLARRFALVP----LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
Q SL R +VP L A+S + ++ A +L + R GE+I + G
Sbjct: 163 QDPDSLQIREVIVPGVFLCNHSLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYG 221
Query: 329 PQPNSKLLINYGFVDEDNPYDRL 351
N+ LL+ YGF E+NPYDR+
Sbjct: 222 RLDNADLLMFYGFSLENNPYDRV 244
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 76/354 (21%), Positives = 136/354 (38%), Gaps = 39/354 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 43 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 95
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 96 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 151
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF Q+ P + F+ E +K A V S + Q L P +
Sbjct: 152 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 204
Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +SS+ K + + ++ + Y+ G+ + + G PNS+LL YGFV NP D
Sbjct: 205 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPND 264
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGY 404
+ + + + P Y+ K + G S + ++D LP YLR+
Sbjct: 265 TYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQR 317
Query: 405 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
+ + + VS E +L L + F L G+ L + E L
Sbjct: 318 ADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEKLEEQL 371
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 111/261 (42%), Gaps = 31/261 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
LF Q+P P + FT E +K A V S + LA ++ L P
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193
Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+L ++S+ K + + + + Y+AG+ + + GP PNS+LL YGFV NP
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNP 253
Query: 348 YDRLVVEAALNTEDPQYQDKR 368
D + + + + P ++ K+
Sbjct: 254 NDSYDLVLSTHPQAPFFEQKQ 274
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 109/254 (42%), Gaps = 22/254 (8%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
+ IK+ E T S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELE 215
Query: 278 LARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
R +++PL L A +SKC A L + +++V R + E + G PNS+LL
Sbjct: 216 NERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELL 275
Query: 337 INYGFVDED-NPYD 349
YG+V+ D + YD
Sbjct: 276 RRYGYVEWDGSKYD 289
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PY+R L R + + + W EL + S E +ER E +E++ +
Sbjct: 38 KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF + E F A S +V + +R +L+P
Sbjct: 93 KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137
Query: 292 LAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L + ++L D +++ DR Y GE ++V G N+ L +N+GF N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197
Query: 350 R 350
+
Sbjct: 198 Q 198
>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
Length = 504
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 159/410 (38%), Gaps = 69/410 (16%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
AG+ +VP S+ + + EL+ +SE LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168
Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
Y+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCVVHLQKV--------SLARR 281
F Q D + FT+++F+ AV + + KV +
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEFDGMDHT 272
Query: 282 FALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQP 331
AL+PL P A ++C A A ++ ++ + R A I + G +
Sbjct: 273 LALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRT 332
Query: 332 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 391
+++ L++ GFV NP+ + L P Y+++ + + G + F RE
Sbjct: 333 DAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPA 392
Query: 392 AISDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVL 433
A + P + L VS + + + + + P C ER
Sbjct: 393 AATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER--- 449
Query: 434 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
LA K L YP T+ +DEA+L N H +R+ + EK++L +
Sbjct: 450 -WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 497
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/264 (22%), Positives = 108/264 (40%), Gaps = 43/264 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET---IAELLTTNKLSEL------ACLALYLM 165
A+ED+ G+ S+P LV+T E +E +A L + L A L YL+
Sbjct: 32 ATEDILPGEELCSIPVRLVLTTEIARKSEVGRLVAAHLNAVQGERLRVSAGRAILCAYLI 91
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
+++ + +FW PY+R L + R + ++ +L G+ + E+ + I+
Sbjct: 92 HQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQIR 139
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVVHLQKVSLA 279
++ + +L +P P + FT++ F F A Q+ V + A
Sbjct: 140 ESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQQPTATTSA 193
Query: 280 RR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
+ L+ + LL Y K L ++ V + + G GP
Sbjct: 194 HADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGTEAFNNYGP 253
Query: 330 QPNSKLLINYGFVDEDNPYDRLVV 353
+ N +LL+ YGF EDN D +++
Sbjct: 254 KGNEELLMGYGFCIEDNEQDYVMI 277
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V A++ + AG A ++P L++T++ L + E L L E L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555
Query: 171 GKKSFWLPYI 180
G+ S W P++
Sbjct: 556 GQASPWYPFL 565
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 17/150 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL G +P + V++++ N IA++L +++ L + +MYE GK
Sbjct: 4 VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW Y+ EL ++ PL W+E E + L G+ + E E + ++
Sbjct: 60 ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFET-- 110
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L +Q+ + ++FT E F+ A
Sbjct: 111 ----HVPPLVEQHADRLRADSFTLESFRVA 136
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 86/199 (43%), Gaps = 13/199 (6%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+++ E+ L + +MYEK +G++S W PY+ + PL W E L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232
Query: 212 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 269
+++ + + ++ +W + Q++P ++P +++++ A AV S
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 327
L A+VP+ L + + L A + ++ R GE +V
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349
Query: 328 GPQPNSKLLINYGFVDEDN 346
G N++LL YGFV+ N
Sbjct: 350 GELSNAELLRGYGFVEARN 368
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 28/227 (12%)
Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
S+P S ++T+E +G T I + T+ L A L +Y+++++K G+ SF+ PY
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
+ L + P+ W+ EL L GS +I +RAE IK +Y + ++ G
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICSIAPEFGD 252
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CK 299
+ T E F+ A + V S L ++ R ALVP L + K
Sbjct: 253 I------------ATLEEFQWARMIVCSRNFGLL-INGHRTSALVPHADMLNHLRPRETK 299
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ + + + GE + G + N + L+NYGF E N
Sbjct: 300 WTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLLNYGFCVERN 346
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 90/227 (39%), Gaps = 37/227 (16%)
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T ER L + +L N + +L+ E+ +GK+SFW PYI L +L++
Sbjct: 90 TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143
Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIP 250
PLLW +ET L GSP I R + + ++ L T F+ G
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG---------- 193
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLA 303
E + V S V+ + SL LVP L SS +
Sbjct: 194 ------ETLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCL 247
Query: 304 AVDDAVQ---LVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 346
D + L + Y G+ + GP S+L ++YGFVD++N
Sbjct: 248 IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/200 (20%), Positives = 79/200 (39%), Gaps = 22/200 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA+ L+ E+ +G +SFW PY+R L + P+ ++ +E + +
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQS 313
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
R + ++ +Q + P E + + C + +
Sbjct: 314 RCRMLVN----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNI 363
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 333
PL P++ + + + D ++QLV R + G+++ + G N
Sbjct: 364 PGLGSTPLMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNP 423
Query: 334 KLLINYGFVDEDNPYDRLVV 353
+LL++YGF DNP+DR V
Sbjct: 424 QLLLDYGFTLSDNPHDRFEV 443
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/390 (20%), Positives = 149/390 (38%), Gaps = 41/390 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 34 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF Q+ P + F+ E +K A V S + Q L P +
Sbjct: 143 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 195
Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +SS+ K + + ++ + Y+ G+ + + G PNS+LL YGFV NP D
Sbjct: 196 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPND 255
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGY 404
+ + + + P Y+ K + G S + ++D LP YLR+
Sbjct: 256 TYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQR 308
Query: 405 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YN 462
+ + + VS E +L L + F L G+ L + E L Y+
Sbjct: 309 ADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEKLEEQLAQGVYS 368
Query: 463 LHPKKRVATQLVRMEKKMLNACLQVTADMI 492
A + E+++L + D++
Sbjct: 369 PGGNAWAAAHVSLGEQRVLRLAKKRAEDLL 398
>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
Length = 510
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 116/301 (38%), Gaps = 46/301 (15%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ NG+ K+ K K N V + D+Q + F +P ++++ E
Sbjct: 14 SWLTNNGV---KISPKLKVEDNRYKDEGRCVVTTTDIQKDELLFEIPRNVLLNCETSQLV 70
Query: 143 ETIAELLT---TNKLSE-------LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ I +LT T SE + CL Y MY K KS W PY L L
Sbjct: 71 KDIPAVLTELETFSGSEPLSWEPLILCL-FYEMYILKD--KSRWWPYFEVLPTLEDMNVL 127
Query: 193 AVESPLLWSETELAYLTGS-----------PTKAEILER------AEGIKREYNELDTVW 235
+LWS+ +LA L S ++L+R E +K N+
Sbjct: 128 -----VLWSDEDLAALEPSYVLSCIGKEQVENMYQLLKRFIEASDHEQLKSNLNKFSWDS 182
Query: 236 FM-AGSLFQQYPYDIPTEAF-------TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
F+ GSL Y +D+ E + + + + + ++VPL
Sbjct: 183 FIRIGSLIMSYSFDVGKEIHNEGKEGESMNENDNMTNGDEDEDEDEEDLEVEMIKSMVPL 242
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L A + KC A L ++++ R +GE + G NS+LL YG+V+ D
Sbjct: 243 ADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYVEWDGS 302
Query: 348 Y 348
Y
Sbjct: 303 Y 303
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L D +Q++ +P GE I G PNS LL YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119
Query: 343 DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 392
D +P D + + A E + YQD+ G V V +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174
Query: 393 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
+ D + L + + + + S P P ++ + L + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231
Query: 453 EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 486
E E ML N L+ K+R A + E+++L+ L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266
>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 674
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 88/201 (43%), Gaps = 23/201 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 94 FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 272
K +Y T AG QY +D+ + AF+ ++ +A V+ +
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
VS+ L+P L+ + K A V VV +G+ I GP+ N
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNN 259
Query: 333 SKLLINYGFVDEDNPYDRLVV 353
+L++NYGF DNP D +V
Sbjct: 260 EQLMMNYGFCLPDNPCDYRIV 280
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 63/285 (22%), Positives = 120/285 (42%), Gaps = 45/285 (15%)
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE------LLTTNKLSELACLALYLM 165
++ A+ED++ + F +P ++ VL + ++E +L + L + ++
Sbjct: 42 FILATEDIKTDELLFEIPRESILN---VLTSSLVSEYPAWENILLDGDVGHWEGLIICML 98
Query: 166 YEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAE 216
+E K K S W PY L + S + W+ EL L G+ +
Sbjct: 99 FEIKVKKNMSKWAPYFDVLPESTD-----LNSLMYWTAEELEALKPSLVLDRIGNDGAHQ 153
Query: 217 ILERAEGIKREYNELDTV-----------WFMAGSLFQQYPYDI---PTEAFTFEIFKQA 262
+ E+ + R + + +V + S+ Y +D+ PT A E +
Sbjct: 154 MHEKVMELIRTFEKDHSVDLSFGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEV 213
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V+ V + + +++PL L + ++KC A L +D++++ KAGE
Sbjct: 214 EEDVEQTVRNEGSLK-----SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQ 268
Query: 323 IVVWCGPQPNSKLLINYGFVD-EDNPYD--RLVVEAALNTEDPQY 364
+ G PN+++L YG+V+ E + YD L +E + T QY
Sbjct: 269 VYNIYGNHPNAEILRRYGYVEWEGSKYDFGELPLEVIIETLHEQY 313
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 102/256 (39%), Gaps = 34/256 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-------------LGNETIAELLTTNKLSELAC 159
A +D+ G+ ++P+ +++ ERV + +++I+ LSE
Sbjct: 75 TTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNK 134
Query: 160 LALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
+ +Y MY K K+SFW PY + + SP+ WSE EL L GS
Sbjct: 135 VLMYFLLQMYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNM 185
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ ++ Y+EL +F+ +AFT + + A S V+ L +
Sbjct: 186 ARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQ 239
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNS 333
+ +PL + S+ K + A + DR G + G + N
Sbjct: 240 TGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESYGNKSNY 299
Query: 334 KLLINYGFVDEDNPYD 349
+ L+ GFV EDNP D
Sbjct: 300 EYLLYNGFVMEDNPND 315
>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
Length = 488
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 99/467 (21%), Positives = 178/467 (38%), Gaps = 90/467 (19%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ LV++++ +
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
L GS +I + A DT+ L Q P P +
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPRHFPPRPNMPPLNSSDSQN 186
Query: 266 VQSCVVH-LQKVSLARRF---------------------------ALVPLGPPLLAYSSK 297
C+ H + + +A F +VPL A + +
Sbjct: 187 ALLCLAHRMGSIIMAYAFDIEKTDEVDEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQR 246
Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
A L + + + + +GE I G P + LL YG+V DN VVE +L
Sbjct: 247 NNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEFSL 305
Query: 358 NT---------EDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI-SDMLPYLRLG 403
++ +P + R+ N + + + + G K+ I D L LR
Sbjct: 306 DSICKVAGLPDSEPSSTNPRLELLDNLDMLEEGYSIPRIPPNGTLKDTIPKDFLVLLRAL 365
Query: 404 Y--VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--- 458
+ D + +++ + P S E ++L L R + YP ++ EDE++L
Sbjct: 366 TLPIEDLNRLKARNKAPKPEFSTS---EASLLRSLV---TCRQSEYPTSVQEDESILRCL 419
Query: 459 ------TDYNLHPKKRVATQLVRMEKKMLNACLQV--TADMIMLLPD 497
+ ++ +K++A Q+ + EK++L L + T D ++ PD
Sbjct: 420 EQQNGYINDSIPIRKKMAVQVRKGEKEILTQILTLLDTQDTHLVQPD 466
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 20/202 (9%)
Query: 78 LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
LG+ WM G+PP ++L + E + + G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ L A ++E +AL L+ E+ G SFW PYI L
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTE 252
V L+W E +LA L SP A + E+ L+ + A S +F
Sbjct: 227 VNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMRARSDVFD-------PS 279
Query: 253 AFTFEIFKQAFVAVQSCVVHLQ 274
FTFE + AF+ + S + ++
Sbjct: 280 VFTFEAYLWAFINIFSRAIRVK 301
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/280 (21%), Positives = 118/280 (42%), Gaps = 42/280 (15%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWL 177
++ G +P ++++ + VL + + + +L+ + + +L+Y+K G+ SFW
Sbjct: 65 IKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQAITTFLIYQKHIGETSFWK 124
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + P+ + E + YL S +A I + + + Y EL +
Sbjct: 125 PYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQECIKSYEELKPFFPS 176
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF--------ALVPL 287
L + FTF+ ++ A+ V +S V + ++ R +LVP+
Sbjct: 177 LEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTSLVPM 231
Query: 288 GPPLLAYSSKCKAML------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
LL +S + L D + + YK G+ ++ N LL+NYGF
Sbjct: 232 -VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYRRADNQTLLLNYGF 290
Query: 342 VDEDNPYDRL----------VVE-AALNTEDPQYQDKRMV 370
V DN D + ++E EDP+++ ++++
Sbjct: 291 VLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVL 330
>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
Length = 468
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 80/199 (40%), Gaps = 29/199 (14%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E + K S+W PYI L + A+ P +W E ++ L + + E
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
IK+EY L V F + Q Y FT F+ + + Q H+
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217
Query: 278 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
L P G PLL +S+ + L VD QL+ + Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQV 272
Query: 324 VVWCGPQPNSKLLINYGFV 342
G + NS+LL+ YGF+
Sbjct: 273 FNNYGLKSNSELLLGYGFI 291
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 93/414 (22%), Positives = 160/414 (38%), Gaps = 97/414 (23%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL---VVTLE 137
+ +W+ ++G V +K P + + +S++ GD S+P L + L
Sbjct: 4 ITTWVKEHGGHIDGVYVKNFPVYGNG------LCSSKEFHEGDTLLSIPYHLQLNTIELH 57
Query: 138 RVL-----GNET--IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
V G E + E N+ E + + LYL K +K F PYI L
Sbjct: 58 NVFESMVPGFEVPRLGEG-AKNRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT----- 110
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL +SE EL L G+ ++L E K +L + +L QYP
Sbjct: 111 --TFSCPLSYSENELKMLKGT----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP---- 157
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV---PLGP-----PLLAYSSKCKAML 302
T F+ F Q V +V +R F ++ P+G P +S+
Sbjct: 158 TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYPDPIGDVASLIPFADFSNH----- 205
Query: 303 AAVDDAVQLVVDRPYKA----GESIVVWCGPQ--------PNSKLLINYGFVDEDNPYDR 350
+ V V +R + V+ CG Q PN K+L+ YGFV +NPYD
Sbjct: 206 -NTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDE 264
Query: 351 LVV-----EAALNTEDPQYQDKRM--------------------VAQRNGKLSVQVFHVH 385
+++ E + + ++ +M + Q + V F +
Sbjct: 265 VLLRINFKERHFEKQVEESEESKMEVENKENERMEVEEEDNEDEITQILKREGVDRFDYY 324
Query: 386 AGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC-MERAVLDQL 436
REKE +D+L LR+ +S + ++ + L + P++ R++++Q+
Sbjct: 325 LTREKELPTDLLRVLRIVNLSLVEANQYSQALLDLSYVSPINEIKATRSLMEQI 378
>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 104/259 (40%), Gaps = 31/259 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS------ELACLALYLMY 166
V A+ DL+ G+ SVP+ V+T++ + + E + + L + +M
Sbjct: 57 VRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEFVGDGDDEAPSPRLDKELLVIAVMC 116
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------PTKAEILE 219
E GK S W Y+ + G S L W + ++ L G+ E L+
Sbjct: 117 EMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDETLD 172
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKVSL 278
+ + + T++F LF + + EA A VA S + ++
Sbjct: 173 LPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEIQ- 228
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPNSK 334
A+VP +L ++ C+A + D +Q++ R K GE + GP N++
Sbjct: 229 ----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTYGPLRNAE 283
Query: 335 LLINYGFVDEDNPYDRLVV 353
LL YGFV NP+ V
Sbjct: 284 LLRRYGFVLPRNPHGGTTV 302
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 111/270 (41%), Gaps = 35/270 (12%)
Query: 84 WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
W+ K+G L V+ P E+ + A++ ++ G + ++P SL +T L
Sbjct: 18 WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71
Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ IA+ + E +AL +++E+ QG+ S P+I L ++ G+L E PL
Sbjct: 72 SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W E +L S T+ G + +E D W ++ + F ++P P + F F
Sbjct: 127 WGEADLTLADASSTRG-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDF 179
Query: 260 KQAFVAVQSCVVHLQK-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ A S + V A +L + P + + + AV L
Sbjct: 180 RWAVGVALSRSFFVDGELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLR 233
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+ Y+ GE V GP+ + L GFV
Sbjct: 234 AGKNYEEGEEFFVSYGPKGAAGYLEENGFV 263
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/306 (22%), Positives = 121/306 (39%), Gaps = 63/306 (20%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNE---KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
K+W+ NG + ++ +E + P V A D++ G++ +P+S T E
Sbjct: 4 FKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACFTCE 63
Query: 138 RVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + ++ L+ + LA L L+ E+ G S W Y+ L +
Sbjct: 64 NCAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSE------PDV 117
Query: 197 PLLWSET--ELAYLTGSPTKAEILERAEGIKREYNE-----LDTVWFMAGSLFQQYPYDI 249
++WSE YL G+ + + + + E+ LDT+ A + +D
Sbjct: 118 VMMWSEDGERRRYLCGTDIEQSLRDERAAARTEWTRHVKPVLDTLRGAA----KDVGFD- 172
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLL--------------AY 294
F+A +S V+ +R F + P +G L+ Y
Sbjct: 173 ------------DFLAARS-------VASSRAFTVNPRVGAGLVPIADLFNHRTGGHHVY 213
Query: 295 SSKCKAMLA-------AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
S + A + DDA+ + V + KAGE + G N+KLL +YGF DNP
Sbjct: 214 LSDARGTAAVSERDEGSDDDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNP 273
Query: 348 YDRLVV 353
D++ +
Sbjct: 274 ADKVTI 279
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 129/289 (44%), Gaps = 38/289 (13%)
Query: 113 VAASEDLQAGDA------AFSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
+ A DL++ +A ++P+ LV++ E V + +LL + S +
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202
Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
LYL+ + Q + + W YIR L R + P +W+E E L G+ +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + + +E++ L V + +P+ E+ + +V V + +
Sbjct: 256 ALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRC 307
Query: 276 VSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQ 330
+ L R A+VP G ++ +SSK A D D V L++ P ++GE + + G +
Sbjct: 308 LELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAK 366
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
P S++L +YGF+D +N D+L + +DP + K ++ L++
Sbjct: 367 PASEMLFSYGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 60 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L WS E L S T+ + + +Y W + +L + + ++F
Sbjct: 172 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 225
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
F+ A V S + R A PLL +++ + +
Sbjct: 226 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 276
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++++ DR G+ IV+ GP+ + L ++GFV
Sbjct: 277 LKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFV 310
>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
troglodytes]
gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
Length = 307
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 101/250 (40%), Gaps = 22/250 (8%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S
Sbjct: 67 SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
LF + I F++ A+ V + V+L Q+ L+ L P
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233
Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
LL +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
Query: 349 DRLVVEAALN 358
+ V N
Sbjct: 294 ACVYVSRGWN 303
>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 108/256 (42%), Gaps = 28/256 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++DL+ GD A +P S++++ E V ++ L + ++ L L+ M E+
Sbjct: 149 ATKDLKVGDIALEIPVSIIISEEHVHKSDMYHILEKIDGITSETMLLLWSMKERHNCSSK 208
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L + + L G+ EI++ E ++ +Y+EL
Sbjct: 209 FKI-YFDTLPEEFKTG-------LSFGVDAIMALDGTLLLEEIMQAKEHLRVQYDEL--- 257
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
L + YP E +T+E F A S + + V R L+P+
Sbjct: 258 ---VPPLCKNYPDVFLPELYTWEQFLWACELWYSNSMKVMFVDGKLRTCLIPIAGFLNHS 314
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y + + + ++ + RP GE + G +S L+ YGF+ + D
Sbjct: 315 LYPHIVHYGK-----VDSATNTLKFPLTRPCCFGEQCCLSYGNFSSSHLITFYGFMPQGD 369
Query: 346 NPYDRLVVEAALNTED 361
NP D + ++ + D
Sbjct: 370 NPCDVIPLDIDVGDAD 385
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 66 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L WS E L S T+ + + +Y W + +L + + ++F
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 231
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
F+ A V S + R A PLL +++ + +
Sbjct: 232 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 282
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++++ DR G+ IV+ GP+ + L ++GFV
Sbjct: 283 LKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFV 316
>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
Length = 307
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 101/250 (40%), Gaps = 22/250 (8%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S
Sbjct: 67 SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
LF + I F++ A+ V + V+L Q+ L+ L P
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233
Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
LL +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
Query: 349 DRLVVEAALN 358
+ V N
Sbjct: 294 ACVYVSRGWN 303
>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
Length = 475
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/290 (22%), Positives = 119/290 (41%), Gaps = 47/290 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
K S W PY+ L + ++ E E L ++ E ++ +
Sbjct: 115 KTSRWHPYLMHLPH-------TYDVLAMFGEFEKHAL-------QVDEAMWVTEKAMLKA 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ W A SL Q + + FTF+ + A + S +H + L P+G L
Sbjct: 161 KSEWKEAHSLMQDLMF--KPQFFTFKAWVWAAATISSRTLH---IPWDEAGCLCPVG-DL 214
Query: 292 LAYSSKC--KAMLAAVDDAVQL---------------------VVDRPYKAGESIVVWCG 328
Y + + + +D A QL YK G+ +++ G
Sbjct: 215 FNYDAPGIEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYG 274
Query: 329 PQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGK 376
N +LL +YGF+ ++NP D++ + E AL + + + + NGK
Sbjct: 275 TYTNLELLEHYGFLLQENPNDKVFIPLEPALYSS-TSWSKESLYIHHNGK 323
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 122/308 (39%), Gaps = 51/308 (16%)
Query: 81 LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
+ WM NG+ C++ +PS N ++ A ++ Q P L +T
Sbjct: 18 FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71
Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
VL + + +L+ ++ + + L+L+ E+ +G+ SFW PY+ L + G
Sbjct: 72 VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125
Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+PL +SE EL+ L G+ +A + G+ LD A S+F +IP
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDR----ANSVFWTRALNIPCP- 179
Query: 254 FTFEIFKQAFVAVQSCVVH----------LQKVSLARRFALVPLGPPLLAYSSKCKAM-- 301
F F H V + + L P + + KA+
Sbjct: 180 ---HSFNNRFAVDLDSTTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVAL 236
Query: 302 ---------LAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
+ V +++ LV +D G I + G + N +LL YGFV +NP D
Sbjct: 237 WEVDGPEGSVTGVPNSMYLVTGLDVVISNGSEIFISYGNKSNEELLYLYGFVLVENPDDY 296
Query: 351 LVVEAALN 358
L+V + +
Sbjct: 297 LMVRSTIG 304
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 106/219 (48%), Gaps = 35/219 (15%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
+S LAL+L+ ++ G KS W ++ L DR+ G ++ PL WS+ + LT P
Sbjct: 79 MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132
Query: 213 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 270
+L++ + + +Y++ T F+A +Y D P +A+ + A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179
Query: 271 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 318
++ ++ A + L P L+ +S + C+ +++ + L R Y
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
A E I + GP+ NS LL YGF +NP+D + + AL
Sbjct: 239 ADEEIFLCYGPRSNSVLLCEYGFTVPENPWDDVDISDAL 277
>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
Length = 482
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 89/225 (39%), Gaps = 47/225 (20%)
Query: 158 ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ L L L+YE Q G W PY+ L A +P+ WS EL L S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154
Query: 212 PTKAEI-LERAEGIKREY--------------------NELDTVWFMAGSLFQQYPYDIP 250
P A++ E A+ + R ++ + GS Y +D+
Sbjct: 155 PAVAKVGRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLD 214
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E E + +V + + V +A +L ++ + DD +
Sbjct: 215 KEEDEDEDGEDGWVEDRDGKALMGMVPMA----------DILNADAEFNVHVNHGDDDLT 264
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVE 354
+ RP +AGE I+ + GP PNS+LL YG+V E + YD VVE
Sbjct: 265 VTALRPIRAGEEILNYYGPHPNSELLRRYGYVTERHARYD--VVE 307
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 122/326 (37%), Gaps = 51/326 (15%)
Query: 118 DLQAGDAAFSVPNSLVVTL-ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
D+ + + F++P SLV+T + + EL L + ++YE +G+ S W
Sbjct: 55 DIASDEELFAIPRSLVLTTATSSIPRSVLKELEDKGATGAWPPLIVTIIYEYLRGESSPW 114
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN------- 229
PY + L + + W++ ELA L S +I R + E+
Sbjct: 115 HPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGRRQ--AEEEWQNTIIPTM 165
Query: 230 --------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+L + MAGSL Y +DI + + A + +
Sbjct: 166 ADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDNDKDGADSADDEFEED 225
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
+VP L A + K A L D + + +P AGE I GP P S L
Sbjct: 226 DEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAGEQIFNDYGPLPRSDL 285
Query: 336 LINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
L YG+V D YD + L E VA ++ K QV+ RE+E
Sbjct: 286 LRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKDQVW-----REREQQL 331
Query: 395 DMLPYLRLGYV-----SDTSEMQSVI 415
D L L GY DT +Q V+
Sbjct: 332 DELGVLDDGYAITRPEYDTQGLQDVL 357
>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
gorilla]
Length = 440
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 113/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G++S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGRRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 86/217 (39%), Gaps = 34/217 (15%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
+ +L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 97 VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFK 260
L GSP I R + + ++ L T F+ G E
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLS 197
Query: 261 QAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ--- 310
+ V S V+ + SL LVP L SS + D +
Sbjct: 198 EGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTAS 257
Query: 311 LVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 346
L + Y G+ + GP S+L ++YGFVD++N
Sbjct: 258 LEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 97/225 (43%), Gaps = 33/225 (14%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L D +++V +P K GE I G PN++LL YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304
Query: 343 D--------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 390
D NP D + ++A L + P+ V K + + + G E
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLIISAVSSTPE-----AVKDDEAKERID-WWLEEGGED 358
Query: 391 EAISD--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
I D M+ +++L ++ ++ S P +E + D L +
Sbjct: 359 VFILDYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEK 414
Query: 443 RLAGYPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 486
RLA YP T+ D+A+LT D L+ K + +L EK++L+ LQ
Sbjct: 415 RLAEYPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457
>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L + +++V +P +AGE I G PNS LL YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316
Query: 343 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 388
D NP D + V A L ++ +Y + V + VF +
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374
Query: 389 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 448
+ E +++ + RL ++S+ +++ S P V P VL D ARL YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430
Query: 449 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 486
++ EDE +L+ +L+ K V +L EK++L L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 109/254 (42%), Gaps = 38/254 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV--LGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
A+ D Q GD +P L++ R L N A + L + +AL++ ++K
Sbjct: 49 ATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIV----ALKQHPSIALFIAWQKIHPT 104
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
W PYI L R L ++ LL L Y +I E A K + ++LD
Sbjct: 105 PE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLD 149
Query: 233 TVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RFAL 284
T + + Y+ IP + IFK A++ V + + + ++++ + +
Sbjct: 150 TDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPI 204
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
+ L P L + A ++A D V+ + PYK G + + GP N+ LL YG
Sbjct: 205 ITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINYGPHDNNFLLAEYG 264
Query: 341 FVDEDNPYDRLVVE 354
F NP++ +V++
Sbjct: 265 FAILKNPFNHVVLD 278
>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
Length = 442
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 79/195 (40%), Gaps = 20/195 (10%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +GK SFW PY+ L + P W E ++AYL + I E
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 273
+K E+ + L F + + Y FT F+ + + + H+
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230
Query: 274 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWC 327
Q V L F+++ PLL ++ + + D DA LV Y G +
Sbjct: 231 PQSVEL-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNY 286
Query: 328 GPQPNSKLLINYGFV 342
G + NS+LL+ YGF+
Sbjct: 287 GLKTNSELLLGYGFI 301
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L + +++V +P KAGE I G PNS LL YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323
Query: 343 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 388
D NP D + V A L + K + +R + VF +
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381
Query: 389 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 448
+ E +++ ++RL + ++ S P P +++ VL D + RL YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437
Query: 449 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 486
TL EDEA+ L KR A + EK++L L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478
>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
Length = 472
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/329 (20%), Positives = 133/329 (40%), Gaps = 67/329 (20%)
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
++ L + E E+ YL GSP +I+ E + Y++L F + + +
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220
Query: 255 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 306
++ F+ A + + +++ S ++ L P+ PP++ Y + A +D
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280
Query: 307 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGF----VDEDNPYDRLVVE-- 354
+V + + K GE I V +CG S LL++YG+ +D+ + + L+ E
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYLPNQIDDKSCVNVLMEELL 336
Query: 355 AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
+N +DP DK +++ ++ KL +
Sbjct: 337 ETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------------- 367
Query: 410 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYN-LHPK 466
M S+ L I + ++L+ L ++ YP T+ +D+ L +YN L +
Sbjct: 368 -MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEYNQLSAR 426
Query: 467 KRVATQLVRMEKKMLNAC---LQVTADMI 492
++A L EK++L+ LQ D I
Sbjct: 427 SKLAFNLAFQEKQILSNVYTKLQENIDTI 455
>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
Length = 559
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 105/256 (41%), Gaps = 28/256 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A EDL GD +P +++++ E V + L + + L+ M EK
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMMLLWSMKEKHIVDSE 248
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L A + L + + L G+ E+++ E ++++YNEL
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
F A L +P P E +++E F A S + + R LVP
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
L P +L Y + + D+++ + RP +AGE + G S L+ YGF+ E D
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGD 409
Query: 346 NPYDRLVVEAALNTED 361
N D + ++ +D
Sbjct: 410 NVNDVIPLDIDFGDDD 425
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 95/455 (20%), Positives = 183/455 (40%), Gaps = 63/455 (13%)
Query: 71 VSKKEEDLGDLKSWM-HKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+S EDL + W+ +KN L P I+ P + A+ +++ + +
Sbjct: 202 ISTTPEDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRS------MVANTNIKKDEILVEI 255
Query: 129 PNSLVVTLERVLGN--ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
P +++T + ++ N I + + K+S A+ ++Y + S+W Y+ L +Q
Sbjct: 256 PKGIMMTPKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ 314
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN-------------ELDT 233
+ + ++ E+ L SP R G+ R Y+ E D+
Sbjct: 315 -------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDS 367
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
S + + +T + FK A V S L + +VPL A
Sbjct: 368 TKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSLSE----EDGGMVPLADMFNA 422
Query: 294 YS----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP---NSKLLINYGFVDED- 345
+ SK ++A ++ + +AGE I G + ++L++YGF+ ED
Sbjct: 423 DTVISRSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDG 482
Query: 346 NPYDRLVVEAA-LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 404
+ D +V A + +P Y KR + Q NG + + F + + + ++ + R+
Sbjct: 483 SSADSTIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNKLAKELFLFARIKS 538
Query: 405 VS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---- 458
++ ++ + + S ++P E+A L L++ L Y T+ +D +L
Sbjct: 539 INKKESDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQDNQILKEIE 598
Query: 459 ---TDYNLHPKKRVAT----QLVRMEKKMLNACLQ 486
T+ N H T +L MEK +LN+ L+
Sbjct: 599 KDKTNTN-HSSVTFNTINAIKLRLMEKNILNSFLK 632
>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 478
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 125/305 (40%), Gaps = 51/305 (16%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ +G P KV K + + V A D+ G+ F++P V++ + N
Sbjct: 22 SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ +LL+ + +L L L ++YE G +S W Y + L R+ ++ + W
Sbjct: 76 SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILER--AEGIKREYNELDTVWFMAG-SLF------QQYPYDIPT 251
S +EL L GS I++R EG + E+ A SLF Y D T
Sbjct: 129 SPSELQELQGSA----IVDRIGKEGAEESILEMIAPIVRANPSLFPPVDGLASYDGDAGT 184
Query: 252 EAF-------TFEIFKQAF-----------VAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+A I AF +S V + L++ +VPL L A
Sbjct: 185 QALLNLAHVMGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSK--GMVPLADLLNA 242
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLV 352
+ + A L + + + +P AG I G P + LL YG+V D +PYD V
Sbjct: 243 DADQNNARLFQEETGLVMKAIKPISAGAEIFNDYGEIPRADLLRRYGYVTDNYSPYD--V 300
Query: 353 VEAAL 357
VE +L
Sbjct: 301 VELSL 305
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
A LA+ LM + G + W Y L AV+S ++WS+ EL L GS +
Sbjct: 47 ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ R + KREY+ L F A L + P EA++F++F+ A+ V
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 321
+AR F L L +C A+L +D DA + VV+R A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184
>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
Length = 440
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 69/333 (20%), Positives = 127/333 (38%), Gaps = 39/333 (11%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + LK W+ ++ P + + L+ G
Sbjct: 22 SRGVNESHKPEFIKLKKWLKDRNFEDTNLMPARFPGTGRG------LMSKTSLREGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDR 185
S+P S ++T + V+ + A ++ S L L +L+ EK G +S W PY+ L +
Sbjct: 76 SLPGSCLLTTDTVIRSSLGAYIIKWKPPPSPLLALCTFLVSEKHAGDQSVWKPYLDILPK 135
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
+ P+ E E+ L P KA+ E+ +++ + + LF++
Sbjct: 136 -------SYTCPVC-LEPEVVNLLPEPLKAKAEEQRMSVQQFFASSRDFFSSLQPLFEEA 187
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS--------- 296
+ F++ A+ V + V+L+ RR + L P A +
Sbjct: 188 TDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLSLEPDTCALAPYLDLLNHSP 239
Query: 297 --KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----D 349
+ KA ++ Y+ + + + GP N +LL+ YGFV NP+
Sbjct: 240 NVQVKAAFNEETGCYEIRTASDYRKHKEVFICYGPHDNHRLLLEYGFVSLCNPHACVYVS 299
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G L F
Sbjct: 300 REILVKYLPSTDKQMNKKISILKDHGFLENLTF 332
>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
Length = 653
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
AA +++ D VP +++T+ER L + + A + + + L ++L+YE
Sbjct: 56 AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++G +SFW PY +D G L P WS+ + L S K +I + + + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167
Query: 228 YNEL 231
++ L
Sbjct: 168 WDML 171
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 86/224 (38%), Gaps = 46/224 (20%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +L+ + G KS+W PYI L Q E+ LLW L G+ KA
Sbjct: 128 AFFLLEQLVLGDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFA 180
Query: 219 ERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQ 274
A K Y + G L Q + A+T+E F+ A +S +
Sbjct: 181 AEAARWKEMY--------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVL 232
Query: 275 KVSLARRFALVP------------LGP----------PLLAYSSK---CKAMLAAVDDAV 309
+L AL+ LG PL+ S+ K A V
Sbjct: 233 DATLPADKALLQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFV 292
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
L V PY++G+ I GP+ N LL+ YGF DNP+D +V+
Sbjct: 293 GLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336
>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
troglodytes]
gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
Length = 440
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
Length = 440
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
Length = 440
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 111/291 (38%), Gaps = 42/291 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P SLV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N + +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
+ W++ EL L+GS +L + E N + + +P +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANIFRNILPLVSGNPSLFPPMSGVASFDSP 181
Query: 251 ----------------TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
A+ F+I K + ++ +VPL L A
Sbjct: 182 EGKAALLSLAHRMGSLVMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNAD 241
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
+ + A L D + + +P + GE I G P + LL YG+V ++
Sbjct: 242 ADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDN 292
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 81/373 (21%), Positives = 142/373 (38%), Gaps = 69/373 (18%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ +VP+ L A +++ A + + + +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262
Query: 320 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 378
G+ ++ + GP P S+LL YG+V E + YD + V L E V LS
Sbjct: 263 GDQVLNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSCLSLS 313
Query: 379 VQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
+ + V + + E I D Y + S ++ + VSP + ++QL
Sbjct: 314 AEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSPEL----VEQLK 366
Query: 438 DYFKA-----------------------------RLAGYPATLSEDEAMLTDYNLHPKKR 468
++ KA RLA YP ++ DE +L + +L ++R
Sbjct: 367 EFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRR 426
Query: 469 VATQLVRMEKKML 481
+A + EKK+L
Sbjct: 427 MAVVVRLGEKKLL 439
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 121/299 (40%), Gaps = 41/299 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTT--NKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + I +L + LS LA+Y+++ +
Sbjct: 34 FKEGERILTIPSSILWTVEHAYADSIIRPVLQSMQGALSVDDTLAIYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQL-----AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R + + S + +++ EL GS + + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF QY FT E +K A V S + + L P +
Sbjct: 143 ------VERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-M 195
Query: 292 LAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +SS KC + D + ++ + Y+ G+ + + G PN++LL YGFV +NP
Sbjct: 196 LNHSSDVKKCHVYDTSSGD-LSILAGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPN 254
Query: 349 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 402
D + E P ++ K+ + G SV + +++D LP YLR+
Sbjct: 255 DSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISL-------SLNDPLPKSVLQYLRI 306
>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
Length = 440
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
Length = 440
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L+ + L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELDTCALAPYLDL 234
Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 111/285 (38%), Gaps = 67/285 (23%)
Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
V A+E++ GD FS+P + ++T + T + EL ++ + + L +L
Sbjct: 45 VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
E+++G S W PYIREL +PL WS E L G+ R G
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT--------RLGGA 149
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA----- 279
+ F +L Q +P AF + Q ++ + + +SLA
Sbjct: 150 TK---------FHDCALLQLTEVCVP--AFIAILRAQLILSANTKAIASGAISLAQDALS 198
Query: 280 -----------------------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--- 313
R ALVPLG +L +S + D A Q ++
Sbjct: 199 PDRLAWSHSCVSSRAFSLFLNGQRTIALVPLG-DMLDHSPDAQIEWRTDDTAGQFLIISH 257
Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
DR AG + G + N +L++ YGF + + + L V A++
Sbjct: 258 DR-LPAGSIMFNNYGAKSNEELILGYGFFMKSSVLETLYVRLAVD 301
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 127/331 (38%), Gaps = 54/331 (16%)
Query: 71 VSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP- 129
+ K E + W ++G+ + L E + +AA++++ G+ S+P
Sbjct: 67 IQKSGEGPLGFQEWALQSGITSPSLRLAEFAG-------LRGMAAADNIAKGEVLVSLPV 119
Query: 130 -NSLVVT-LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
+LVV+ ER T +K +AL L+YE++ G S PY+ L
Sbjct: 120 AAALVVSPKERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALP--- 176
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------DTVW 235
+ +PL WSE +L L E+ + EG+KR + EL D +W
Sbjct: 177 ----VDFSTPLSWSEAQLQALCYPQLIREVATQREGLKRLHAELAVSTPGTPITEQDLIW 232
Query: 236 FMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL-GPPLL 292
+ + + PY PT + F + + ++ A AL L +L
Sbjct: 233 ALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVL 292
Query: 293 AYSSKCKAMLAAVD-----DAVQLVVDRPYKA-------------GESIVVWCGPQPNSK 334
+ K AM VD VQ V+ Y A GE + + G Q N
Sbjct: 293 SQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSVRCQSYFSKGEQVFISYGKQSNDS 352
Query: 335 LLINYGFVDEDNPYDRLVV----EAALNTED 361
LL YGFV+ P+D + AAL D
Sbjct: 353 LLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383
>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
Length = 416
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 113/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + + +T K S L L +L+ EK G +S W
Sbjct: 44 LQEGQMIISLPESCLLTTDTVIRS-YLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 102
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 103 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 154
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 155 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 210
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 211 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 270
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 271 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 6/140 (4%)
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
LE E +++ E W + F+ D+P E +T+ + + K
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
R AL+P+ CK +A D +V DR YK GE + + N +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226
Query: 338 NYGFVDEDNPYDRLVVEAAL 357
YGF+ +++ D + ++ A+
Sbjct: 227 EYGFIPDESLDDDVYIDDAV 246
>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
Length = 440
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 64/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ + + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 76/351 (21%), Positives = 139/351 (39%), Gaps = 27/351 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V ++ ++ + SVP ++ ++ + + +L L++ L L+++YEK +
Sbjct: 1212 VVTTKKVEENECVVSVPRKFLINVDCARKHPVLNSILFEEATGLNDDTILFLFVIYEK-E 1270
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SFW P+ L + + ++ TEL L G+ + E IK
Sbjct: 1271 NPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQIKEHLES 1319
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ + F L QYP P FT E F A S + L K+ LVP+
Sbjct: 1320 IRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCLVPMADM 1376
Query: 291 LLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+ + + +D +++ A I + G + +L + YGFV +N Y
Sbjct: 1377 INHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWELALYYGFVISNNHY 1436
Query: 349 DRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
D + + + ED P+ ++++ L+V ++H S +L LR+ +++
Sbjct: 1437 DSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN---IPSKLLASLRVALLAE 1493
Query: 408 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
E + PI S E VL L L + +T ED+ +L
Sbjct: 1494 -DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSSTCDEDQQLL 1540
>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
Length = 761
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G+KSF+ PY++ + + + + WS+ EL Y+ E I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152
Query: 225 KREYNELDTVWFMAGSLFQQY 245
RE +L +W A +F ++
Sbjct: 153 VRE--DLKDLWNQAKEIFNEF 171
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 71/323 (21%), Positives = 136/323 (42%), Gaps = 47/323 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
L+ W+++ G +I P+++ + A++ ++A ++ S+P ++
Sbjct: 37 LRRWLNERGFTSQSLI------------PVNFHDTGRGLMATQTIKAKNSVISLPEECLL 84
Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T VL + +A+ + +S L L +L+ E+ G+ S W PYI L +
Sbjct: 85 TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PL + + + L S K + ++ E + ++ T + LF Q PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190
Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAA 304
F+ + + A+ +V + V H Q L+R + L P LL + + + +A
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNK 250
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNT 359
++ K + + GP N +LL+ YGFV NP+ D ++ L+
Sbjct: 251 ETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDE 310
Query: 360 EDPQYQDKRMVAQRNGKLSVQVF 382
+D Q ++K + + N L F
Sbjct: 311 KDKQLKEKLLYLKDNDFLRNLTF 333
>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
Length = 440
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 127/318 (39%), Gaps = 35/318 (11%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F+ SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFT----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDTTLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 HFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
E+ + ++ + LF + I F++ + A+ AV + V++++
Sbjct: 160 WEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMKRPP 215
Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWCGPQ 330
+ L+ L P LL +S + A ++ + + G+ + + GP
Sbjct: 216 LLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICYGPH 275
Query: 331 PNSKLLINYGFVDEDNPY 348
N +LL+ YGFV NP+
Sbjct: 276 DNHRLLLEYGFVCVSNPH 293
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
A+VP+ L + A L D+ V +++ KAGE I G PNS LL YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330
Query: 342 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 385
+D NP D LVVEAA + QD+ V + VF V
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387
Query: 386 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
G + E +M+ RL + +E + + G + P M+ + D ++RL
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442
Query: 446 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 486
YP ++ EDE +L D + L +++A + EK++L L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484
>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
Length = 490
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 44/289 (15%)
Query: 80 DLKSWMHKNG---LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
DL W+ + G P +V +H E + A D+ GD ++P L + L
Sbjct: 60 DLVRWVQREGGFVHPALRVA-----NHPEHGLGVSAAAPDGDIPPGDVLIALPGRLPLRL 114
Query: 137 ERVLGN-ETIAELLTTNKLSELACLALYL-MYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
R G + + L EL + L L + +++ SFW PYI L
Sbjct: 115 RRPTGAADDVLVQLAQQVPEELWAMKLGLRLLQERAKSDSFWWPYIANLPE-------TF 167
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
P+ + ++ L +P ++ +R + E+ QQ + +P+
Sbjct: 168 TVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEKEI-----------QQKLHTVPSVDH 216
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC------------KAML 302
F + Q + S + + +R F L P LL C + +
Sbjct: 217 PF--YGQDVNS--SSLGWAMSAASSRAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGSV 272
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
++D +V++V ++ + SI + G PN L++YGFV NPYD++
Sbjct: 273 NSLDMSVKVVAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYDQV 321
>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
boliviensis]
Length = 440
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 111/278 (39%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSAHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
Length = 439
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 126/318 (39%), Gaps = 35/318 (11%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFR----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDATLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 RFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
E+ ++ ++ + LF + I F++ + A+ V + V++++
Sbjct: 160 WEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMKRPP 215
Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWCGPQ 330
+ L+ L P LL +S + A ++ + + G+ + + GP
Sbjct: 216 QLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICYGPH 275
Query: 331 PNSKLLINYGFVDEDNPY 348
N +LL+ YGFV NP+
Sbjct: 276 DNHRLLLEYGFVSVSNPH 293
>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 513
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 104/442 (23%), Positives = 163/442 (36%), Gaps = 86/442 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A +D+ G FSVP SL ++ L +++G E + L NK L L +M+E+
Sbjct: 34 ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ S W Y L R A ++P+ W+ EL L G+ +I + E +R+Y
Sbjct: 91 RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141
Query: 230 EL---------DTVWFMAGSLFQQYPYD--------IPTEAFTFEIFKQAFVAVQS---- 268
E+ D F G + Y + I + +F E +K+ QS
Sbjct: 142 EILNPAVRTRPDL--FDPGHIASFYSLENYHVMGSRILSRSFHVEKWKEQTPGSQSRASS 199
Query: 269 -------CV------VHLQKVSLAR--------------RFALVPLGPPLLAYSSKCKAM 301
C+ +L V A+VP+ L A A
Sbjct: 200 ELHENGDCMDIDDESSNLSAVGAENGGDDDSDDEAENPSDIAMVPMADMLNAQYGSENAK 259
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD----------EDNPYDRL 351
L + +V +P + GE I G PNS LL YG VD E NP D +
Sbjct: 260 LFYEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGHVDLVPLPGVPWKEGNPADVV 319
Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS------DMLPYLRLGYV 405
+ A L R+ A+ + K + + G + + D++ +
Sbjct: 320 EIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVFVLGTDLELPDVMISFLKLLL 378
Query: 406 SDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
E + S P P + + + + RLA YP TL DEA+L+
Sbjct: 379 LSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRLAKYPTTLEHDEALLSGQTSL 438
Query: 465 PKKRVATQLVRM-EKKMLNACL 485
P +VR+ EK +L C+
Sbjct: 439 PYNVRNAIIVRIGEKHILVGCM 460
>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
Length = 424
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 76/349 (21%), Positives = 137/349 (39%), Gaps = 59/349 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK---LSELACLALYLMYEKKQG 171
A+ + +G+ +P L+++ + + + + N+ + LAL+L+ E
Sbjct: 43 AAAAVASGEPMLCIPRRLLISEDLCWRDPQLGRVFQDNRDVFTRDDPVLALFLVRELLLA 102
Query: 172 KKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTKAEILER-AEGIKR 226
+SF+ PY LAV ES W++ EL L ER + R
Sbjct: 103 DRSFFHPY------------LAVLPYPESVQDWTQAELGELHD--------ERLVDAAAR 142
Query: 227 EYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
+E+D + Q +YP + P +TF+ FK A+ +Q+ + RR
Sbjct: 143 RTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA-------RTFGRRLPWT 195
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINY 339
ALVP L + K D+ + + + G + G + N +LL++Y
Sbjct: 196 ALVPFADCLNHTNVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNSYGRRSNFQLLLDY 255
Query: 340 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
GF DN +D + VE + P R KL V R+ ++ ++ P
Sbjct: 256 GFALPDNEWDYVDVEIGKDRAGP----------RGRKLRFMKRVVRIDRQS-SLDELFPP 304
Query: 400 LRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
L ++D SE + +S +C + +++ +AD+ A
Sbjct: 305 SFLAGLADPVPDEEQSEAAAELSERTALCDALEWLRSILIETIADWGTA 353
>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 485
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 115/302 (38%), Gaps = 45/302 (14%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
+ EL L+GS +L + E N L + + +P +D P
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185
Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
A+ F+I K + ++ +VPL L A + +
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 357
A L D + + +P + GE I G P + LL YG+V D YD VE ++
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDNYAQYDE--VEISM 303
Query: 358 NT 359
T
Sbjct: 304 RT 305
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 65/260 (25%), Positives = 108/260 (41%), Gaps = 22/260 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + LS L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PL 291
LF + I F++ F A+ V + V++ K R F+ P L P L
Sbjct: 180 LRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYM-KPRRRRCFSAEPDTYALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + +A ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 235 LNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPHA 294
Query: 350 RLVVEAALNTEDPQYQDKRM 369
+ V + + DK+M
Sbjct: 295 CVYVSKDILVKYLPSTDKQM 314
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 94/208 (45%), Gaps = 36/208 (17%)
Query: 81 LKSWMHKNG----LPPCKVI--LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
K+W+ NG L K++ L E + A+ +++ GD VP L +
Sbjct: 70 FKNWLASNGCQESLDKVKIVRTLAEGTG----------LIANTEIKEGDEFIKVPLKLFM 119
Query: 135 TLE---RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
+ E + +G++ E L K+ L ++L+ E ++ ++SFW PYIR L +
Sbjct: 120 SQETAFKSIGDKVSREPLF--KMLPNMLLVIHLIQETQKQQQSFWAPYIRMLPK------ 171
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ ++ L ++ E L GSP +LE E I N L F+ F + P + T
Sbjct: 172 -SYKTALYFTLAEFQLLIGSP----VLE--ESINTYRNTLRQYCFLY-DFFGKNPGILST 223
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
FT+E F+Q +A +V L K LA
Sbjct: 224 SNFTWE-FEQNELAAYKSIVSLLKKRLA 250
>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
Length = 559
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 104/256 (40%), Gaps = 28/256 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A EDL GD +P +++++ E V + L + L+ M EK
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVDSE 248
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L A + L + + L G+ E+++ E ++++YNEL
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
F A L +P P E +++E F A S + + R LVP
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
L P +L Y + + D+++ + RP +AGE + G S L+ YGF+ E D
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGD 409
Query: 346 NPYDRLVVEAALNTED 361
N D + ++ +D
Sbjct: 410 NVNDVIPLDIDFGDDD 425
>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 485
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 115/302 (38%), Gaps = 45/302 (14%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
+ EL L+GS +L + E N L + + +P +D P
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185
Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
A+ F+I K + ++ +VPL L A + +
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 357
A L D + + +P + GE I G P + LL YG+V D YD VE ++
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDNYAQYDE--VEISM 303
Query: 358 NT 359
T
Sbjct: 304 RT 305
>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
Length = 607
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 95/220 (43%), Gaps = 16/220 (7%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+A + +L+ A LAL++++E +S Y+ L G+ +V PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY---DIPTEAFTFEIFKQ 261
+A L G+PT +L RA+ + + L G +++ + + + A + +
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAALFGSGGGGGVPLEKFAWALSSVLSRAASGDRMPY 253
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
AF+ + H V + V L P + + D V V D P AGE
Sbjct: 254 AFLPGVDLLNH-GGVDANCELSAVKLAP------GGNEENVTWGDVEVTCVKDTP--AGE 304
Query: 322 SIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTE 360
+ + G + N +LL YGF N +DR +E L +
Sbjct: 305 QLTISYGDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S W PYI L G +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 74 QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ LF + + E F + V S + +++ LV + P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172
Query: 292 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229
Query: 347 PYDRLV 352
P +RL+
Sbjct: 230 PENRLI 235
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 92/219 (42%), Gaps = 39/219 (17%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 839 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 884
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 321
+P L + +AML DD ++V DR + GE
Sbjct: 885 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922
>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
Length = 427
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 139/343 (40%), Gaps = 46/343 (13%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+N+ +E A LA+ L E+++G S + P++ L+ T
Sbjct: 79 SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121
Query: 211 SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S E L AE + R Y+E D V W A F+ +P + FT F++A V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
++ + + R LVPL L+ +S + VDD + VD ++AG+ +
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239
Query: 327 CGPQPNSKLLINYG----FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL--SVQ 380
G +++ +G + +E+N D + + D+ + + G +
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARDVITF---------TFPDEVHLKEEIGSCGPAED 290
Query: 381 VFHVHAGREKEAISDMLPYLRL--GYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQL 436
+ G A ++++ LRL ++ SEM+ L + P+S E AV D L
Sbjct: 291 IGFTRDG----ASAELMCALRLVSANATEWSEMRKPNFDLQSLKNRPLSEESEVAVYDAL 346
Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 479
L YP + +DE +L L +R A ++ EK+
Sbjct: 347 FATLTDLLNSYPYSDVDDEHLLRGDRLADDERRAVKIRLREKR 389
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 95/227 (41%), Gaps = 33/227 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + G+ VP L++ + L ++ LL +L + C+ L LM E S
Sbjct: 32 ASKRISPGETFLKVPRHLLLGPHQ-LRASSLDRLLEGWQLPD--CMLLLLMCESVN-SSS 87
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F+ PY+ L V++P+ WS+ E L GSP ++ + R + E
Sbjct: 88 FFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE---- 136
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
M +F +YP P F++E ++ A+ ++S + L+PL + +
Sbjct: 137 --MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG--------NYTLMPLIDLMNHH 186
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINY 339
A D + L+ R Y VW G + ++ LL+NY
Sbjct: 187 PDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLLLNY 227
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 22/202 (10%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
PL W++ EL +L G+ I ER ++ ++ + V L ++ P P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298
Query: 257 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKC------------KAMLA 303
E + AF S L+ V+ +G P+ +C A +
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358
Query: 304 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
D +V+ + GE + GP+ N +LL+ YGF +N D + ++ + D
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQLTVGN-D 417
Query: 362 PQYQDKRMVAQRNGKLSVQVFH 383
P + K + + +G H
Sbjct: 418 PDGEAKLAILRWHGLSLTHFLH 439
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 37/71 (52%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
++PL S+K + V++ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 342 VDEDNPYDRLV 352
++ DNP+D +
Sbjct: 231 IEFDNPHDHFI 241
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 57/245 (23%), Positives = 100/245 (40%), Gaps = 24/245 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ D+QAG+ VP L TL+ V + I+ L N +S A LA YL +K
Sbjct: 33 IIATRDIQAGETILFVPFKLFRTLKHV--PKAISRRLPRN-MSLHALLATYLSLDKTD-- 87
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F +P D + P LW EL P ++++ KR+
Sbjct: 88 -TFAIPNKTLPDLSSFEAGM----PFLWP-AELHPFLPKPALDLLMKQQRSFKRD----- 136
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
W + + D A+ + ++F ++++ R A++P+
Sbjct: 137 --WDIVSKAYSNISQDQYLHAWLL-VNTRSFYCTTPI---MERLPHDDRLAILPVADLFN 190
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
C+A A+ + + DR Y+ GE + + G LL YGFV +N +D +
Sbjct: 191 HADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYGFVPTENCWDVVC 248
Query: 353 VEAAL 357
++ A+
Sbjct: 249 LDEAI 253
>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
Length = 418
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 22/172 (12%)
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 256
W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F +
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 311
E K + V S + +++ LV + P+L + +S K + + +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 363
DR Y + I + G N++L ++YGF +NPYD E L T+ P+
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD----ETELLTQFPE 359
>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
Length = 491
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 64/273 (23%), Positives = 110/273 (40%), Gaps = 45/273 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + F +P L++++E E + L +L L + ++YE QG+
Sbjct: 61 LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYNE 230
+S W PY L + ++ + W+E +L L G +I + A E I ++
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVP 171
Query: 231 L--------------------DTVWFMA---GSLFQQYPYDI-PTEAFTFEIFKQAFVAV 266
L D + +A GSL Y +DI TE + + ++
Sbjct: 172 LIQANPHHFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTD 231
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ A+ +VPL A + + A L + + + R +AGE I
Sbjct: 232 -------DEDEPAK--GMVPLADIFNADAQRNNARLFQEEGSFVMKAIRNIQAGEEIFND 282
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
G P + LL YG+V DN VVE +L++
Sbjct: 283 YGELPRADLLRRYGYV-TDNYAQYDVVEFSLDS 314
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 126/322 (39%), Gaps = 65/322 (20%)
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
+P ++ A S+ G A+ +P LVVT ++ G + + T + + L +YL Y
Sbjct: 28 KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKD--TYKNANHSSLLKIYLCYS 81
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ Q +SF PY+ L + A++SP +WS + A L G+ + E + E
Sbjct: 82 RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIP------------------TEAFTFEIFKQAFVAVQSC 269
W+ A +L P D+P T+ + F + + +
Sbjct: 135 -------WWNAINLL---PEDVPKPEQHFINLKFYYENKFYTDDDYYSYFNEVDTSNWTS 184
Query: 270 VVHLQKVSL---ARRFALVPLGPP-------------LLAYSSKCKAMLAAVDDAVQLVV 313
+ SL +R F + P LL ++ K K + D
Sbjct: 185 FPNYLWASLVLKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKTKVQWSGTDGGFLFQS 244
Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
D +GE + G + N +LL+ YGF E+NP D AAL + P D ++ +
Sbjct: 245 DDA-SSGEELFNNYGQKGNEELLLAYGFAIENNPADS----AALKIKIP---DSKLQVVK 296
Query: 374 NGKLSVQVFHVHAGREKEAISD 395
+ + + H + + +SD
Sbjct: 297 DLGIKLPSIHDYTNSVIDQVSD 318
>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
Length = 494
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 112/258 (43%), Gaps = 34/258 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--LTTNKLSELACLALYLMYE-KK 169
V AS+D+ + + F +P S V+ + +L + +L L L ++YE K
Sbjct: 42 VLASQDIGSDEVLFEIPRSSVLNVATSQLVRDFPQLKDVFWQELGHWEGLILCMVYEIKV 101
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKAE----I 217
G++SFW Y++ L + + L + WS +LA L G+ E I
Sbjct: 102 MGQQSFWWNYLQVLPKSQDLNTL-----VYWSADQLAALEPSLVVGRLGADESQEMYRQI 156
Query: 218 LERAEGIKREYN----ELDTVWFM-AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ----S 268
L+ + E+ +L F+ S+ Y +D+ + E + + S
Sbjct: 157 LKYIQNFGPEFQSKIGQLTFEEFVHVASVIMSYSFDVDLKGEDDEDDEDEDEGEEEEGES 216
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
V H + + ++VPL L A + + A L +++++V +P K G+ + + G
Sbjct: 217 NVAHDKYMK-----SMVPLADTLNADTKQFNAHLVYDKESLKMVSVKPIKMGQQVYNFYG 271
Query: 329 PQPNSKLLINYGFVDEDN 346
PN+++L YG+V+ D
Sbjct: 272 EHPNAEILRRYGYVEWDG 289
>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
Length = 486
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 254
S W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369
Query: 255 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 307
+ E K + V S + +++ LV + P+L + +S K + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ DR Y + I + G N++L ++YGF +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 52/243 (21%), Positives = 100/243 (41%), Gaps = 17/243 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ + +A VP ++ ++ + + + L++ L L+++YEK
Sbjct: 787 VVTTKKVDENEAVVVVPKKYLINVDVAKAHPILGPIFEELHLNDDTILFLFVIYEKGNAN 846
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW P+ L + + +S TEL L G+ + E K++ N
Sbjct: 847 -SFWRPFYDTLPS-------YFTTSIHYSATELLELEGT----NLFEETLHTKQQLNSFR 894
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
F L +QYP P F++E F A + S + L K+ + + LVP+ +
Sbjct: 895 DYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIKSCLVPMADMIN 951
Query: 293 AYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
+++ + + +++ A I + G N +L + YGF+ +N YD
Sbjct: 952 HHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQNWELALYYGFIIPNNIYDS 1011
Query: 351 LVV 353
L +
Sbjct: 1012 LHI 1014
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 62/263 (23%), Positives = 107/263 (40%), Gaps = 50/263 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
+ A ++ + F++P +LV++ + N + +LL + L CL L ++YE Q
Sbjct: 50 IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI--LERAEGIKRE- 227
G S W Y + L + ++ + W++ EL L+GS +I + I R+
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGSAVLNKIGKSDAEAAILRDI 158
Query: 228 -------------------YNELD---TVWFMA---GSLFQQYPYDIPTEAFTFEIFKQA 262
Y+ D T+ +A GSL Y +DI E +
Sbjct: 159 FPIVSTNPHLFPPISGLGSYDSPDGRATLLSLAHRMGSLIMAYAFDI-------EKGEDE 211
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
VQ + + L + +VPL L A + + A L D + + +P + GE
Sbjct: 212 EGEVQDGYITDEGEELTK--GMVPLADLLNADADRNNARLFQEDGYLAMKSIKPIRNGEE 269
Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
I G P + LL YG+V ++
Sbjct: 270 IFNDYGELPRADLLRRYGYVTDN 292
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 72/336 (21%), Positives = 141/336 (41%), Gaps = 38/336 (11%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIREL 183
++P+ L+ T++R + + LL++ + LS LA Y+++ + + K + P
Sbjct: 41 LTIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLATYILFIRAR-KSGYDGP----- 94
Query: 184 DRQRGRGQL--AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Q L + S + +++ EL GS A I+ +Y +L L
Sbjct: 95 --QSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARL 146
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKC 298
F ++ P++ FT + +K A V S + K+ L+ +L +S +C
Sbjct: 147 FGRHRDVFPSDKFTIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQC 205
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
+ + ++ + Y+ G+ + + GP PN++L YGFV NP D + + +
Sbjct: 206 HVYDPQSGN-LSILAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPNDSYDLVLSTH 264
Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQS 413
P ++ K + G S + ++D LP YLR+ +++T ++ +
Sbjct: 265 PMAPFFEQKHKLWIAAGLDSTSTVSL-------TLTDPLPRSVLRYLRIQRLNET-DLAA 316
Query: 414 VISSLGPIC--PVSPCMERAVLDQLADYFKARLAGY 447
V + + +S E VL L + A L G+
Sbjct: 317 VGTRQSDVAFEKISDSNETEVLTFLVESISALLDGF 352
>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
Length = 397
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 25 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 83
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 84 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 135
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 136 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 191
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 192 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 251
Query: 350 RLVVEA-----ALNTEDPQYQDKRMVAQRNG 375
+ V A L D Q K + + +G
Sbjct: 252 CVPVSADMLVKFLPAADKQLHRKITILKDHG 282
>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
Length = 440
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 111/278 (39%), Gaps = 27/278 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294
Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
Length = 378
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 6 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 64
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 65 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 116
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 117 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 172
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 173 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 232
Query: 350 RLVVEAA-----LNTEDPQYQDKRMVAQRNG 375
+ V A L D Q K + + +G
Sbjct: 233 CVPVSADMLVKFLPAADKQLHRKITILKDHG 263
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 70/323 (21%), Positives = 136/323 (42%), Gaps = 47/323 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
L+ W+++ G +I P+++ + +++ ++A ++ S+P ++
Sbjct: 37 LRRWLNERGFTSQSLI------------PVNFHGNGRGLMSTQTIKAKNSLISLPEECLL 84
Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T VL + +A+ + +S L L +L+ E+ G+ S W PYI L +
Sbjct: 85 TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PL + + + L S K + ++ E + ++ T + LF Q PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190
Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAA 304
F+ + + A+ +V + V H Q L+R + L P LL + + + +A
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNK 250
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNT 359
++ K + + GP N +LL+ YGFV NP+ D ++ L+
Sbjct: 251 ETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDE 310
Query: 360 EDPQYQDKRMVAQRNGKLSVQVF 382
+D Q ++K + + N L F
Sbjct: 311 KDKQLKEKLLYLKDNDFLRNLTF 333
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 96/234 (41%), Gaps = 31/234 (13%)
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+T+E +LG LS L+ Y+ +EK++G SFW P+I L + LA
Sbjct: 135 LTMEEMLG------------LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA 181
Query: 194 VESPLLWS------ETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYP 246
PL+W EL L + TK + + + + +YN + D + +
Sbjct: 182 ---PLVWKVLKVDHYEELLKLLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNER 238
Query: 247 YDIPTEAFT----FEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCK 299
+ T+A E++ +++ + S +++ Q + A F + P L +S +
Sbjct: 239 SNDLTDAIRHLVPIELYLWSWMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQ 297
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
L Q+ Y E + + GP N LL YGF +N ++ L V
Sbjct: 298 CGLKIDGTGFQVYTTCSYNPDEQLFLSYGPHSNEFLLCEYGFTLPENKWNDLDV 351
>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
Length = 491
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 68/320 (21%), Positives = 118/320 (36%), Gaps = 67/320 (20%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM +G V+ + S + +V A L+ GD ++P +T R
Sbjct: 13 FKRWMRAHG-----VVCSDALSLDVSDPLGVHVRAVTPLRDGDLVATIPRGACLT-PRTT 66
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLL 199
G +L LA+ +MYE+ +G S W Y++ L DR+ PL+
Sbjct: 67 GAAAAI---EAAELGGCLALAVAVMYERARGTDSPWDAYLQLLPDRE--------SVPLV 115
Query: 200 WSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W E L G+ + + E + ++ E ++G L D+ + F+ E
Sbjct: 116 WPADEAECLLAGTELDKIVKQDREFLCEDWKECIEPLLLSGEL------DVDPDDFSLEK 169
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML---------------- 302
+ A V S + F +VPL L + + C+ +
Sbjct: 170 YFSAKTLVSSRSFQIDSY---HGFGMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEDAD 225
Query: 303 ----------------------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
+ D+ +++++ R GE + G N+ LL YG
Sbjct: 226 DDQSDASADDESTIENPTSSSPGSKDEDLEMIIVRDVNEGEEVYNTYGTMGNAALLHRYG 285
Query: 341 FVDEDNPYDRLVVEAALNTE 360
F + DN YD + ++ AL T+
Sbjct: 286 FTELDNQYDIVNIDLALVTK 305
>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
Length = 447
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 89/450 (19%), Positives = 182/450 (40%), Gaps = 74/450 (16%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT-----LER 138
W NG K I+ + + R + VA D++A + F++P +V++ +
Sbjct: 10 WFTTNGGEFSKDIVAIGENVDGMGRGLVAVA---DIKAQTSLFTIPRDIVLSTRTSSFKE 66
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+G + +L N + L + + +E QG S W Y + L +Q S +
Sbjct: 67 KVGQDVYKQLENDN-IGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLM 118
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
WS+ EL+ L G+ +I E I+ E+ + + ++F DI +T ++
Sbjct: 119 FWSKEELSLLKGTTVVDKI--GLEDIENEFERVRDIVKQNENVFG----DIAN--YTLDL 170
Query: 259 FKQ--AFVAVQSCVVHLQKV---------------------SLARRFALVPLGPPLLAYS 295
FK+ + + +S V K + A+VP+ L + +
Sbjct: 171 FKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRT 230
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-----ED----- 345
A ++ ++++ + KAG+ I PN+ L+ YG VD +D
Sbjct: 231 DSVNAHTEYEENCLRMISLQDIKAGDQIFNTYNDPPNADLIRRYGHVDYSPLSQDPDFMG 290
Query: 346 NPYD------RLVVEAALNTEDPQYQDKRM--VAQRNGKLSVQVFHVHAGREKEAISDML 397
N D +++E AL ++++R+ + G+ S ++ H + + ++L
Sbjct: 291 NKNDVVELPADILLELALPDAKESHKERRVEFLLDECGEDSFELTH------DDLVPELL 344
Query: 398 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
L + +E ++ S P + + + L K R+ Y +TL +D +
Sbjct: 345 KICVLLFTESEAEFKTREKSRK--LPKASGFTKGKAEFLIKAIKQRMEQYGSTLEDDISK 402
Query: 458 LTDYNLHPKKRVATQLVRM-EKKMLNACLQ 486
L + + P+ +V + E+++LN ++
Sbjct: 403 LDNKDSLPENNFKALVVTVGERRILNKAIE 432
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D W+ K G + E + E R V A +++ G +VP L+++
Sbjct: 5 DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58
Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ AE LL TNK + LA +L++E +G++SFW PY+ L RQ
Sbjct: 59 KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDXELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 36/74 (48%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R AL+PL + YS M D A+ L V + G+ GP N LL YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274
Query: 341 FVDEDNPYDRLVVE 354
FV++DNP D V+E
Sbjct: 275 FVEQDNPSDTYVLE 288
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 47.0 bits (110), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 118/277 (42%), Gaps = 47/277 (16%)
Query: 100 KPSHNEKH-RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK----- 153
KP E+ R I +A E++ FS+P ++++ + + + +IA + +K
Sbjct: 35 KPHDGERGVRVISDIAPCEEM------FSIPEKILMSRKSCMAS-SIAHVFRKHKDVLFS 87
Query: 154 -LSELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
ELA L L ++YEK QG SFW P I L G + WSE EL L
Sbjct: 88 SRDELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDE 139
Query: 212 PTKAEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
KAE + +++ Y L + G +F + +T+E F+ A + V+S
Sbjct: 140 SLKAEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRT 192
Query: 271 VHLQKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGES 322
RF ++VP L + + + + D ++ GE
Sbjct: 193 FG--------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEE 244
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
+ GP+ N++LL++YGF + N Y+ + + +NT
Sbjct: 245 AFMSYGPRSNAELLLHYGFALQSNRYEAVELNFRINT 281
>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 691
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 15/193 (7%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYI L + G L +PL + +L +L G+ + A E+
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168
Query: 222 EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
E + +E + EL F ++Y +++ A T + + V + VV ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
L+P +L + K A + V VV AGE + GP+ N +L+
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLM 284
Query: 337 INYGFVDEDNPYD 349
+NYGF ++NP D
Sbjct: 285 MNYGFCLQNNPCD 297
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 70/317 (22%), Positives = 120/317 (37%), Gaps = 36/317 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFEGYGRG------LCANKEFKKDEVIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ PL ++ EL + G+ A + E+ + V + L QQ+P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYF-- 157
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
++F ++FK+ A QS V + F V P +S+ C Q
Sbjct: 158 QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQT 216
Query: 312 ------VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
+ K GE I + N KLL+ YGFV+E+NP D L++ +D QY
Sbjct: 217 ETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYN 276
Query: 366 DKRMVAQRNGKLSVQVF 382
+ + ++ S F
Sbjct: 277 EIEEILKQEEIKSFDFF 293
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 46.6 bits (109), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 36/72 (50%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
++PL S+K + V+ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 342 VDEDNPYDRLVV 353
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 86/392 (21%), Positives = 163/392 (41%), Gaps = 37/392 (9%)
Query: 81 LKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +W++K+G + + L + P V A + + ++P + + T++
Sbjct: 1 MDAWLNKSGAVGLGDLDLADFPETGRG------VKAQRPFKEDERILTIPANCLWTVKGA 54
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +L + + LS LALY+++ + +G+ + +RQ L E
Sbjct: 55 YADPLFGPVLQSVQPPLSVEDTLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYT 108
Query: 198 L--LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAF 254
L +++ EL GS + +Y +L T FM LF P + F
Sbjct: 109 LSMYFTDEELRVCAGSSLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKF 161
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
+F+ +K A ++ S + +S L+ +L ++S K A L V
Sbjct: 162 SFQHYKWALSSIWSRGMDF-TISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVL 220
Query: 314 -DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 372
R Y+ G+ + ++ G NS+LL YGFV DNP D + ++ P Y+ K Q
Sbjct: 221 ACRDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMAPLYEQK----Q 276
Query: 373 RNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG--PICPVSPCM 428
R KL+ ++ + + +L YLR+ + D S++ ++ + +S
Sbjct: 277 RLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIATESYTKISDEN 335
Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
E +L L+ +A L G+ +L + E L +
Sbjct: 336 ESQILLFLSQSIEALLEGFEISLEKLETQLAE 367
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 111/277 (40%), Gaps = 24/277 (8%)
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ +S+ E+ GS + I +Y +L T M ++ P F E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 314
+K A V S + VS L+ +L +SS +C A D + ++
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
+ Y G+ + ++ GP PN++LL YGFV +NP+D + + P Y+ K + +
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERLWKLA 286
Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPICPVSPCM 428
G + + A +D LP L Y+ +S++ ++ G +S
Sbjct: 287 GLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADEKISDDS 339
Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 465
E +L L D A L G+ L A L +++P
Sbjct: 340 ETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376
>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
Length = 484
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ G +P S + + N +IA LL +++ + L + +YE K
Sbjct: 40 VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W PY++ + R L V P WSETE L GS
Sbjct: 96 EKSHWFPYLKSI-RIYNDDGLLVLPPSHWSETEKLLLKGS 134
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 95/230 (41%), Gaps = 23/230 (10%)
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P ++ ++ N + + L++ L L+++YEK + +FW P+ L
Sbjct: 849 PRKYLINVDVAKSNPILGPIFEELHLNDETILFLFVIYEK-ENPNTFWRPFYDTLPS--- 904
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK--REYNELDTVWFMAGSLFQQYP 246
+ + +S TEL L G+ AE L + ++ R+Y + L QYP
Sbjct: 905 ----YFTTSIHYSSTELLELEGTNLFAETLAVKQQLQAFRDY--------LFPELSNQYP 952
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
P F++E F A + S + L K+ + LVP+ ++ + + + D
Sbjct: 953 DIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMAD-MINHHTNAQISERHFD 1010
Query: 307 ---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ ++V A I + G NS L + YGFV +N YD V
Sbjct: 1011 QDSNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGFVIPNNIYDSFHV 1060
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 46.6 bits (109), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 82/202 (40%), Gaps = 27/202 (13%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L L +++E +G S W PY+ L + E+P+ WS ELA L SP A +
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155
Query: 219 ERAEG-------IKREYNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 261
RAEG I E + ++F AG L + I AF E
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215
Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
A + + R +VP+ +L ++ A + ++A+ + R AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274
Query: 321 ESIVVWCGPQPNSKLLINYGFV 342
E I+ + GP PN +L YG+
Sbjct: 275 EEILNYYGPLPNGQLCRRYGYT 296
>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
Length = 452
Score = 46.6 bits (109), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 77/192 (40%), Gaps = 36/192 (18%)
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
V+ PL W + + L G ++ R Y+++ F A + F + EA
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHG---ARFYSQVYKHLFGANNQF------VNAEA 193
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY---SSKCKAMLAAVDDAVQ 310
F + I S ++ + FAL+P S C+ L + D+ VQ
Sbjct: 194 FFWAI---------SILMSRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQ 243
Query: 311 ---------LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL-- 357
+ R Y+ GE + + G N +LL NYGF +NPYD L + AAL
Sbjct: 244 DFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQ 303
Query: 358 -NTEDPQYQDKR 368
N DP + KR
Sbjct: 304 PNEADPAFAQKR 315
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 46.6 bits (109), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 5 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 58
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 59 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 114
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 115 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 162
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 163 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 205
>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
Length = 439
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 67 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 125
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 177
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 178 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 233
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 234 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 293
Query: 350 RLVVEAA-----LNTEDPQYQDKRMVAQRNG 375
+ V A L D Q K + + +G
Sbjct: 294 CVPVSADMLVKFLPAADKQLHRKITILKDHG 324
>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
Length = 439
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 67 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 125
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 177
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 178 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 233
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 234 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 293
Query: 350 RLVVEAA-----LNTEDPQYQDKRMVAQRNG 375
+ V A L D Q K + + +G
Sbjct: 294 CVPVSADMLVKFLPAADKQLHRKITILKDHG 324
>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
Length = 485
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 67/318 (21%), Positives = 116/318 (36%), Gaps = 61/318 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ AS D+ + F +P L+++++ + L +L L + ++YE QG+
Sbjct: 51 ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W PY L + ++ + W++ +L+ L GS +I + A D
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKIGKAAAD--------D 153
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
T+ L Q P + + S +SLA R A + ++
Sbjct: 154 TILQKVVPLIQANSLHFPP--------RSDMPPLNSPDSQSALLSLAHRMASL-----IM 200
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
AY+ + A +D + Y DED P +V
Sbjct: 201 AYAFDIEKAEEADEDTAE--------------------------DGYMTDDEDEPAKGMV 234
Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDT 408
A + D Q + R+ + + V ++H+G E LP R GYV+D
Sbjct: 235 PLADIFNADAQRNNARLFQEEGSFVMKAVRNIHSGEEIFNDYGELPRADLLRRYGYVTDN 294
Query: 409 SEMQSVIS-SLGPICPVS 425
V+ SL IC V+
Sbjct: 295 YTQYDVVEFSLDSICKVA 312
>gi|156374449|ref|XP_001629819.1| predicted protein [Nematostella vectensis]
gi|156216828|gb|EDO37756.1| predicted protein [Nematostella vectensis]
Length = 281
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 64/278 (23%), Positives = 116/278 (41%), Gaps = 30/278 (10%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHY-VAASEDLQAGDAAFSVPNSLVVT-----LE 137
W H N L L K S +K Y + A ED+ + F VP L++ +
Sbjct: 22 WCHDNDLK-----LNNKVSSMQKGSCHRYGMVAMEDISPDECLFKVPRGLLLEPKTCGIS 76
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
++L + I +L+ ++ L L LMYE S W PY+ + G ++ P
Sbjct: 77 KILTGKVIQNMLSQHE--GWVPLLLALMYEYTN-PTSLWKPYMDIV-----PGIDILDQP 128
Query: 198 LLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+ W ET + L G+ + ++ + + I+R+Y + +A + +++ + +
Sbjct: 129 MFWPDETRQSLLQGTGFEDDVEDDKQRIERQY------FTVAVPIMKKFKKFFDLKRHSL 182
Query: 257 EIFKQ--AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 314
++K AF+ S +VP+ +L + S A L ++ + +V
Sbjct: 183 SLYKHMAAFIMAYSFTEDSPSFHGNNVPVMVPMAD-ILNHHSNNNARLEFGEEELSMVST 241
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRL 351
+ G + G N LL +YGFV+ DNP D +
Sbjct: 242 QHILKGGEVFNTYGQLANCHLLQSYGFVEGPDNPNDTV 279
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 130/313 (41%), Gaps = 43/313 (13%)
Query: 158 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
AC L L+ E+ Q FW PYIR L + ++PL +++ E+ L G+
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159
Query: 215 A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 266
A ++L R + E+ F+ G ++Y +D+ A +T F V +
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217
Query: 267 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
S + +S F L+PL +L + K + + L+ G +
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 383
GP+ N +LL+ YGFV +NP D L ++ ++ P+ Q ++ QR K + +VFH
Sbjct: 277 NNYGPKGNEELLMGYGFVIPENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFH 333
Query: 384 VHAGREKEAISDMLPYLRLGY-----------VSDTSEMQSVISSLGPICPVSPCMERAV 432
+ + + +P L + V++ +E+ + + P+S E AV
Sbjct: 334 LTKSADSGQKTSTVPALESDWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAV 393
Query: 433 LDQLADYFKARLA 445
L K +LA
Sbjct: 394 ALGLKAAIKQKLA 406
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 67/307 (21%), Positives = 120/307 (39%), Gaps = 58/307 (18%)
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
L++ + I + + L+ + C +K S WLPY+ L +
Sbjct: 142 LKKCFEQDMIVKTMDNVALALMVCC-------QKLSPDSSWLPYLDALPQ-------TFS 187
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS--------------- 240
+PL +S EL L+ SP E L + R++ V+F+A
Sbjct: 188 TPLYFSALELRKLSPSPAYEESLIMYRNVARQF-----VYFLAAVQRSERSRSAKKDKNH 242
Query: 241 -------LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL----QKVSLARRFALVPLGP 289
LF P+ + FTF++++ A V + + + K S + A VP
Sbjct: 243 AAVGMEPLFLNAPFTVSN--FTFDLYRWAVACVTTRINFIPSQYAKDSNGQPVA-VPCLI 299
Query: 290 PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-D 343
PLL + + D + + YKAG+ + ++ G + N + ++ GFV D
Sbjct: 300 PLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFYGIRTNRQFFLHNGFVPD 359
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYL 400
+N D ++ D Q + + + A N + V VF V+A +S +L +
Sbjct: 360 GENKNDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVFEVNASERPVPLS-LLDFA 418
Query: 401 RLGYVSD 407
R+ V +
Sbjct: 419 RVFLVEN 425
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 62/285 (21%), Positives = 112/285 (39%), Gaps = 44/285 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW + G+ + + PS + A+ D+QAG+ VP + +L+ V
Sbjct: 6 ELLSWATERGVKLSGIKPQNIPSRGTG------IIATRDIQAGETILFVPFKVFRSLKHV 59
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVESP 197
+ IA L N +S A LA YL +K + LP + + P
Sbjct: 60 --PKAIARRLPRN-MSLHALLAAYLTLDKTDTFAIANQTLPDLSSFE---------AGMP 107
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LW EL P + ++ +R++ + V+ + E
Sbjct: 108 FLWP-AELHPFLPKPALDLLKKQQRNFQRDWATVSKVY----------------SNVSHE 150
Query: 258 IFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ +++ V + + S+ R R A++P+ C+A A+ + +
Sbjct: 151 QYLHSWLLVNTRSFYCTTPSMERLPHDDRLAILPVADLFNHADVGCEAQFAS--ENYSFI 208
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
DR Y+AGE + + G LL YGFV +N +D + ++ A+
Sbjct: 209 ADRTYRAGEELYISYGTHSTDFLLAEYGFVPAENRWDVVCLDEAI 253
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 59/265 (22%), Positives = 111/265 (41%), Gaps = 35/265 (13%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 42 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 94
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 95 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 150
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEI----FKQAFVAVQSCVVHLQKVSLARRFALVPL 287
LF Q+P P + FT E +K A V S + LA ++ L
Sbjct: 151 VV------RLFVQHPDLFPLDKFTVEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLL 201
Query: 288 GP--PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P +L ++S+ K + + + + Y+AG+ + + GP PNS+LL YGFV
Sbjct: 202 APFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVI 261
Query: 344 EDNPYDRLVVEAALNTEDPQYQDKR 368
NP D + + + + P ++ K+
Sbjct: 262 PGNPNDSYDLVLSTHPQAPFFEQKQ 286
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 120/318 (37%), Gaps = 38/318 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFDGYGRG------LCANKEFKKDEIIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
+ PL ++ EL + G+ A + E+ + V + L QQ+P Y P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ ++FK+ A QS V + F V P +S+ C Q
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQ 215
Query: 311 L------VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
+ K GE I + N KLL+ YGFV+E+NP D L++ +D QY
Sbjct: 216 TETFSFQTNEALVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQY 275
Query: 365 QDKRMVAQRNGKLSVQVF 382
+ + ++ S F
Sbjct: 276 NEIEEILKQEEIKSFDFF 293
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 104/244 (42%), Gaps = 22/244 (9%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
V A +D+ G+ F +P +S++ L L ++ T + L L L+YE K
Sbjct: 41 VIAVKDIAEGEVLFEIPRDSILNVLTSSLSSDFSDLEETLQSIGSWEGLILCLLYEWKGK 100
Query: 172 K-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
K KS W Y L A+ + W+E EL +L S I +++ K Y++
Sbjct: 101 KEKSKWWKYFNVLPSSN-----AMNGLMYWNEQELEHLRPSLVLDRIGKKSA--KNMYHK 153
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFE---IFKQAFVAVQSCVVHLQKVSLARRF----- 282
+ T+ + S F + ++ E F + I +F L + +
Sbjct: 154 VLTL--VKESKFPEVLCNVEWEDFVYAASVIMAYSFDVENGESQTLNEEDDDQDEEENTG 211
Query: 283 ---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
+++PL L + + +C A L D +++ +P K GE + G PN+++L Y
Sbjct: 212 YIKSMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNAEILRRY 271
Query: 340 GFVD 343
G+V+
Sbjct: 272 GYVE 275
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 46.2 bits (108), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 124/318 (38%), Gaps = 38/318 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGIIDGVDVKTFEGYGRG------LCANKEFKQDEIIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L +
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWPYINVLPK------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
+ PL ++ EL + G+ + E+ + V + L QQ+P Y P
Sbjct: 107 -TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-KAMLAAVDDAV 309
+ ++FK+ A QS V + F V P +S+ C +A + + +
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTR 215
Query: 310 QLVV-----DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
+ K GE I + N KLL+ YGFV+E+NP D L++ +D QY
Sbjct: 216 TETFSFQTNEEVVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQY 275
Query: 365 QDKRMVAQRNGKLSVQVF 382
+ + ++ S F
Sbjct: 276 NEIEEILKQEEIKSFDFF 293
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++PL L A +SKC A L +++++ +P GE + G PNS+LL YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272
Query: 343 D-EDNPYD 349
+ E + YD
Sbjct: 273 EWEGSKYD 280
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 46.2 bits (108), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
S CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251
Query: 355 AAL 357
+
Sbjct: 252 EVI 254
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 45.8 bits (107), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 141/353 (39%), Gaps = 59/353 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
Q G+ ++P + T+E + + +L + + LS LA+YL++ +
Sbjct: 34 FQQGERILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDTLAVYLLFVR-------- 85
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+RE + R +A S + ++E EL G+ + E I+ +Y
Sbjct: 86 ---LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR-- 140
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ +F Q+P +P + + +K A V S + ++P G PL
Sbjct: 141 ----VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDF----------VLPNGKPL 186
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYG 340
+ +S + K A + L V + Y+ G+ I + G PN++LL YG
Sbjct: 187 RVLAPFADMINHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYISYGSIPNNRLLRLYG 246
Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-- 398
FV +NP D + + + P Y+ K+ + G S + + D LP
Sbjct: 247 FVIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPL-------TLIDPLPKS 299
Query: 399 ---YLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGY 447
YLR+ + D S++ ++ + L +S E +L L + A L G+
Sbjct: 300 VLRYLRIQRL-DASDLAAIALQKLDTNEKISNSKEVEILQFLVESISALLDGF 351
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 45.8 bits (107), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
A+VPL L A + A L D +++ R K GE I G PNS LL YG
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404
Query: 342 VDEDNPYDRLVVEAALNT 359
VD+ N +D VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
CIRAD86]
Length = 449
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 99/250 (39%), Gaps = 36/250 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A+ DL + + F +P + ++T E + I + LT LS L L +++E G
Sbjct: 42 VVATSDLTSDEEIFRIPRTSILTTETTDLPQEILQQLTDPWLS----LILAMIFEYLLGT 97
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY--- 228
S + PY+ L + + + W++ EL YL GS ++I E A+ E
Sbjct: 98 NSRFKPYLDILPE-------SFNTLMFWTDNELQYLQGSAILSKIGKEEADNTFSEQLLP 150
Query: 229 ----------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
+L + GS+ Y +D+ T +
Sbjct: 151 IITKNPEIFKIGTCNNQDLLALCHRMGSIIMSYAFDLDPPPTT--TTSSSEEWESDSDSE 208
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
+K+S AL+PL L A + L D+ + +P AGE ++ GP P
Sbjct: 209 NEKISPK---ALIPLADMLNANGDLTNSKLFFSSDSFIMKTLQPVAAGEELLNDFGPLPP 265
Query: 333 SKLLINYGFV 342
+ LL YGFV
Sbjct: 266 ADLLRRYGFV 275
>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
Length = 459
Score = 45.8 bits (107), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 108/262 (41%), Gaps = 41/262 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYE 167
+ A E+L+ G+ F V S V++ E +L ET + + S L LMYE
Sbjct: 53 MVAQEELEEGECLFKVDKSAVLSTETTEIAHLLKEETSLHGDSLHGDSGWVPQILALMYE 112
Query: 168 KKQGKKSFWLPYIR------ELDRQRGRGQLAVESPLLWSETELAY---LTGSPTKAEIL 218
S W PY++ +LD+ P+ W+E E+ TG P +
Sbjct: 113 YT-NPNSRWRPYLQLVPDFSQLDQ-----------PMFWTEDEIERDLCNTGIPEASS-- 158
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKV 276
+K EY L A +++ + E +FE++K+ AF+ S +
Sbjct: 159 SDLTKMKLEYTSL------ALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFFEPVNGR 212
Query: 277 SLARRFALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+ +PL P+ L + +K A L D +++V R AGE + G N
Sbjct: 213 EDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMVTTRTVAAGEEVFNTFGQLANW 272
Query: 334 KLLINYGFVDE--DNPYDRLVV 353
+LL YGF + +N YD + +
Sbjct: 273 QLLHMYGFAEAWPENIYDTVDI 294
>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 693
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 84/194 (43%), Gaps = 11/194 (5%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL++ +++ + K S W PY+ L R + S L + +L +L G+
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+K EY+ ++ G L + Y +D+ A+T I +AF + + V+L +
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220
Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
++ + PL+ S+ K A + L V P + E I GP N +L
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQL 280
Query: 336 LINYGFVDEDNPYD 349
+ YGF DNP D
Sbjct: 281 MTTYGFCIVDNPCD 294
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 65/288 (22%), Positives = 120/288 (41%), Gaps = 39/288 (13%)
Query: 81 LKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT---L 136
L SW+ H +G +I+ + ++ + A+ DL AG S P++L +T
Sbjct: 10 LGSWLRHHDGFIHEHLIVVQDELGDKS------IIATTDLPAGTCIASCPHTLAITPTSA 63
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL--PYIRELDRQRGRGQLAV 194
LG+ +LS+ + LYL+ K L Y+ L + A+
Sbjct: 64 RAALGHHA-------TELSDHQAMVLYLVLHKHPSPAVCCLHQAYVDTLPPRS-----AM 111
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTE 252
+PL ++ E+ L G+ + +R + E+ TV AG LF+ E
Sbjct: 112 RTPLWFNPAEVQLLQGTNLAGAVTDRQRDWQLEWM---TVLRRAGQSGLFKASF----EE 164
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDA 308
+ ++ ++ ++ HL + + + L P + A++ K ++
Sbjct: 165 TWPSALWAATILSSRAFPSHL--IDGNEQASTPVLFPGVDAFNHQQARKVTWQTSSASGR 222
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
LV D P AG+ + GP+ N + L+ YGF+ +NP D +V++ A
Sbjct: 223 FNLVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270
>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 701
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + G+ V + P R A+ D+ GD SVP ++T E + +
Sbjct: 37 WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90
Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ + L +++S E L L ++ E +G +S + PY+ L R A
Sbjct: 91 ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145
Query: 196 SPLLWSETELAYLTGS 211
SPL WS ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R AL+P+ S C + + + DR Y+A E + G N LL YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235
Query: 341 FVDEDNPYDRLVVEAAL 357
F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252
>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
dendrobatidis JAM81]
Length = 607
Score = 45.8 bits (107), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L LK W +N + + ++ + N R V A + L+ GD ++P +++++
Sbjct: 4 LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N +A++L L L + LM+E+ G+KS W YI+ L L P
Sbjct: 60 ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108
Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
L W + + A L G+ A +LE + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 45.8 bits (107), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 45.4 bits (106), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R +P+ L + CK + +A+ +VQ DR YK GE + V GP N LL YG
Sbjct: 178 RLVCMPVAD-LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYG 234
Query: 341 FVDEDNPYDRLVVEAAL 357
F+ + N +D + ++ +
Sbjct: 235 FILDTNRWDEVYLDEVI 251
>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 684
Score = 45.4 bits (106), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 84/194 (43%), Gaps = 11/194 (5%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +++ + K S W PY+ L R G+L S L + +L +L +
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165
Query: 220 RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+ +K EY+ ++ G L + Y ++I A+T I +AF + + ++ K
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223
Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R+ + PL+ S+ K A + L V P A E I GP N +L
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQL 283
Query: 336 LINYGFVDEDNPYD 349
+ YGF DNP D
Sbjct: 284 MATYGFCIVDNPCD 297
>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
Length = 370
Score = 45.4 bits (106), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 65/279 (23%), Positives = 113/279 (40%), Gaps = 48/279 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE--------LACLALYLMY 166
A + + G +P+ L++T GN+ L N + + ++++L +
Sbjct: 38 AKKPITKGSILLEIPDPLLIT-----GNKVCKWLERNNWIGHQQISSVQGVLLVSIFLFF 92
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
E +Q SFW PY++ L L + LL +Y+T +A+I++ E ++R
Sbjct: 93 ESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQMVESVRR 141
Query: 227 EYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ FQ Y P+ D F+ F + + AV S + +L
Sbjct: 142 ----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICYLPDDIAG 191
Query: 280 RRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLV-VDRPYKAGESIVVWCGPQPNSKLL 336
ALVPLG A + + A + + R + G + V G N++L+
Sbjct: 192 ---ALVPLGDIFNHEAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYGALSNTELM 248
Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
+ YGF DNP+D L E ++ + R+V R G
Sbjct: 249 MYYGFTLNDNPWDTLSFYPHELDESIKFYE-RVVLDREG 286
>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
Length = 619
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 51/207 (24%), Positives = 85/207 (41%), Gaps = 19/207 (9%)
Query: 157 LACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L ++L+YE + +KS PY+ L R+ + L + E E+A L +
Sbjct: 106 LIVFYMFLIYELHVEKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYK 158
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + +K+ Y E + M +YP + F++E F AF AV S V ++
Sbjct: 159 SVQSIRQNLKQIY-ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEY 212
Query: 276 -VSLARRFALVP-LGPPLLAYSSKCKA---MLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+VP L P + + K A D L K+G+ + G +
Sbjct: 213 PAENGEGVEIVPTLLPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAK 272
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAAL 357
N L++YGFV +N D L V+ +
Sbjct: 273 SNDSFLLSYGFVIPNNSEDTLYVQFGI 299
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
++ L+R FA G L +S Q+V ++ +K G+S+ + G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241
Query: 335 LLINYGFVDEDNPYDRLVV 353
LL++YGF+ DNP D V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 45.4 bits (106), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 91/227 (40%), Gaps = 23/227 (10%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
LYL K K W PYI L + PL ++ EL + G+ A + E+
Sbjct: 42 LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+ V + L QQ+P ++F ++FK+ A QS V +
Sbjct: 93 NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144
Query: 282 FALVPLGPPLLAYSSKC-KAMLAAVDDAVQLVV-----DRPYKAGESIVVWCGPQPNSKL 335
F V P +S+ C +A + + + + K GE I + N KL
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 204
Query: 336 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
L+ YGFV+E+NP D L++ +D QY + + ++ S F
Sbjct: 205 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251
>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
Length = 766
Score = 45.1 bits (105), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 67/164 (40%), Gaps = 23/164 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
V A ED++ +A VP L++T+E + I + NK E L +++
Sbjct: 27 VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE ++G +SFW PY+ L VE WS++++ + E I
Sbjct: 87 LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+ E+ E++ L YP + +F + F V S
Sbjct: 139 EIEWKEIEL-------LLLHYPQHFSSALIDKHLFMRIFAQVCS 175
>gi|321462357|gb|EFX73381.1| hypothetical protein DAPPUDRAFT_58066 [Daphnia pulex]
Length = 425
Score = 45.1 bits (105), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 67/306 (21%), Positives = 118/306 (38%), Gaps = 44/306 (14%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPC-KVILKEKPS-HNEKHRPIHYV 113
R+ + + SR + + +L WM NG K L KP+ N R +
Sbjct: 8 RIRNRLVRIVHSRPLRIDSHSEFVELCKWMSANGWNAVSKNCLVTKPALFNSTGRGL--- 64
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
A ++ +P SL++T E+VL I++LL + ++ CL +++ K G
Sbjct: 65 MAMSNIAPNHLLVQIPQSLLITKEKVLAE--ISDLLQFS-MTTAECLTFFILNSKFNGLY 121
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S YI L + G L E+A L S + +I+ + ++Y ++
Sbjct: 122 S---SYISTLPKSFSVGGLC-------KSQEIAALP-SFLQEKIMCNQNFVLKKYEKIFA 170
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL--------- 284
+W I + E+F+ A+ V + V Q S L
Sbjct: 171 IW-----------RKIYGSTLSLELFQWAWFCVNTRAVFYQD-SKQHSHGLNKVDGMENN 218
Query: 285 VPLGPPLLAYSSKCKAMLAA----VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
+ L P L ++ + ++ A ++ DR K + + + GP N KL + YG
Sbjct: 219 MALAPYLDMFNHDAEVVVEAGFNKTTQCYEIRSDRHIKKYQQVFINYGPHDNMKLFLEYG 278
Query: 341 FVDEDN 346
F+ N
Sbjct: 279 FLATKN 284
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 45.1 bits (105), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 67/141 (47%), Gaps = 17/141 (12%)
Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+T+ +++ A +AL+L++E Q +KS W P++ L R VE+PLLW+ ELA
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L GS I R +K + D ++ L Q++P P E F E + A V
Sbjct: 242 LEGSNL---IGFRDAVLKGWTTQRDALF---PKLTQKFPSLFPEEHFRTERWAWAMAIVW 295
Query: 268 SCVVHLQKVSLARRFALVPLG 288
S V + R A+ P G
Sbjct: 296 SRAA---DVPVPRPEAIFPSG 313
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 45.1 bits (105), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 36/171 (21%), Positives = 72/171 (42%), Gaps = 24/171 (14%)
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
V P++W EL +L + + +R + + ++++ F + DI
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
E +T+ A++ V + + + K R AL+P+ + C+ +
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
+ +V DR YK GE + + N L+ YGFV ++NP D + ++ +
Sbjct: 203 EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYIDDVI 253
>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
Length = 479
Score = 45.1 bits (105), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 14/126 (11%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +PN L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W PY R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEI 217
+I
Sbjct: 135 AVVGKI 140
>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 465
Score = 45.1 bits (105), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 83/221 (37%), Gaps = 47/221 (21%)
Query: 150 TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 209
T + S L L L E+ G KS + Y R L R A W++ E +YL
Sbjct: 89 TKTEASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLA 142
Query: 210 GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
G+ + + + A K E+ + +F+++ + +FE F +A V S
Sbjct: 143 GTEVETSLRDEAAAAKNEWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSS- 190
Query: 270 VVHLQKVSLARRFALVP-----LGPPLLAYSSKCKAMLAAVDDA--------------VQ 310
R F L P L P A++ V D V+
Sbjct: 191 ----------RAFTLSPNAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVK 240
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ ++ + G+ I G N+KLL +YGF DNP D +
Sbjct: 241 VTNEQGVRRGDEIFNTYGFHGNAKLLNSYGFTQNDNPADEV 281
>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 45.1 bits (105), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 90/225 (40%), Gaps = 33/225 (14%)
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
+VP+ L A A L + +++V +P KAGE I G PNS LL YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317
Query: 344 ----------EDNPYD------RLVVEAALNTEDPQYQDK--RMVAQRNGKLSVQVFHVH 385
NP D L V AA + QDK + N V
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDDTFV------ 371
Query: 386 AGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 444
G + + +++ + RL ++ D E S L P ++ VL D R+
Sbjct: 372 IGTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSRRI 426
Query: 445 AGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 486
Y T+ +DEA+L N P K+ A + EK++L+ LQ
Sbjct: 427 NEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 45.1 bits (105), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 106/253 (41%), Gaps = 38/253 (15%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYL-MYEKKQG 171
AA D+ GD ++P+ L + L R + + L EL + L L + +++
Sbjct: 93 AAYGDIPIGDVLIALPSQLPLRLRRPTSAADDVLVQLAQQVPDELWAMKLGLRLLQERAK 152
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA----EGIKRE 227
SFW PYI L P+ + ++ L +P ++ +R E K
Sbjct: 153 SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNKRCRFLLEFEKEV 205
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+L TV + + Q D+ + + + A A S L VP+
Sbjct: 206 QQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH--------GEVPM 249
Query: 288 GPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
PL+ +++ + + + ++D +V+++ ++ K E+I + G PN L++
Sbjct: 250 LLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLD 309
Query: 339 YGFVDEDNPYDRL 351
YGFV NPYD++
Sbjct: 310 YGFVITQNPYDQV 322
>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
NRRL Y-27907]
Length = 436
Score = 45.1 bits (105), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 48/210 (22%), Positives = 78/210 (37%), Gaps = 19/210 (9%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 210
LS L +Y+ E ++GK SFW P+ LD + PL+W ++ +L L
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
P + + +YN + + Q D + F A++ + S
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224
Query: 271 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
+++ S + F + P L +S L Q+ Y E + +
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283
Query: 328 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
GP N LL YGF DN ++ L V L
Sbjct: 284 GPHSNDFLLCEYGFTISDNKWNDLDVTEYL 313
>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
SB210]
Length = 726
Score = 45.1 bits (105), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 31/188 (16%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLM 165
AA++D+ A S+PN ++++ +R +E +E L + K ++ A L ++ M
Sbjct: 67 AATKDIAPLTAFISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFM 126
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
YE+ +GKKS W Y L+ E+ L W+ E+ + + +
Sbjct: 127 YERLKGKKSLWHEYFEILENN--------ETILTWTAEEINRIPDPYIQKQA-------- 170
Query: 226 REYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 283
REY E +D +W L P T E+F A+ V S C + QK + +
Sbjct: 171 REYKEQVDELWDELKELLHSQPNFFQKATATKELFLWAYNIVMSRCFGYTQKGT-----S 225
Query: 284 LVPLGPPL 291
+VP L
Sbjct: 226 IVPFADCL 233
>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
Length = 748
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP +L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G++SF+ PYI+ + + + W++ EL+ + E +E +
Sbjct: 106 FNEKKKGEQSFFYPYIQAIQTNN--------TLIDWTKEELSQIEDPIVLDEFAIVSEDL 157
Query: 225 KREYNELDTVWFMAGSLFQQY 245
K +W A +F ++
Sbjct: 158 K-------VLWNYAQDIFNEF 171
>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
Length = 386
Score = 45.1 bits (105), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 110/249 (44%), Gaps = 23/249 (9%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSF 175
+++ GD S+P S ++T VL N + + + K LS L L ++L+ E+ +G+ S
Sbjct: 65 NVKPGDMLISLPESCLLTTSTVL-NSYLGSFIKSWKPHLSPLLALCVFLVCERHRGEASD 123
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L + + P +++ +A L S + + E+ E + RE + + +
Sbjct: 124 WFPYIDVLPK-------SYTCPAYFTDEVMALLPPS-VQRKAREQREAV-REIHSSNKAF 174
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAY 294
F + P + + T+E + A+ +V + V L + R V P L
Sbjct: 175 FRSLQPVLTQPAE---DVLTYEALRWAWCSVNTRSVFMLHSSNDFLRGQDVYALAPFLDL 231
Query: 295 SSKC-----KAMLAAVDDAVQL-VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+ C KA ++ V R + ++ + + G N +L++ YGFV NP+
Sbjct: 232 LNHCPDVQVKASFNEETKCYEIRSVSRMLQYQQAFINY-GSHDNQRLMLEYGFVAPCNPH 290
Query: 349 DRLVVEAAL 357
+ V+ L
Sbjct: 291 SVVYVDKDL 299
>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
Y34]
Length = 373
Score = 45.1 bits (105), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 24/163 (14%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 251
P +W + EL L PT A + E + +YN E +TV S+ FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F +E V+ C S R ALVPL C+ ++ + + +
Sbjct: 162 RTFLYE------VSETECY------SWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
DR Y+AGE + + G N LL YGF+ N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250
>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
Length = 805
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)
Query: 69 EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
E +K+ EDL + W+ +NG+ KV L+ H + + A + ++ + F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMG----NSLYARQMIKEDELLFRI 595
Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YE-------- 167
P + + V + T+ ++ ++ E L+L LM YE
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655
Query: 168 ---KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
K + SFWLPYI+ L + +P+ W+E E L GS + +
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708
Query: 225 KREYNELDTV 234
+RE+ + V
Sbjct: 709 RREWEMMKIV 718
>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
Length = 305
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 54/250 (21%), Positives = 99/250 (39%), Gaps = 42/250 (16%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D+Q G+ F++P S V++ + + +L L A L + ++YE +G S W
Sbjct: 3 DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----------IKRE 227
PY L ++ + WS+ ELA L S +I + + ++R
Sbjct: 61 PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKIGKDSANEMFTNTIIPLVRRH 113
Query: 228 YN---------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
+ +L + GS Y +DI + + ++ ++ + +
Sbjct: 114 ASVFFPDPNTAQGASDGDLLALAHRMGSTIMAYAFDIEPDPASKQVDEEGYASDD----- 168
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
+ +L + +VPL L A + + A L DA+ + AG+ + G P
Sbjct: 169 -EDEALPK--GMVPLADMLNADADRNNARLHYGPDALTMEAVTNISAGDEVFNDYGSLPR 225
Query: 333 SKLLINYGFV 342
S LL YG+V
Sbjct: 226 SDLLRRYGYV 235
>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
Length = 440
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 109/277 (39%), Gaps = 25/277 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIQSYLGAYIAKWKPPPSPLLALCTFLVSEKHAGDRSLWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLLPISLKAKAEEQRAHVQEFFASSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V+L Q L+ L P LL
Sbjct: 180 LQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDLL 235
Query: 293 AYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-- 348
+S + KA ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 236 NHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNHRLFLEYGFVSGHNPHAC 295
Query: 349 ---DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
R ++ L + D Q K + + +G + F
Sbjct: 296 VYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
+D + LV + + +G + GP PNS+LL+ +GF DNP++ + + A + P +
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288
Query: 366 DKRMVAQRNG 375
+K + NG
Sbjct: 289 EKNKIMLDNG 298
>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
Length = 440
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 133/330 (40%), Gaps = 44/330 (13%)
Query: 65 AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASED 118
A SR V + + +LK W LK++ + P H+ + +
Sbjct: 20 AESRGVNESFKCEFIELKKW------------LKDRKFEDTNLMPAHFPGTGRGLMSKTS 67
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P + ++T + V+ + + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPENCLLTTDTVIES-YLGAYITKWKPPPSPLLALCTFLVSEKHAGDQSPW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L+ + L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRHRRRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + A ++ ++ ++ E + + G N +LL+ YGFV NP+
Sbjct: 235 LNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEVFICYGHHDNQRLLLEYGFVSIQNPHA 294
Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
+ V + + DK+M N K+S+
Sbjct: 295 CVYVSREILVKYLPSTDKQM----NKKISI 320
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L +++V +P +GE I G PNS LL YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314
Query: 343 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 386
D NP D + V A LN + + Q + + G V VF
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVFT--- 371
Query: 387 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 446
R+ E S+++ +RL + T ++ P V +L + RL
Sbjct: 372 -RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427
Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
YP ++ +DEA+L L KR A + EK +L L
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++AE L +L L + +M+E+ G+ S W Y L RG+ + P+ W+
Sbjct: 38 SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
+L +L G+ + E AE ++ ++NE + L +P P T E + +A
Sbjct: 93 QLEHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146
>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
Length = 533
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 114/274 (41%), Gaps = 38/274 (13%)
Query: 113 VAASEDLQAGDAA--FSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ AS+ ++A SVP S +L + +A L E A L+L +YE
Sbjct: 101 IFASQAIEASTTTPLLSVPLSTFFARFTLLDSPMMAALAVRPVAREEAKLSLLFLYEYFD 160
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SFW P+ + R+ ++ W + L L + + I + I+ EY++
Sbjct: 161 -PDSFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRDAIRQLEALIEYEYDQ 212
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--- 287
LD +L ++P + F+++ FK AF+ + S + + V+ A ++P
Sbjct: 213 LDL-----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-SVNNAPCTVMIPFVDF 266
Query: 288 ----GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRPYKAGESIVVWCGPQP 331
G +A+S +A A+ DD+V+ + + + GE + +
Sbjct: 267 FNHNGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHS 326
Query: 332 NSKLLINYGFVDEDNPYDRLVVEAALN---TEDP 362
N LL++YGF N +D +V + T DP
Sbjct: 327 NEVLLLHYGFALPHNEHDTFLVRLHFDREKTNDP 360
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 87/206 (42%), Gaps = 22/206 (10%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 210
+ E LA +LM + G +SFW PYI+ L D Q R + L W E T L L
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179
Query: 211 SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
+ + G++ +E+ +T W F+ S I + AF+ E+ K
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
+ V L+ S+ LVPL + + + A + + L+V + GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP+ N +L++NYGF N D
Sbjct: 288 PNNYGPRSNERLMMNYGFCIRGNVCD 313
>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
Length = 424
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 42/208 (20%), Positives = 89/208 (42%), Gaps = 24/208 (11%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
LS ++L+L E +GK+S+W P+I+ L + SP LW + G
Sbjct: 114 LSSFQIMSLFLELESSRGKESWWDPFIQMLPTIND----FLTSPFLWQ------IQG--- 160
Query: 214 KAEILER-----AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
K E++E+ + + +N ++ + +L + ++ + + F ++ + S
Sbjct: 161 KYELIEKLPKSTQKHSLKMFNRFESDFKAVKTLLE--THNASKDIINHDKFVLYWMCINS 218
Query: 269 CVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
+++ QK + + F + P + +S+ + L ++ YK + + +
Sbjct: 219 RCLYMEIPQKKTTSDNFTMAPY-VDFINHSTNDQCKLKIDRTGFHVITTSNYKENDELYL 277
Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVV 353
GP N LL YGF +N ++ L +
Sbjct: 278 SYGPHSNEFLLCEYGFHLSNNEWNDLDI 305
>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
Length = 728
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
+WM K G+ V + P H + A+ D++ GD VP + ++T + V G
Sbjct: 51 AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104
Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+ +A L L E L L +M E +G++S + PY+ L R A
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159
Query: 195 ESPLLWSETELAYLTGS 211
SPL WS EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176
>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
Length = 770
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
A ED+Q +A +PN ++T+ER +E + +++ + L +++M E
Sbjct: 67 AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126
Query: 168 KKQGKKSFWLPYIRELD 184
++G +SFW PY +D
Sbjct: 127 FQKGDQSFWYPYFNAVD 143
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 88/204 (43%), Gaps = 23/204 (11%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 219
A+ + E +SFW PY+ EL AV + W++ ELA + + E++E
Sbjct: 49 AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ + I +Y + + + Q + IP+E E +++A V S + L
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153
Query: 280 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+P L +S + C D +L+ Y GE + + G + +S
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSS 212
Query: 334 KLLINYGFVDEDNPYDRLVVEAAL 357
+LL +YGF+ E+N D + + A+L
Sbjct: 213 QLLASYGFMYENNAEDTVDITASL 236
>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 518
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 91/211 (43%), Gaps = 25/211 (11%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
D+ + +P K G+ I G + N LL+ YGF N YD + +N Q
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309
Query: 366 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 406
+ +++V Q + K+++ + +K I+ D++ YLRL +
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369
Query: 407 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
+++ V+ SL PVSP E VL L+ + T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425
Query: 465 PKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
K R A +K++L + + ++LL
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILL 456
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 27/192 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ + W PYI L + +++ W +TEL+YL SP + E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKVSL 278
R E I E+ ++ F L Q +++ Q F V + H+
Sbjct: 221 RLEMITTEFGQVQND-FCTCVLEQ-----------ALDVWPQLFGKVSLEDLKHVYATVF 268
Query: 279 ARRFALVPLGP---PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+R A+ P+L + +S K + + + DR Y + I + G
Sbjct: 269 SRSLAIGEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 328
Query: 331 PNSKLLINYGFV 342
N++L ++YGF
Sbjct: 329 SNAELALDYGFT 340
>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 392
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
K+E+ DL W NG + E +++ + A+ +++A +
Sbjct: 74 KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121
Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
VP L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++PL + E E+ YL + ++ + + R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216
>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
[Brachypodium distachyon]
Length = 495
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 67/318 (21%), Positives = 113/318 (35%), Gaps = 63/318 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM K+G V+ + + YV A L+ GD ++P +T R
Sbjct: 13 FKRWMSKHG-----VVCSDALCLDASEAGGVYVRALSALREGDLVATIPRRACLT-PRTS 66
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G +L LA+ +MYE+ +G +S W Y+R + PL+W
Sbjct: 67 GAAAAI---EAAELGGTLALAVAVMYERARGAESPWNAYLRLIPD-------CEPVPLVW 116
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ E L+G+ + + E + ++ E +G L + E F+ E +
Sbjct: 117 PDEEAERLLSGTELDKIVKQDREFLCEDWKECIEPLISSGDL------GVNPEDFSLEKY 170
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA----------- 308
A + S H+ + +VPL + V DA
Sbjct: 171 FAAKSLLSSRSFHIDSYHGS---GMVPLADLFNHKTDGEHVHFTKVSDASDSDEGEDDDD 227
Query: 309 --------------------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++++ R AGE + G N+ LL YGF
Sbjct: 228 QSNAGSDEEPTVENSATNPSGYNDEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFT 287
Query: 343 DEDNPYDRLVVEAALNTE 360
+ DNPYD + ++ L T+
Sbjct: 288 ELDNPYDIVNIDLTLVTK 305
>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 514
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 99/417 (23%), Positives = 168/417 (40%), Gaps = 65/417 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
VA + ++ AG+ VP + + + + + S A LA +++ E G
Sbjct: 85 VATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFDAG 144
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYN 229
KS + P++R L R V+S + W+E EL+ L+GS + RA + EY+
Sbjct: 145 SKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVVVFTRAIKAQWRMEYD 195
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
LD +L +++P E +TF+ F A + S + L S +
Sbjct: 196 ALDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIR 248
Query: 287 LGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ PLL A K + A +AV++ ++ + +P+ L+ YGF
Sbjct: 249 VLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGF 308
Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE------AIS- 394
+ E NP + VEA + D + R+ K + H +++ +I
Sbjct: 309 IPETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLDPKKRNFEWKPRSIDY 359
Query: 395 DMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-LDQLADYFKARLAGYP 448
D+L R+ D +EM S S + + +AV L LA + L Y
Sbjct: 360 DLLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLLKSLASF----LESYT 414
Query: 449 ATLSEDEAML-------TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIML-LPD 497
TL+ED + D L K++ L+RM +K + L +AD + LPD
Sbjct: 415 TTLAEDNEYVARVDDESNDEPLPGKRKRFAVLLRMREKQI---LLASADALFKELPD 468
>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
Length = 440
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 75/347 (21%), Positives = 132/347 (38%), Gaps = 39/347 (11%)
Query: 41 GSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEK 100
G +RR+ F VS R V + + +LK W+ +I
Sbjct: 5 GGRTSRIRRRKLFRCSVS---------RGVNESYKPEFIELKKWLKDRKFEDTNLIPACF 55
Query: 101 PSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELAC 159
P + + L+ G S+P S ++T + V+ + + S L
Sbjct: 56 PGTGRG------LMSKTSLREGQMIISLPESCLITTDTVIRSYLGTYIAKWQPPPSPLLA 109
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L+ EK G +S W PY+ L + A P+ E E+ L P KA+ E
Sbjct: 110 LCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLKAKAEE 161
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKV 276
+ ++ ++ + LF + I F++ A+ V + V H Q+
Sbjct: 162 QRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVKHRQRQ 217
Query: 277 SLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPN 332
+ L P LL +S + + A ++ ++ + E + + GP N
Sbjct: 218 CFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICYGPHDN 277
Query: 333 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
+LL+ YGFV NP+ + V + + DK+M N K+S+
Sbjct: 278 QRLLLEYGFVSIHNPHACVYVSEDILVKYLPTTDKQM----NKKISI 320
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 57/255 (22%), Positives = 104/255 (40%), Gaps = 35/255 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EY + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEYISVKNV 124
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
P+ + F+ F A+ V V +L+ ++ P +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRP-------F 166
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINYGFVDEDNPYDRLV 352
C D V ++ VV+ G + N+ L ++YGFV +N D+
Sbjct: 167 VDFCN------HDPVSHATVSHDSCKDATVVFISYGKRSNAVLAVDYGFVLPNNLSDQAE 220
Query: 353 VEAALNTEDPQYQDK 367
+ + DP + K
Sbjct: 221 LWMEIPWNDPLREKK 235
>gi|397642897|gb|EJK75526.1| hypothetical protein THAOC_02751 [Thalassiosira oceanica]
Length = 395
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 69/319 (21%), Positives = 131/319 (41%), Gaps = 45/319 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K W G+ E+ ++K R I Y + +AG A VP L+++ + +
Sbjct: 82 FKYWASTMGIEKNDCFKLEE--QDKKQREI-YAMTTRSTEAGTAVLYVPEHLILSSSKAM 138
Query: 141 ----------GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
E +A + ++L E L L ++ E ++G S W ++ L R
Sbjct: 139 AELRTDGMAEAEEYLASVGAESQLREY-YLMLKVLLEYQKGSDSEWHKWLDALPRYYSNA 197
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ +E L L K + ER Y+ + +V F+A + + +P D+
Sbjct: 198 -------VAMTEFCLTCLPPLMKKLAVEERDAQKLLSYDSIQSVPFLADDIKEGFPRDMV 250
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
T A+ +V+ + V ++P+G ++S ++ D+A
Sbjct: 251 TWAYQ--------------IVYTRSVETEDGDLKIIPMG-DFFDHASDYAEIVPQYDEAG 295
Query: 310 QLVVDRPYK--AGESI-VVWCGPQPNSKLLINYGFVDEDNP--YDRLVVEAALNTE--DP 362
Y AG+ + ++ P+ S LL YGF+DE P Y +L + +N E +
Sbjct: 296 NYYAVTAYDVPAGKKLRYIYSNPRNPSHLLARYGFIDEICPATYCKL-LPPTVNEEMIEL 354
Query: 363 QYQDKRMVAQRNGKLSVQV 381
Y ++M+ R+G+++ +V
Sbjct: 355 GYSQEKMLFYRSGEVADEV 373
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 66/316 (20%), Positives = 125/316 (39%), Gaps = 55/316 (17%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
++ W+ ++G + L P+ R + + G+ ++P+ + T+E
Sbjct: 1 MEGWLRESGAELDGLELAHFPAIGRGVRTLRC------FKQGERILTIPSGCLWTVEHAY 54
Query: 141 GNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV---- 194
+ + +L + + LS LA+Y+++ + RE R +A
Sbjct: 55 ADAVLGPVLRSAQPPLSVEDTLAIYILFVRS-----------RESGYDGLRSHVAALPAS 103
Query: 195 -ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT-VWFMAGSLFQQYPYDIPTE 252
S + + + EL GS + + I+ +Y L V+ + LF P
Sbjct: 104 YSSSIFFEDDELEVCAGSSLYTITRQLEQRIEEDYRGLVVRVFGLHLDLF-------PLN 156
Query: 253 AFTFEI--FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---------LAYSSKCKA- 300
FT E +K A V S + ++P G PL + +S + K
Sbjct: 157 KFTIENVGYKWALCTVWSRAMDF----------VLPNGNPLRLLAPFADMVNHSPEVKQC 206
Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
+ A + ++ + Y+A + + ++ GP PNS+LL YGFV DNP D + + +
Sbjct: 207 HVYDASSGNLSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHP 266
Query: 360 EDPQYQDKRMVAQRNG 375
P Y+ K+ + G
Sbjct: 267 LAPFYEQKQKLWASAG 282
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 372
G I + GP N +LL YGFV E N +DRL EAA+ E + +R +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418
Query: 373 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 432
L+ + H GR A ++ LR+ ++++ E ++ + GP + E V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473
Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTD 460
+AD + L + +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 53/112 (47%), Gaps = 17/112 (15%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED- 361
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 315 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEEDS 374
Query: 362 ---------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 404
+ +RM+ R + +Q + ++ + YLRLG+
Sbjct: 375 SSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVHCYFVYLRLGH 421
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139
Query: 343 DED-NPYD 349
+ D + YD
Sbjct: 140 EWDGSKYD 147
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 9/144 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL AG +P S + + N TI+ LL ++ + L L +YE +
Sbjct: 40 VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNE 230
KS W PY++ + +G ++V P WSE L G T + L A ++E Y
Sbjct: 96 EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRG--TTLDTLYDALSPQQEVYEG 152
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF 254
+ +A Q++ +P E F
Sbjct: 153 FEISLHVAKKWNQEFSLPLPEEYF 176
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 10/120 (8%)
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
++FK + V S H + F L P + S + ++ +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 366
Y AGE ++ G PN KLL++YGF+ D+D D +++ NT Q QD
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSNTTRDQLQD 283
>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
Length = 490
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 71/185 (38%), Gaps = 18/185 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL AGD VP S ++ R G IA LL + L +A L + +YE+ G
Sbjct: 36 VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNEL 231
+S W + L E P WS E L EI G E Y EL
Sbjct: 92 QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGTEVEEIGGLETGEDEEVYQEL 148
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+F I E +F+ FK+ V + S + + R LVP G L
Sbjct: 149 IVPFFEDNGKL------INLECPSFDEFKKLVVVIASRAFEVDQF---RELCLVP-GACL 198
Query: 292 LAYSS 296
+S
Sbjct: 199 FNHSD 203
>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
Length = 254
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 66/158 (41%), Gaps = 25/158 (15%)
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
+ K+ L V+ VQL+ P KAGE I ++ G + L +GF D DNP D + E
Sbjct: 72 RYKSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELD 127
Query: 357 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 416
L +E + Q K M +V+ ++ D LP RL +
Sbjct: 128 L-SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAML 166
Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
+ + +S E+ V + + A AGYP L ED
Sbjct: 167 RILHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEED 204
>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
Length = 689
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 81/207 (39%), Gaps = 36/207 (17%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + + +L +L G+
Sbjct: 104 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDNDLEWLQGTNLYQTHQA 158
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 159 YRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 204
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P +GE +
Sbjct: 205 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFSGEEV 264
Query: 324 VVWCGPQPNSK-LLINYGFVDEDNPYD 349
G N + ++ YGF DNP D
Sbjct: 265 HNNYGSLNNQQSVMTTYGFCIVDNPCD 291
>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
Length = 512
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A++D+ AG VP L++++E + + L ++ L+ LA++L+ E +
Sbjct: 37 VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96
Query: 171 GKKSFWLPYIREL 183
G SFW PY+R L
Sbjct: 97 GAASFWQPYLRSL 109
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+S A + ++ V+LV R KAG+ I + G N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 117/275 (42%), Gaps = 45/275 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L + + LS LA Y+++ +
Sbjct: 121 FKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLATYILFIRS------- 173
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYN 229
RE R +A S + ++E EL G+ T + L+R+ I+ +Y
Sbjct: 174 ----RESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYR 227
Query: 230 ELDTVWFMAGS--LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRF 282
L V +A S LF P + F+ E +K A V S + + L F
Sbjct: 228 TL-VVRVLAQSRDLF-------PLDKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPF 279
Query: 283 ALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
A +L +SS+ + + A + ++ + Y+AG+ ++ G PNS+LL YG
Sbjct: 280 A------DMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYG 333
Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
FV NP D + + + P ++ K+ + G
Sbjct: 334 FVMPGNPNDSYDLVISTHPSAPFFERKQKLWASAG 368
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 69/326 (21%), Positives = 122/326 (37%), Gaps = 83/326 (25%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY--------VAASEDLQAGD 123
S + E++ + KSW+ + G + H +H+ VAA D+ +
Sbjct: 3 SIEPENVANFKSWIAQQG--------------GQIHAGVHFEPVEFGFNVAARSDIPSDA 48
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELAC--LALYLMYEKKQGKKSFWL 177
S+P SL +T + I +LL T + +L C + L+ + E
Sbjct: 49 TVVSIPFSLAITPN--VARHAIKQLLNTEPQNWSERQLECTYIVLHSIVEPIDPSILRHR 106
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + +PL ++E EL+ GS L+R + E+ +
Sbjct: 107 PYLDTLPSPE-----QLRTPLHFTEAELSSFRGSNLFGATLDRKHEWETEWQQCKNTVSA 161
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG--PPLLAYS 295
A + + Q +FT+E + A + S R F L P L+
Sbjct: 162 AIAGWGQ--------SFTWEKYLTAATYLSS-----------RAFPSTILSDTPSLVTTE 202
Query: 296 SKCKAMLAAVD---------------------------DAVQLVVDRPYKAGESIVVWCG 328
+ +L +D ++ LV+ P G ++ G
Sbjct: 203 TSYPVLLPGIDALNHARGHPVSWVVSAPSQTSSSQRSESSISLVIHTPTPRGSELLNNYG 262
Query: 329 PQPNSKLLINYGFVDEDNPYDRLVVE 354
P+PNS+L++ YGF +NP D +V++
Sbjct: 263 PKPNSELILGYGFSLPNNPDDTIVLK 288
>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 62/301 (20%), Positives = 121/301 (40%), Gaps = 57/301 (18%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE----DLQAGDAAFSVPNSLVVTL 136
L +W NG+ + ++ + + VA +E D +A D +VP+ L +TL
Sbjct: 11 LSTWAKLNGMSLEGIAFQKLHGEHGTDKGTAIVATAEKKDEDAEA-DTLLTVPSDLALTL 69
Query: 137 ERVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWL 177
E V + + + + T + + L L + + + +K G + W
Sbjct: 70 EYVHNHAKTDRHLREVLDAVGDFGRTARGAILIFLIVQITHASPDFANQRQKIGVSNPWT 129
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----D 232
YIR + ++ P +S E L G+ + + + +++E+ L D
Sbjct: 130 EYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFEHLRQATED 182
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
W Q++ +D T FTF+ K +S VV L + A + P +
Sbjct: 183 IHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA-------IVPCVD 229
Query: 293 AYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDE 344
+ C+ ++ A D +QL + + GE + + G + P S+++ +YGFV+
Sbjct: 230 MANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVEN 289
Query: 345 D 345
+
Sbjct: 290 E 290
>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
Length = 440
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 128/322 (39%), Gaps = 32/322 (9%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + LQ G
Sbjct: 22 SRGVNESYKPEFIELKKWLKDRKFEDTNLIPACFPGTGRG------LMSKTSLQEGQVII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDR 185
S+P + ++T + V+ + A + S L L +L+ EK G +S W PY+ L +
Sbjct: 76 SLPETCLLTTDTVIRSYLGAYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWKPYLEILPK 135
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
A P+ E E+ L P +A+ E+ ++ ++ + LF +
Sbjct: 136 -------AYTCPVC-LEPEVVNLFPKPLRAKAEEQRARVREFFSSSRGFFSSLQPLFSEA 187
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSSKCK 299
I F++ A+ V + V++ K R F+ P L P LL +S +
Sbjct: 188 VGSI----FSYRALLWAWCTVNTRAVYV-KPRRRRCFSAEPDTCALAPYLDLLNHSPHVQ 242
Query: 300 AMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
A ++ ++ + E + + GP N +LL+ YGFV NP+ + V +
Sbjct: 243 VEAAFNEETRCYEIRTASSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPHACVYVSEDI 302
Query: 358 NTEDPQYQDKRMVAQRNGKLSV 379
+ DK+M N K+S+
Sbjct: 303 LVKYLPSTDKQM----NKKISI 320
>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 889
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 15/135 (11%)
Query: 79 GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
G L+ W+ GLPP KV + + V E L +VP L++T +
Sbjct: 32 GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
L + LL + + + LA +L ++Q G K+ W Y+ L Q G
Sbjct: 86 ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138
Query: 197 PLLWSETELAYLTGS 211
L W+ E+ L G+
Sbjct: 139 VLEWASEEVDLLRGT 153
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 349
LL + + K +D V V K GE + G + N LL++YGFV + NPYD
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288
Query: 350 --------RLVVEAALNTE 360
+ +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307
>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
Length = 209
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 483
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 63/281 (22%), Positives = 117/281 (41%), Gaps = 58/281 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
+G S W PY+ +L T LA T + + E L+ + I ++
Sbjct: 111 KGSNSVWYPYLCQLPSYY---------------TILA--TFNDFEVEALQVDDAIWVAQK 153
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
+ + + W A L ++ + + F+ + AF V S +H ++ L P
Sbjct: 154 AKSAIKSDWEDATPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 208
Query: 287 LG-------------------PPLLAYSSKCKAMLAAVD----------DAVQLVVDRPY 317
+G L Y K M + + +A L + Y
Sbjct: 209 VGDLFNYAAPDDDTLLEDEDTAELTNYQQK-NGMTNSSERLTDGGYEDCNAYCLYARKNY 267
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
K GE +++ G N +LL +YGF+ +NP ++ +E L+
Sbjct: 268 KKGEQVLLAYGTYTNLELLEHYGFLLGENPNEKTFIELDLD 308
>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 467
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 78/192 (40%), Gaps = 13/192 (6%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L+L+ E + KSFW PYIR L + Q + Q A+ W + E L G+ + I +
Sbjct: 95 LFLIKEYLKRDKSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDK 152
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE--IFKQAFVAVQSCVVHLQKVS 277
++R+ E + + G + + TE + + IF + Q+ S
Sbjct: 153 IRNDVRRDLQEAQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRS 212
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVVWCGPQ 330
L R + L + M + DD Q L V R + G+ + +
Sbjct: 213 LPRGVTMDDFSVLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMK 272
Query: 331 PNSKLLINYGFV 342
N++LL+ YGF+
Sbjct: 273 TNAELLLGYGFM 284
>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
melanoleuca]
Length = 440
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 72/321 (22%), Positives = 124/321 (38%), Gaps = 30/321 (9%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + L+ G
Sbjct: 22 SRGVNESYKPEFIELKKWLKDRKFEDTNLIPACFPGTGRG------LMSKTSLREGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDR 185
S+P S ++T + V+ + A + S L L +L+ EK G +S W PY+ L +
Sbjct: 76 SLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPK 135
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
A P+ E E+ L P KA+ E+ ++ ++ + LF +
Sbjct: 136 -------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSSLQPLFSEA 187
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKC-- 298
I F++ A+ V + V H Q+ + L P LL +S +
Sbjct: 188 VESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLLNHSPRVQV 243
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
KA ++ + E + + GP N +LL+ YGFV NP+ + V +
Sbjct: 244 KAAFNEETRCYEIRTASGCRKHEEVFICYGPHDNQQLLLEYGFVSIQNPHACVYVSEDVL 303
Query: 359 TEDPQYQDKRMVAQRNGKLSV 379
+ DK+M N K+S+
Sbjct: 304 VKYLPLTDKQM----NKKISI 320
>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 421
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 104/276 (37%), Gaps = 56/276 (20%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A+ L + SVP L+ + E R+ G T A L LA L+ EK
Sbjct: 59 ATRPLAKDELVLSVPRKLIFSEESNSDCRLFGKMTQATHLN---------LAYDLVIEKI 109
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G+ S W PYI L + + L ++ ++ L G+ + L + I ++Y
Sbjct: 110 RGEFSEWRPYIDVLPAK-------YSTVLYFTTKQMELLRGTAAASLALRQCRVIAKQYA 162
Query: 230 ELDTVWFMA--------------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
L G F Q+ +E+++ A S V+ Q
Sbjct: 163 FLYRYAHTMTEPSTGNRSHPGERGLFFTQH-------GLCYELYRWAV----STVMTRQN 211
Query: 276 VSLARR----------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
+ + + AL+P K + AAV ++ AGE +
Sbjct: 212 LVPSEKQESEDTPKLISALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFI 271
Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
+ G + N+ LL++ GFVD++N D + + L+ D
Sbjct: 272 YYGDRSNTDLLVHNGFVDDNNLKDYVNIRVGLSLTD 307
>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 356
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 69/168 (41%), Gaps = 22/168 (13%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P W L G T +LE+ + R++ L + YPY +P+E +
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150
Query: 256 FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
F V+ ++ Q+ L R A++P+ S CK A ++ +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
DR Y G+ + G N LL YGF+ ++N DR + +++E
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 66/297 (22%), Positives = 123/297 (41%), Gaps = 49/297 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 11 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70
Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
V + + + + T + + L L + + + +K G + W
Sbjct: 71 YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
YIR + +V P +S E L G+ + + + +++E++ L +
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
W Q++ +D T FTF+ +K +S VV L + A+VP +
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233
Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
S K K +AV QL + + GE + + G + P S+++ +YGFV+ +
Sbjct: 234 ACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENE 290
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 123/329 (37%), Gaps = 66/329 (20%)
Query: 83 SWMHKNG-LPPCKVILKEKPSHNEKHRPIHY--VAASEDLQAGDAAFSVPNSLVVTLERV 139
+W KN L P + S EK Y V A+ D+ + D +P T+ V
Sbjct: 76 AWRVKNNILAPNVEVAYVGGSEKEKGGDDLYRGVKATSDIASEDDLVRLPRE--ATMLVV 133
Query: 140 LGNETIAELLTTNKLSELAC-------LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
G E E +N+L A +AL L+YEK G +S + YI +L +
Sbjct: 134 EGQENPHEEYISNELWAKAGDERWALRVALVLLYEKSLGSRSKFYEYIEQLPK------- 186
Query: 193 AVESPLLWSETE---LAYLTGSP-TKAEILE------------RAEGIKREYNELDTVWF 236
+ E+ W+E E L Y G K + LE R G+K E + +W
Sbjct: 187 SFENLGTWTEEEVRELQYSVGEKFAKEQRLENEKACELIQEYARDGGLKTIERE-EVIWA 245
Query: 237 M--------AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQS------CVVHL------ 273
+ +G + Q +P +F +F+ Q+ CV L
Sbjct: 246 LDVVRSRVFSGKIADQEALQRKLLPRALSVGTVFA-SFLTAQTTELKWLCVFALLALVVF 304
Query: 274 -----QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
V + L+PL + + K + L + YK GE +++ G
Sbjct: 305 DSTKENDVKTDTAYVLMPL-IDAFNHQTMLKTEFEFTNSEFALKSPKSYKKGEEVLISYG 363
Query: 329 PQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
PN +LL+ YGFVD+ N D E L
Sbjct: 364 LMPNDELLLRYGFVDDQNVADTYQFEGLL 392
>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 493
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 25/203 (12%)
Query: 158 ACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
ACL L L++E+ G+ S + Y+ L + PL W+E E+ L G T AE
Sbjct: 133 ACLTVLRLLHERGLGESSPFHSYLSVLPQDH-------RLPLEWTEAEVGLLQG--TSAE 183
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
L A + ++ +V Q+P T F + V+S +
Sbjct: 184 PLVGAGSLDSQFEAFQSV-------VAQHPTVWEPSVCTKAAFAKGVNWVRS-----RGF 231
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSK 334
++ ++P G + + +++ D V+ +P KAGE + G N++
Sbjct: 232 TVMGDPHMIP-GADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQ 290
Query: 335 LLINYGFVDEDNPYDRLVVEAAL 357
LL +YGFV N +D +++ L
Sbjct: 291 LLNSYGFVLPGNSFDTVLIPTQL 313
>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
Length = 415
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 64/269 (23%), Positives = 109/269 (40%), Gaps = 24/269 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 44 LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 103
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ ++ +
Sbjct: 104 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 155
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V H Q+ + L P LL
Sbjct: 156 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 211
Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
+S + KA ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 212 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICYGPHDNQQLLLEYGFVSIQNPHAC 271
Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
+ V + + DK+M N K+S+
Sbjct: 272 VYVSEDVLVKYLPLTDKQM----NKKISI 296
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 43.1 bits (100), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 36/230 (15%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
A++L+ + G++SFW PYI+ L + A+ PLLW E++L +L G+ + + +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-------- 271
+ + +KR W A Q+Y +D P++ FT E+ A+ S
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPD 218
Query: 272 ---------HLQKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
HL K + L P L L ++ + D + + + K G
Sbjct: 219 VANIKPEFQHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKNIELKPG 278
Query: 321 ESIVVWCGPQP----NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 366
+ + + N+ LL ++GF+ DN LV+ + + P + D
Sbjct: 279 DPFYIAYDKETQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHLD 328
>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
Length = 504
Score = 43.1 bits (100), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 89/205 (43%), Gaps = 30/205 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L ++L+ E+ +G SFW PY+ L + L W+ E+ L TK +
Sbjct: 1 LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKV 276
+ +N L + L +Q P AFT+++FK A+ +V + V++ Q
Sbjct: 53 LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNS 108
Query: 277 SLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVW 326
L+ + AL P LL ++ + A DD+ L +PY + + +
Sbjct: 109 VLSPDEEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFIN 163
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRL 351
GP N KLL+ YGF NP++ +
Sbjct: 164 YGPHSNEKLLLEYGFTLPCNPHNNI 188
>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 510
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DL+ G+ VP S ++T + L + + + + LS L + L+YE +G
Sbjct: 57 LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116
Query: 172 KKSFWLPYIRELDR 185
+ SFW PY+ L R
Sbjct: 117 QSSFWYPYLMHLPR 130
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 37/182 (20%), Positives = 83/182 (45%), Gaps = 12/182 (6%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
R YK E + ++ G + N++ +++ GFV ++N +D L ++ L+ D ++ KR + ++
Sbjct: 96 REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155
Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--------SDTSEMQSVISSLGPICPVSP 426
+ VF + + + + ++P + L V SD + + + + P P
Sbjct: 156 KIPTSDVFELKKAPDGDGV--LVPKVLLHLVHILQWKAPSDGTTSGTDVGA-DPSDATDP 212
Query: 427 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML-NACL 485
+ L + + P ++ E +L D + ++A + E++ML NAC
Sbjct: 213 VRTKKAKTFLHVRCQLLMKALPRSVEELTEILNDPTTSLESKLAIRYRLSEQRMLTNACN 272
Query: 486 QV 487
++
Sbjct: 273 KI 274
>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
Length = 195
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTED 361
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 14 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 54/253 (21%), Positives = 99/253 (39%), Gaps = 36/253 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
+ A ED+ G+ F +P ++ + T++L E L L ++YE G
Sbjct: 43 MVAVEDVAEGETLFEIPRGSILNVNTSALTRDYPSF-GTSQLGEWEELILCMLYEMFVLG 101
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-E 222
+ S W PY L + S + WS+ EL L G E+ +
Sbjct: 102 ENSRWYPYFNVLP-----SSAELNSLIYWSDRELGLLKPSFVIERIGRGKSQEMFSKVLS 156
Query: 223 GIKREYNELDTV--------WFMAGSLFQQYPYDI----PTEAFTFEIFKQAFVAVQSCV 270
I+ + ++L + + S+ Y +D+ P EI + S
Sbjct: 157 YIENQDSDLSLIAKYLTWENFVYVASIIMSYSFDVEDLNPQSDEDDEIEDDDNDSEMSPD 216
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
++ +++PL L + + C A L + +++ +P +AGE + G
Sbjct: 217 KSIK--------SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEH 268
Query: 331 PNSKLLINYGFVD 343
PNS++L YG+V+
Sbjct: 269 PNSEILRRYGYVE 281
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 7/188 (3%)
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 368
+ ++ + Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298
Query: 369 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSLGPICPVSP 426
+ G S + A + ++L YLR + +D ++M + + G V+
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRLDAADVADMTLQLLN-GTDGKVND 355
Query: 427 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 484
E VL L D + L G+ L + EA L Y A Q+ E+ +L
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415
Query: 485 LQVTADMI 492
+ DM+
Sbjct: 416 KKTAEDML 423
>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3-like [Glycine max]
Length = 449
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 357
D +++V + K + +++ G N L++YGFV NPYD L + AL
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291
Query: 358 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
N P + +++Q N V G ++ +L LR+ ++
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351
Query: 411 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
MQ S++ SL P+ E AV L L +P + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403
>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
Length = 504
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
Length = 553
Score = 42.7 bits (99), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
D+ V +++ KAGE I G NS LL YGF EDNP+D
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHD 363
>gi|363747293|ref|XP_003643967.1| PREDICTED: N-lysine methyltransferase SETD6-like [Gallus gallus]
Length = 447
Score = 42.7 bits (99), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 70/273 (25%), Positives = 113/273 (41%), Gaps = 37/273 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A+ DL+ G+ FSVP S +++ R L ++ L + S L L L++E
Sbjct: 51 AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SGWVPLLLALLHEYTT 107
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
G S W PY + +++ P+ W E E + L G+ + + I+ EY+
Sbjct: 108 G-TSHWRPYFS-----LWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
+ + FM + +P E T E++KQ AFV S L++ + P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215
Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P +L + + A L +++V +P G+ I G N +LL YGF +
Sbjct: 216 MVPVADILNHVANHNASLKYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
PY NT D D +MV R L
Sbjct: 275 --PYPG-------NTNDT--ADIQMVTVRKAAL 296
>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
Y34]
Length = 464
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 3/129 (2%)
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
L Q+ P E FT E +K A V S + L P +L +S K
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204
Query: 301 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
A + + ++ + Y+AG+ + ++ GP NS+LL YGFV N D + A +
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSNDNYDLVLATH 264
Query: 359 TEDPQYQDK 367
E P + K
Sbjct: 265 PEAPFFARK 273
>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
Length = 777
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 74/165 (44%), Gaps = 31/165 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLM 165
VAA + + +A +PN L++ +++ +E E T K S+ L ++
Sbjct: 112 VAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPNEFKNTEK-SDQTSLIFFVA 170
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVES--PLLWSETELAYLTGSPTKAEILERAEG 223
E +G++S+W PY + A +S P W + + L + KAE+
Sbjct: 171 LELLKGEESYWHPYF----------ETAQDSDLPQFWEDQNIDELEDALIKAEL------ 214
Query: 224 IKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
+ +++D + + +A + YP + E FT EI+K+A+ V
Sbjct: 215 ---QMHQVDFIGDYEIAHGIANHYPDLVHAEKFTIEIYKRAYNIV 256
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 42.4 bits (98), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 105/254 (41%), Gaps = 36/254 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D + + S+P L++T + ++ L L + + EK+Q K
Sbjct: 35 ATTDFRENETIISIPVGLIITAGFIAEMPDYCDVFKRYCLKPFEALVYFFLVEKEQNSK- 93
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L P +S + + P R + ++ NEL
Sbjct: 94 -WTPYLEVL-------------PKSFSTPASLHPSLKPEDFPYCLRKQWYVQK-NELKI- 137
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKV------SLARRFALVPL 287
+++++ I + ++ F A+ V + C+ K+ + A+VPL
Sbjct: 138 ------MYEKF-VTILADNTIWDHFLWAWHIVNTRCIYRNNKLHPLIDNTEDDSLAIVPL 190
Query: 288 GPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
+L +S S+C A+ + + +++V RP + GE I + G N L I YGF +D
Sbjct: 191 -IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKD 249
Query: 346 NPYDRLVVEAALNT 359
N D+ VE +L +
Sbjct: 250 NICDK--VEISLGS 261
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 22/171 (12%)
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
LA PL WS L P +A L RA+ K + W + F
Sbjct: 104 LATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP-------- 148
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVD 306
A + F+ A++ + + + AR R L P+ L +++ +A
Sbjct: 149 -ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAFTP 206
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
+ + DR Y GE +++ G N LL+ YGFV E N +D + ++ A+
Sbjct: 207 ASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257
>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 165
Score = 42.4 bits (98), Expect = 0.56, Method: Composition-based stats.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
V A E++ A ++PN+L+++ V +E L L + ++ LALYL
Sbjct: 51 VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
M EK +G+ SFW PY+ Q+A ES L W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143
>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
gallopavo]
Length = 241
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 94 KVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE 147
K LK++ + RP + + + LQAG+ S+P +VT VL N + E
Sbjct: 36 KKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVL-NSCLGE 94
Query: 148 LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
+ K +S L L +L+ EK G+KS W PY+ L +
Sbjct: 95 YIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK 134
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 62/252 (24%), Positives = 98/252 (38%), Gaps = 25/252 (9%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYL 164
P+ + A + GD VP L+++ E LG A L + S
Sbjct: 191 PLRGLRADTAVAPGDVVLHVPADLLISYETAKKSDLGKVLSALPLDLSDDSIALIWTCVE 250
Query: 165 MYEKKQGKKSFW--LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
+E + FW LP+ + + L S+ ++A L G+P + + RA
Sbjct: 251 RHEPEAPHAPFWAALPH-------------SFSTALSASQEDVALLEGTPLHGDAV-RAR 296
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
E E + F SL YP E F++E + A S + +Q S R
Sbjct: 297 QHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYLWAAELWYSYGIQVQFASGDIRT 354
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
L P + + + VD +++ RP +AG + + GP N+KLL+ Y
Sbjct: 355 CLAPYLGLMNHHPLPHVVHFSKVDPETGCLRVRAFRPCEAGNQLFLSYGPYSNAKLLLFY 414
Query: 340 GFVDEDNPYDRL 351
GF DNP D +
Sbjct: 415 GFAVRDNPADEV 426
>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
Length = 712
Score = 42.4 bits (98), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
RP +AG+ + + GP PN KLL YGFV NP+D +V L + + +++ A
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501
Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
L ++ H+ ++ +L LRL V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537
>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
rotundata]
Length = 277
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---DPQYQDKRMVA 371
R +K G+ I + GP+PNS ++ GFV D+ +D L E DP ++R +
Sbjct: 80 RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139
Query: 372 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
++ F V++GRE S +L Y+R+
Sbjct: 140 KKLHLQPWSEFVVNSGREPIPGS-VLAYMRV 169
>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
Length = 495
Score = 42.4 bits (98), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 103/263 (39%), Gaps = 53/263 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----------VLGNETIAELLTTNKLSELACLA 161
V ASE ++ + F +P ++ ++ + G I E+ L + CL
Sbjct: 43 VVASEHIEKDEVLFEIPRDSILNVDTSELFKNHYEGYIDGKTVIEEIGLWETL--ILCL- 99
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER- 220
Y M+ KK+ +SFW Y L + L + W + EL L S ILER
Sbjct: 100 FYEMFVKKE--ESFWSQYFAVLPKATDFNTL-----MYWEDRELENLKPSF----ILERI 148
Query: 221 ------------AEGIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQA 262
E +++ + ++T F + S+ Y +DI E +
Sbjct: 149 GKDKSVAMHEKLMEFVEKNLDVIETSSFTWDRFLLVASIIMAYSFDI-------ERGECD 201
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+ + SL + +++PL L A + +C A L +++ +P KA E
Sbjct: 202 ADEEEEEEEEDIERSLIK--SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQ 259
Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
I G N +LL YG+V+ D
Sbjct: 260 IYNTYGNHANFELLRRYGYVEVD 282
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++A+ L +L L + +M E+ G +S W Y L RG L P+ W+E
Sbjct: 85 SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L L G+ + E AE ++ +Y+E + L +P E TFE + +A
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193
>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 115/284 (40%), Gaps = 59/284 (20%)
Query: 113 VAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAEL---------LTTNKLSE 156
V A++++ A + +P N+++ + R G E++ +L TT++ +E
Sbjct: 73 VYATQNVSAKETLVRIPHSFLMNTNTIIKHISRFNGKESVPDLGYSVSLPSEYTTDQWTE 132
Query: 157 LAC---------------LALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVE 195
L ALY+ EKK+ + SFW LP + ELD
Sbjct: 133 LYAKIPISKWLQLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF---------- 182
Query: 196 SPLLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEA 253
+P++W E+E LTGS A+ E R + + +V F + ++ +E
Sbjct: 183 APIVWEVESE---LTGSKA-ADFFELLPRSSRNHAKKVSVRFNEDYTAVSEFLTAAKSEP 238
Query: 254 FTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAV 309
F A++ + S +++ + A F L P L KC + + +V
Sbjct: 239 LNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFSV 298
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
VD + AG+ ++ GP N LL Y F E N ++ L V
Sbjct: 299 ISCVD--HAAGQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
A L GSP AE + + + + SL + YP F++E + A
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAF---ESLLKAYPDYFQPHWFSWESYLWAAEL 220
Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 322
S + +Q + R LVP + + + VD A +++ RP G
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ + GP PNSKLL+ YGF DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309
>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 490
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 65/297 (21%), Positives = 123/297 (41%), Gaps = 49/297 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 11 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70
Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
V + + + + T + + L L + + + +K G + W
Sbjct: 71 YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
YIR + +V P +S E L G+ + + + +++E++ L +
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
W Q++ +D T FTF+ +K +S VV L + A+VP +
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233
Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
S K + +AV QL + + GE + + G + P S+++ +YGFV+ +
Sbjct: 234 ACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENE 290
>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
gi|194701488|gb|ACF84828.1| unknown [Zea mays]
gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
Length = 495
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 71/324 (21%), Positives = 118/324 (36%), Gaps = 73/324 (22%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM +G V+ + S + +V A+ L+ GD ++P +T R
Sbjct: 15 FKRWMRAHG-----VVCSDALSLDVSDPLGVHVRAATPLRDGDLVATIPRGACLT-PRTT 68
Query: 141 GNETIAELLTTNKLSELACLALYL--MYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-P 197
G E CLAL + MYE+ QG S W Y++ L ES P
Sbjct: 69 GAAAAIEAAELG-----GCLALTVAVMYERAQGADSPWDAYLQLLPD--------CESVP 115
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
L+W E L G+ + + E + ++ E ++G L D+ + F+
Sbjct: 116 LVWPAGEAECLLAGTELDKIVKQDKEFLCEDWKECIEPLMLSGEL------DVDPDDFSL 169
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML-------------- 302
E + A V S + + +VPL L + + C+ +
Sbjct: 170 EKYLSAKTLVSSRSFQIDSYHGS---GMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEE 225
Query: 303 --------------------------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
A D+ +++++ R GE + G N+ LL
Sbjct: 226 EEDDRSDASADDKPTTKNPTSSPPGSRANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALL 285
Query: 337 INYGFVDEDNPYDRLVVEAALNTE 360
YGF + DN YD + ++ AL T+
Sbjct: 286 HRYGFTELDNQYDIVNIDLALVTK 309
>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
Length = 319
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 361
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 179 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 237
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 37/173 (21%), Positives = 82/173 (47%), Gaps = 15/173 (8%)
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
I++ G + +++ L+ GFV NP++ + + ++ D + + + S +
Sbjct: 58 ILMDYGKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLIL 117
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVL 433
H+ + ++SD + + R+ +V D+ ++ + +S P+CP P ++A+
Sbjct: 118 HITG--DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA 174
Query: 434 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
L F+ ++ Y +SEDE NL P +R +L E ++L +C++
Sbjct: 175 -FLIMRFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 64/287 (22%), Positives = 114/287 (39%), Gaps = 42/287 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A D+ A + SVP L+ + E L E EL N + L + L+ EK +G S
Sbjct: 158 AKRDIAAEELVLSVPRKLIFSEE--LLPEWKRELFR-NFPTHLN-VTYTLIIEKVRGAAS 213
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 214 AWQPFIDTLPTR-------YSTVLYFTVDQMQRLRGTSACSAAMRHCLVIARLYASMYKC 266
Query: 232 ------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKV------ 276
D V +LF +Y Y++ A + +Q V + S V + +V
Sbjct: 267 AYIQPGDNVMAAKANLFTEYGLCYELYRWAVSTVTTRQNLVPRELSTVGEVDQVCQLGGF 326
Query: 277 ----------SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESI 323
+ AR + L P + +C + + D A Q +KAGE
Sbjct: 327 EGTEIKRDAETGARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECTAQEAFKAGEQF 386
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
++ G + N+ L+++GF+D N D + + L+ DP + + ++
Sbjct: 387 FIYYGDRSNADRLVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 144/353 (40%), Gaps = 48/353 (13%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL- 218
L L L+YE+ + S W EL G V P W +LA L G ++L
Sbjct: 201 LVLSLIYERYVAETSHW----NELLLSCPGGYPNV--PSFWDWEDLAELEGLDVLDDVLA 254
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQ--QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
++A+ + + + + F+ +L ++ D E F+ E A S +L V
Sbjct: 255 KKAQLAQFQTETMAVLPFIHEALAGGCRFGKDEFLECFSIEAMMWARATFDSRAFNL-NV 313
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYK-AGESIVVWCGPQ 330
ALVP+ ++ + ++ ++ V D +Q+ + G I + GP
Sbjct: 314 DGRVVIALVPVAD-MINHHNRSDVLVRKVEPNGGDFVMQIGASLTAQDIGREIWMSYGPL 372
Query: 331 PNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQRNGKLSVQVFH 383
N +LL YGFV E N +DRL E + E + +R +VA L+ + +
Sbjct: 373 QNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVATYGLHLAGRCWI 429
Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
H GR A+ +L ++++ E + + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPALVALLRV----HLAEAEEFDT-MERKGPFASLGAGTEARVVATIADTIRCI 484
Query: 444 LAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRMEKKML 481
L + +L EDE +L D N P KR+A L+RM K +
Sbjct: 485 LDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMGMKRI 536
>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 23/164 (14%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 369
+ Y+ GE + + G N +LL NYGF +NPYD + + + N DP + KR
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306
Query: 370 VAQR-NGKLSVQV-------FHVHAGREKEAISDMLPYL-----RLGYVSDTSEMQS--V 414
+ Q G S + F+ H G+ L L L + + QS
Sbjct: 307 LLQSATGSHSTDIPALRSLRFN-HDGQLAPNAEHWLEILLATPEELSEIITQAASQSGAA 365
Query: 415 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
S++ P+S ++ V ++ ARL + +TL ED+A L
Sbjct: 366 DSTISLALPMS--LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407
>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
Length = 437
Score = 42.0 bits (97), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 361
+++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355
>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
Length = 447
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 364
+A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272
Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 424
+ ++GKLS ++ LRL + + +SV+ + +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317
Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
S E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371
>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
Length = 496
Score = 42.0 bits (97), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 19/121 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
A +DL G F++P +L ++ L + G E L KL + A L L +M+E
Sbjct: 42 ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QGK+S W Y+ L A ++P+ W+E +L+ L G+ ++ + E +R+Y
Sbjct: 98 AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148
Query: 229 N 229
+
Sbjct: 149 D 149
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----------------LS 155
+ A+ DLQ G+A VP L++T + +A L + L
Sbjct: 32 IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
LA +L+ E +G +SFW PY+++L R
Sbjct: 92 PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)
Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 357
D+A Q +VV RPY+ GE +++ G N +LL YGFV E N +D ++ AL
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371
Query: 358 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 384
P NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402
>gi|156717956|ref|NP_001096520.1| N-lysine methyltransferase setd6 [Xenopus (Silurana) tropicalis]
gi|325530258|sp|A4QNG5.1|SETD6_XENTR RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|140832737|gb|AAI35641.1| LOC100125156 protein [Xenopus (Silurana) tropicalis]
Length = 454
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 98/246 (39%), Gaps = 37/246 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELAC-----LALYLMYEK 168
A EDL G+ FS+P S +++ N T I +L+ + S +C L + L+YE
Sbjct: 55 AREDLSDGELLFSIPRSAILS-----QNTTRIRDLIEKEQDSLQSCSGWVPLLISLLYEA 109
Query: 169 KQGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGI 224
S W PY ELD + P+ WSE E L G+ + + + I
Sbjct: 110 TDS-SSHWAPYFGLWPELD--------PPDMPMFWSEEEQTKLLQGTGILEAVHKDLKNI 160
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
++EYN + ++ P T +++K+ V + +
Sbjct: 161 EKEYNSI------VLPFIRRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPQEEDEEEDIE 214
Query: 285 VPLGPP-------LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
+ PP LL + ++ A L + ++++ + AG+ + G N +LL
Sbjct: 215 KDILPPMMVPVADLLNHVAQHNAHLEFTPECLRMITTKSVCAGQELFNTYGQMANWQLLH 274
Query: 338 NYGFVD 343
YGF +
Sbjct: 275 MYGFAE 280
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)
Query: 328 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 383
GP N +LL YGFV E+N +DRL ++ +R +VA L+ + +
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429
Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
H GR A ++ LR+ ++++ E ++ + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484
Query: 444 LAGYPATLSEDEAMLTD 460
L + +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501
>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
6054]
gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 453
Score = 42.0 bits (97), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 51/227 (22%), Positives = 88/227 (38%), Gaps = 42/227 (18%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-------ETELA 206
L+ L+LYL +E+++ SFW P++ L +PL+W E +
Sbjct: 122 LTSFQLLSLYLCFERQRIHSSFWKPFLEMLPDISDFSL----NPLIWQVLQVDQWEELIQ 177
Query: 207 YLTGSPTKAEILERAEGIKREYNE------------LDTVWFMAGSLFQQYPYDIPTEAF 254
+L S + RAE + + E LD + S + P D
Sbjct: 178 FLPESAKR-----RAEDVYERFLEDYVVVRALVSRILDDLKLSESSADEYIPVD------ 226
Query: 255 TFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
+F A++ + S +++ Q + A F + P L +S + + +
Sbjct: 227 ---LFLWAWMCINSRCLYMTIPQGKTNADNFTMAPY-VDFLNHSCNDECSILIDTTGFHV 282
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 357
PY G+ + + GP N LL YGFV DN ++ L + A +
Sbjct: 283 RTTTPYMPGDQLFLSYGPHCNEFLLCEYGFVIPHDNKWNDLDISAYI 329
>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
Length = 494
Score = 42.0 bits (97), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++PL L A +SKC A L +++++ R + E + G PNS++L YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279
Query: 343 DED-NPYD 349
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
Length = 573
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%)
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
SK + L DD V +V R GE I + GP PN+ LL GF DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 2/87 (2%)
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
++++ + AL P+ L C+ + + DR YK GE + + G N
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251
Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTE 360
L++ YGF E+N +D + ++ + E
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEE 278
>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
distachyon]
Length = 480
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 65/284 (22%), Positives = 115/284 (40%), Gaps = 60/284 (21%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTN--KLSELACLALYLMYEKKQ 170
AA+ DL+ G+ VP + ++T +RV+ ++ IA + +LS + L + L+ E +
Sbjct: 45 AAARDLRRGELVLRVPRAALLTSDRVMADDPEIASCIAARHPRLSSVQRLIVCLLAEVGK 104
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI-KREYN 229
GK S W Y+ +L T LA +A ++ A I ++ +
Sbjct: 105 GKSSSWYLYLSQLPSYY---------------TVLATFNDFEIEALQVDDAIWIAQKSLS 149
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
+ + W A L Q + + F+ + AF V S +H V+ L P+G
Sbjct: 150 AIRSEWEDATPLMQGLKF--KPKLLIFKTWLWAFATVSSRTLH---VAWDDAGCLCPVG- 203
Query: 290 PLLAYS----------------SKCKA---MLAAV----------------DDAVQLVVD 314
L Y+ +KC+ ML V +A L
Sbjct: 204 DLFNYAAPDDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDGGYEDSEAYCLYAR 263
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
+ Y GE +++ G N +LL +YGF+ +NP ++ ++ L+
Sbjct: 264 KCYTKGEQVLLGYGTYTNLELLEHYGFLLAENPNEKTYIQLDLD 307
>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
Length = 1243
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 66 ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PYI+ L + + + +LW EL + S E +E E K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171
>gi|449472508|ref|XP_002187588.2| PREDICTED: N-lysine methyltransferase SETD6 [Taeniopygia guttata]
Length = 383
Score = 41.6 bits (96), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 104/243 (42%), Gaps = 26/243 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQGKK 173
A+E+L+AG+ F++P + +++ + + E + S L L L++E
Sbjct: 3 AAEELEAGEVLFTIPRTALLSQHTTSIHALLQEAQESLQSQSGWVPLLLALLHEYT-ASN 61
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL---TGSPTKAEILERAEGIKREYNE 230
S W PY R +++ P+ W + E L TG P + + I+ EYN
Sbjct: 62 SHWQPYFSLWQDFR-----SLDHPMFWPQEERTRLLQGTGIPEAVD--KDLANIQLEYNS 114
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + FM + +P + T E++K+ AFV S L++ + P+
Sbjct: 115 I-ILPFM-----ETHPDIFDPKLHTLELYKELVAFVMAYSFQEPLEEEEEDEKGPNPPMM 168
Query: 289 PP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
P +L + + A L +++V +P + G+ I G N +LL YGF +
Sbjct: 169 VPVADILNHVANHNANLEYSPQCLRMVTTQPVRKGQEIFNTYGQMANWQLLHMYGFAE-- 226
Query: 346 NPY 348
PY
Sbjct: 227 -PY 228
>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
DL-1]
Length = 499
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 88/220 (40%), Gaps = 31/220 (14%)
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
GN+ + E L++ L L L YE G++S W Y+ L + S + W
Sbjct: 74 GNQEVLE-----TLNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFW 121
Query: 201 SETELAYLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQ 243
S EL L S I E+AE G K+ EY +D + + S+
Sbjct: 122 SSEELEKLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIM 180
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
Y +D+ E + K + ++VPL L + ++ A L+
Sbjct: 181 SYSFDVDDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLS 239
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
+DA+ + + K GE I G PNS++L YG+V+
Sbjct: 240 YENDALVMTATKDIKKGEQIYNIYGELPNSEILRKYGYVE 279
>gi|149059901|gb|EDM10784.1| hypothetical protein RDA279, isoform CRA_d [Rattus norvegicus]
Length = 399
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ E+ G S W
Sbjct: 27 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 85
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L P +A+ E+ ++ + +
Sbjct: 86 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 137
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ F A+ V + V+L+ + L+ L P L
Sbjct: 138 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 193
Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +S + AA ++ + R K E+ + + GP N +LL+ YGFV NP
Sbjct: 194 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 251
Query: 348 Y 348
+
Sbjct: 252 H 252
>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
Length = 577
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 87/222 (39%), Gaps = 58/222 (26%)
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 236
P+I L R G SP W+E E + + T A++ I N+L W+
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157
Query: 237 MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 280
+ L FQ Y Y + F FE F V+ S +L ++ +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217
Query: 281 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 319
F P L++ + +C+ ML + D + ++ K
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
G+ I GP+ N +LL YGFV ++N YD+ V+ AL +D
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL--ALRLQD 311
>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
Length = 524
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 100/228 (43%), Gaps = 32/228 (14%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
A+VP+ L A A L ++ ++++ RP K GE I G PN++LL YG
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335
Query: 342 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 382
VD + NP D L+V A L+T+D + + + VF
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGD----DVF 391
Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
++ + E ++ +RL + D E + + P P M+ L L + +
Sbjct: 392 DLYF--DLEIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445
Query: 443 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
RL YP ++ +DE +L T +L+ + + +L EKK+L+ L TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491
>gi|158508540|ref|NP_001025734.2| N-lysine methyltransferase SETD6 [Gallus gallus]
Length = 447
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 70/273 (25%), Positives = 113/273 (41%), Gaps = 37/273 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A+ DL+ G+ FSVP S +++ R L ++ L + S L L L++E
Sbjct: 51 AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SVWVPLLLALLHEYTT 107
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
G S W PY + +++ P+ W E E + L G+ + + I+ EY+
Sbjct: 108 G-TSRWRPYF-----SLWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
+ + FM + +P E T E++KQ AFV S L++ + P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215
Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P +L + + A L +++V +P G+ I G N +LL YGF +
Sbjct: 216 MVPVADILNHVANHNASLEYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
PY NT D D +MV R L
Sbjct: 275 --PYPG-------NTNDT--ADIQMVTVRKAAL 296
>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
Length = 256
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/233 (24%), Positives = 95/233 (40%), Gaps = 23/233 (9%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L +L+ EK G +S W PY+ L + A P+ E ++ L P
Sbjct: 7 LLALCTLCTFLVSEKHAGDQSLWKPYLDILPK-------AYTCPVC-LEPKVVNLFPEPL 58
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 271
KA+ E+ ++ ++ + LF + +I F++ A+ V + V
Sbjct: 59 KAKAEEQRARVQGFFSSSRDFFSSLQPLFSEAVENI----FSYSALLWAWCTVNTRAVYM 114
Query: 272 -HLQKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
H Q+ + L P LL +S + KA ++ + E + +
Sbjct: 115 KHGQRKCFSPEPDTYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFIC 174
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
GP N +LL+ YGFV NP+ + V A L + DK+M N K+S+
Sbjct: 175 YGPHDNQRLLLEYGFVSIQNPHACVYVSADLLVKYLPSTDKQM----NKKISI 223
>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
Length = 177
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L S CK + AV +++++ R KAG + G PN LL YGFV E+NP+D
Sbjct: 54 FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111
Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 388
+ VE E P+ K + +R +S F + R
Sbjct: 112 IPVE----LEVPESPAKVALMERYNLVSHISFELRGFR 145
>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
+ A +L+ G+ VP + ++T E ++ + ++ LS L++ L+YE +
Sbjct: 54 LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113
Query: 171 GKKSFWLPYIRELDR 185
GK+SFW PY+ L R
Sbjct: 114 GKRSFWYPYLVHLPR 128
>gi|143584415|sp|Q5ZK17.2|SETD6_CHICK RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
Length = 447
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 70/273 (25%), Positives = 113/273 (41%), Gaps = 37/273 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A+ DL+ G+ FSVP S +++ R L ++ L + S L L L++E
Sbjct: 51 AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SVWVPLLLALLHEYTT 107
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
G S W PY + +++ P+ W E E + L G+ + + I+ EY+
Sbjct: 108 G-TSRWRPYF-----SLWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
+ + FM + +P E T E++KQ AFV S L++ + P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215
Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P +L + + A L +++V +P G+ I G N +LL YGF +
Sbjct: 216 MVPVADILNHVANHNASLEYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
PY NT D D +MV R L
Sbjct: 275 --PYPG-------NTNDT--ADIQMVTVRKAAL 296
>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
Length = 475
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ E+ G S W
Sbjct: 103 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L P +A+ E+ ++ + +
Sbjct: 162 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 213
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ F A+ V + V+L+ + L+ L P L
Sbjct: 214 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 269
Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +S + AA ++ + R K E+ + + GP N +LL+ YGFV NP
Sbjct: 270 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 327
Query: 348 Y 348
+
Sbjct: 328 H 328
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 105/257 (40%), Gaps = 40/257 (15%)
Query: 126 FSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
SVP L+ + E R+ G T A L LA L+ EK +G+ S W PYI
Sbjct: 164 LSVPRKLIFSEENNSDCRLFGKMTQATHLN---------LAYDLLIEKIRGEFSEWRPYI 214
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L + + L ++ ++ L G+ + L + I ++Y L + A +
Sbjct: 215 DVLPAK-------YSTVLYFTTKQMERLRGTAACSLALRQCRVIAKQYAFL---YRYAHT 264
Query: 241 LFQQ------YPYD----IPTEAFTFEIFKQAFVAV---QSCV-VHLQKVSLARRF--AL 284
L + +P + +++++ A V Q+ V Q+ + +F AL
Sbjct: 265 LAESSTGNRSHPGERGLFFTQRGLCYKLYRWAVSTVMTRQNLVPSEKQEAQDSPKFISAL 324
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+P K + AAV ++ AGE ++ G + N+ LL++ GFVD
Sbjct: 325 IPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFVDV 384
Query: 345 DNPYDRLVVEAALNTED 361
+N D + + L+ D
Sbjct: 385 NNLKDYVNIRVGLSPTD 401
>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
Length = 486
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298
>gi|67538920|ref|XP_663234.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
gi|40743533|gb|EAA62723.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
gi|259484901|tpe|CBF81518.1| TPA: SET domain protein (AFU_orthologue; AFUA_4G11040) [Aspergillus
nidulans FGSC A4]
Length = 707
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 91/241 (37%), Gaps = 28/241 (11%)
Query: 118 DLQAGDAAF-----SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
D AGDA F P+S + ERV G+E +A +L+ + +G
Sbjct: 78 DFHAGDAHFPAHDVKFPSSFI---ERV-GSEEVA--------------IFFLIGQYLRGP 119
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW PYIR L + L E K ++ E + NEL
Sbjct: 120 ESFWHPYIRTLPQPGSLTTLPYYEEEEDLEWLEGTSLLQARKRKVALLREKYESSSNELR 179
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
F ++Y +D+ A T + + V S V+ ++ L+P +L
Sbjct: 180 ESGFQDA---ERYSWDLYLWASTIFVSRAFSEKVLSGVIPEHEMP-ENTSVLLPF-IDIL 234
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+ K A V VV E I GP+ N +L++NYGF +NP D
Sbjct: 235 NHRPLAKVEWRAGLQNVDFVVLEDVSVNEEIANNYGPRNNEQLMMNYGFCLANNPCDYRT 294
Query: 353 V 353
V
Sbjct: 295 V 295
>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
Length = 439
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ E+ G S W
Sbjct: 67 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 125
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L P +A+ E+ ++ + +
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 177
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ F A+ V + V+L+ + L+ L P L
Sbjct: 178 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 233
Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +S + AA ++ + R K E+ + + GP N +LL+ YGFV NP
Sbjct: 234 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 291
Query: 348 Y 348
+
Sbjct: 292 H 292
>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 498
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 90/203 (44%), Gaps = 18/203 (8%)
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W YIR + ++ P ++E EL L G+ + + + +++E+ L
Sbjct: 134 WTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS- 185
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLL 292
Q+Y +D T TF+ +K +S VV L + A+VP +
Sbjct: 186 TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHAS 241
Query: 293 AYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFV-DEDNPYD 349
S K + ++ +DA+ QL R +GE + + G + P S+++ +YGFV +E
Sbjct: 242 EDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTDAK 301
Query: 350 RLVVEAALNTEDPQYQDKRMVAQ 372
++ ++ + +DP K+M +
Sbjct: 302 QIFLDLEIPDDDPLRMAKQMFCK 324
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
K++ A +A+ P S C+ A ++ DR Y+AGE + V GP N
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247
Query: 335 LLINYGFVDEDNPYD 349
LL+ YGF+ + N D
Sbjct: 248 LLVEYGFLLDANSND 262
>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
Length = 471
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 352
A + V++V + K + +++ G N L++YGFV E NPYD +
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311
Query: 353 VEAALNTEDPQYQ-----DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
A P++ ++++Q N + V G + +L +R+ +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371
Query: 408 TSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
E++ + SL I P+ E AV + L+ +P + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426
>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
Length = 479
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 94/450 (20%), Positives = 169/450 (37%), Gaps = 84/450 (18%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +P+ L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W Y R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEILERA-------------EGIKREY---------------NELDTVWFMAGSLFQ 243
+I + A + R + N L + GS+
Sbjct: 135 SVVGKIGKAAADDTILQKVVPLIQANSRHFPPRPNMPPLNSPDSQNALLCLAHRMGSIIM 194
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
Y +DI E A + + +VPL A + + A L
Sbjct: 195 AYAFDIEKTDEADE-----HTADDGYMTDDEDEPAK---GMVPLADIFNADAQRNNARLF 246
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--------- 354
+ + + + +GE I G P + LL YG+V DN VVE
Sbjct: 247 QEEGSFVMKAIKNIYSGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEFSLDAICKV 305
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI-SDMLPYLRLGY--VSD 407
A L +P + R+ N + + +++ G ++AI D L LR + D
Sbjct: 306 AGLPDSEPSPSNPRLELLDNLDMLEEGYNISRIPRNGTLEDAIPEDFLVLLRALTLPIED 365
Query: 408 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--------- 458
+ + + + P S E ++L L R + YP ++ EDE++L
Sbjct: 366 LNRLGARNKAPKPEFSAS---EASLLRSLV---TLRQSEYPTSVQEDESILNCLEQQNGY 419
Query: 459 -TDYNLHPKKRVATQLVRMEKKMLNACLQV 487
D L+ +K++A Q+ + EK++L L +
Sbjct: 420 INDSGLN-RKKMAVQVRKGEKEILTQILSL 448
>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
Length = 497
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIREL 183
K S W PY+ L
Sbjct: 115 KTSRWHPYLMHL 126
>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 661
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517
Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL 241
S + L+RA I E L VW+ G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
+Q P K G + + GP N++LL+ YG+ ++DNPY +E L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488
>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
Length = 539
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 71/360 (19%), Positives = 137/360 (38%), Gaps = 40/360 (11%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
+ DL G+ +VP L+ + E + + + L+ + L+ EK +G S
Sbjct: 151 TRDLAEGELVLTVPRQLIFSEELLPEAQRKLFIDFPTHLN----VTYMLIIEKVRGAASN 206
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL---- 231
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 207 WQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARIYASMYKCA 259
Query: 232 -----DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
D+V +LF +Y Y++ A + +Q V Q V + AL
Sbjct: 260 YMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATDSDGVRNSPMSAL 318
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+P K + ++ + +KAGE ++ G + N+ L+++GF+D
Sbjct: 319 IPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCNADRLVHHGFLDM 378
Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGREKEAISDMLPY 399
+N D + + L+ D + + ++ +R +L V H E +L +
Sbjct: 379 NNLKDYVHIRLGLSPTDALAEQRALLLSELNIERKAELRVLPAPEHISGE------LLAF 432
Query: 400 LRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
+R+ +S S+++ + L C + +E L K L ATL E
Sbjct: 433 VRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKLLLGVLDATLKE 492
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 85/182 (46%), Gaps = 23/182 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYA 218
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
V Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 219 YFYKV-------IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PL 287
PL
Sbjct: 272 PL 273
>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
Length = 263
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 33/63 (52%)
Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
C + D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL
Sbjct: 15 CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74
Query: 358 NTE 360
T+
Sbjct: 75 VTK 77
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 71/164 (43%), Gaps = 25/164 (15%)
Query: 197 PLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P+LW EL L P ++++ LER E E W F++ D+P + +T
Sbjct: 107 PMLWPR-ELKQLL--PLESQVTLERRE------KEFQDNW----DDFKEAFPDVPRDDYT 153
Query: 256 FEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ A++ V + + + K R AL+P+ + C+ + +
Sbjct: 154 Y-----AWLVVNTRTFYHETPETLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYH 207
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+V DR YK GE + + N L+ YGF+ ++N D + ++
Sbjct: 208 VVADRAYKKGEELFISYSSHSNDYNLLEYGFIPDENSLDDVYID 251
>gi|145354661|ref|XP_001421597.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581835|gb|ABO99890.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 341
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 72/313 (23%), Positives = 124/313 (39%), Gaps = 30/313 (9%)
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG-----IKREYN-ELDTVWFMA 238
R+R +G ++ +P + S E T R EG REY + + W A
Sbjct: 2 RERAKGGVSAYAPFVESLYEHTPARAVETSRAARARLEGHAAAETMREYERDAEDGWRAA 61
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
F+ +P FT F++A V++ + R ALVP+ LL +
Sbjct: 62 RRTFETFPSIFSVHEFTRAAFEEALAIVRANSFEARSEDGTRARALVPMAHLLLHDTGSE 121
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG----FVDEDNPYDRLVVE 354
+ VD + VD ++ G+ + G +++ +G + E N ++ ++
Sbjct: 122 VPCVKIVDGVFVINVD-EHEEGDELSCSHGDYSDAETFARFGVSAFYSAEKNARNK--IK 178
Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 414
A +E Y K + R G + F AG A + + LRL ++T E ++
Sbjct: 179 FAFPSE--IYSMKSL--DRCGSVENIAF-TDAG----ATEEFMCALRLASANET-EWAAI 228
Query: 415 ISSLGPI-----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKK 467
S + P+S E AV + L L YP++ +EDE +L L P +
Sbjct: 229 SKSKASVRALRKKPLSEESEIAVYEALFATLTELLNSYPSSDNEDERLLQSRTLQSAPDE 288
Query: 468 RVATQLVRMEKKM 480
A + EK++
Sbjct: 289 ERAVTIRLREKRL 301
>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
Length = 699
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450
>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
UAMH 10762]
Length = 633
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 93/230 (40%), Gaps = 27/230 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILER 220
YLM + ++SFW PY+ L +PL + + +LA+L G+ +L R
Sbjct: 87 FYLMTQYLNKEQSFWKPYLDVLPSPS-----EFSTPLWFDAPADLAWLDGTDVLHTMLAR 141
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQS---CVVHLQ 274
E + Y V +G Y +D+ A FT F + Q+ VH
Sbjct: 142 REVYAQYYQSGLKVLSESGIDVTLYTWDLFRWAITTFTSRSFTSRVLLPQNRKYWPVHRT 201
Query: 275 KVSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKA 319
+ R+ L+ + P L + + A + DA Q + +P +A
Sbjct: 202 STNGRRQTVLLDMSHSPAEDLDFSVLFPGLDSGNHDPNAQVDWSFDANQFSIALVQPIEA 261
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 369
G + GP+ N +LL+ YGF +NP D +++ E Q + KR+
Sbjct: 262 GAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLLTLKAPPEALQVELKRI 311
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R +P CK +A+ +VQ DR Y GE + V GP N LL YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235
Query: 341 FVDEDNPYDRLVVEAAL 357
F+ + N +D + ++ +
Sbjct: 236 FILDTNRWDEVYLDEVI 252
>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
Length = 441
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 15/152 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL--- 281
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
L + P E +T+E + A+ V
Sbjct: 282 ----IPLLSNHREVFPPELYTWEHYLWAYFDV 309
>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
Length = 685
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
++ P+ L A A L +++ +P AGE I PNS LL YG
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427
Query: 342 VDEDNPYD------RLVVEAALN 358
VDE N D +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450
>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 111/282 (39%), Gaps = 55/282 (19%)
Query: 113 VAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAEL---------LTTNKLSE 156
V A++++ A + +P N+++ + R G E++ +L TT++ +E
Sbjct: 73 VYATQNVSAKETLVRIPHSFLMNTNTIIKHISRFNGKESVPDLGYSVLLPSEYTTDQWTE 132
Query: 157 LAC---------------LALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVE 195
L ALY+ EKK+ + SFW LP + ELD
Sbjct: 133 LYAKIPISKWLQLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF---------- 182
Query: 196 SPLLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEA 253
+P++W E+E LTGS A+ E R + + V F + ++ +E
Sbjct: 183 APIVWEVESE---LTGSKA-ADFFELLPRSSRNHAKKVLVRFNEDYTAVSEFLTAAKSEP 238
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 313
F A++ + S +++ S L P + + C A D+ +V
Sbjct: 239 LNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFLV 298
Query: 314 DR--PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ AG+ ++ GP N LL Y F E N ++ L V
Sbjct: 299 ISCVDHAAGQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340
>gi|148237199|ref|NP_001085404.1| N-lysine methyltransferase setd6 [Xenopus laevis]
gi|82184826|sp|Q6INM2.1|SETD6_XENLA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|48734800|gb|AAH72257.1| MGC82362 protein [Xenopus laevis]
Length = 455
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 28/242 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI-AELLTTNKLSELACLALYLMYEKKQGKK 173
A ED+ G+ F+VP S +++ E + E + S L + L+YE
Sbjct: 55 AREDIADGELLFTVPRSAILSQNTTRIQELLEKEQESLQSTSGWVPLLISLLYEATDSS- 113
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
S W PY ELD + P+ WSE E L G+ I + I+ EYN
Sbjct: 114 SLWAPYFGLWPELD--------PPDMPMFWSEEEQTKLLQGTGVLEAIRNDLKNIEEEYN 165
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFV---AVQSCVVHLQKVSLARRFAL 284
+ + P T +++K+ AFV + Q + + + L
Sbjct: 166 SI------VLPFITRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPLEENDEEDEDEKDIL 219
Query: 285 VPLGPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
P+ P LL + + A L + +++V + AG+ + G N +LL YGF
Sbjct: 220 PPMMVPVADLLNHVAHHNAHLEFTPECLRMVTTKSVHAGQELFNTYGEMANWQLLHMYGF 279
Query: 342 VD 343
+
Sbjct: 280 AE 281
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 363
L D Y+AGE I + G N+KLL+ YGF NP D VE + T DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333
Query: 364 YQDKRMVAQR-----NGKLSVQVFHV 384
+ R+ ++ + +L Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359
>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
Length = 430
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 46/204 (22%), Positives = 86/204 (42%), Gaps = 22/204 (10%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
LS LA+YL+ EK++G SFW P+I D +L++ +P++W ++ P
Sbjct: 117 LSSFQLLAIYLVLEKERGAASFWKPFI---DMLPSIEELSL-APVVWKVLQV------PH 166
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DIPT-EAFTFEIFKQAFVAVQSCV 270
++ R++ E + + Y D+P+ AF F A++ + S
Sbjct: 167 CDDLWRMLSRSARKHAES-----VVARFEKDYAVVCDLPSVPAFERSSFLWAWMCINSRC 221
Query: 271 VHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
+++ Q + F + P L +S++ + + ++ YK E +
Sbjct: 222 LYMSMPQAKDTSDNFTMAPY-VDFLNHSNEDQCGIKIDPHGFHVLTSSAYKPQEELYFSY 280
Query: 328 GPQPNSKLLINYGFVDEDNPYDRL 351
GP N LL YGF N ++ +
Sbjct: 281 GPHSNEFLLCEYGFTLPHNKWNYI 304
>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
Length = 1916
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
DA + R Y AG+ + G + N++L+ NYGF++ NP+D V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339
>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
V ASEDL + +P SL+++ ++ + I E+ TN+ +E L Y
Sbjct: 46 VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
+ EKK+G++SF+ PYI+ + Q +A +E PL+ E +L G +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162
Query: 214 KAE-ILERAEGI 224
KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174
>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
CCMP526]
Length = 265
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 375
YK GE + G + N++LL+ YGF DN ++ + + P +Q G
Sbjct: 18 YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77
Query: 376 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 425
+ GR + D+L L + T SE+ + +L C ++
Sbjct: 78 SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129
Query: 426 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 458
PC ER L + A LA +P ++ EDE L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 119/300 (39%), Gaps = 59/300 (19%)
Query: 128 VPNSLVVTLERVLG----NETIAELL-TTNKLSELACLALYLMYEKKQ------GKKSF- 175
+P LV++ E V ++ +LL S + LYL+ Q G ++F
Sbjct: 57 IPRDLVLSAEAVEEYAKVDQNFKQLLDVAGHQSTRGDIMLYLLTHLVQSKATSPGTRAFA 116
Query: 176 ---WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
W YIR L R + P +W+ E L G+ +A + + + EY++L
Sbjct: 117 STPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSSEYDKLC 169
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
A +L + + +E+ T E + A +S + L + A+VP G +
Sbjct: 170 E---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP-GLDMA 219
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFVDEDNPY 348
+S A D +++ RP AG I + G +P +++L +YGF+D+D+
Sbjct: 220 NHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFIDKDSTV 279
Query: 349 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 408
L + +DP L FH++ G P +RL ++D
Sbjct: 280 KELTLHLEALPDDP--------------LGRAKFHIYKGP---------PTVRLSIINDN 316
>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 514
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 95/230 (41%), Gaps = 20/230 (8%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
N + + LAL LMYE+ + S W ++R L +ES L W++ EL +
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
K ER ++ Y + T Y D+ FT E F A V Q+ +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
H S +L+P+ ++ + + A++ A + + KAGE I +
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITI-DYEMS 292
Query: 332 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
+ LL+ YGFV+ D + + ++D Y + + + G LS QV
Sbjct: 293 SHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYPRRLRLLEGRG-LSRQV 341
>gi|300124011|emb|CBK25282.2| unnamed protein product [Blastocystis hominis]
Length = 366
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 74/183 (40%), Gaps = 19/183 (10%)
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
+ E + + SF+ PY L P++W+ +E+ L GS I R
Sbjct: 57 FLLEDMENEDSFYKPYYDTLPED------ISNIPVIWTNSEINQLHGSYFSICIRSRVVE 110
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
I R+Y ++ V S F +YP+D I + F + + + + V LA
Sbjct: 111 IYRDYQKMCDV----NSFFCRYPFDQYLRV-RLLIGSRNFGSFFNSLNNGILVPLADMLN 165
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
Y K KA + + + L + G ++ G + N +LL +YGFV+
Sbjct: 166 HTRPRQTTWEYDDKEKAFV--ITSLLNL------RQGAQVMDSYGRRDNRRLLFSYGFVE 217
Query: 344 EDN 346
+DN
Sbjct: 218 DDN 220
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
V+L+ R +GE I + G N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317
>gi|432119027|gb|ELK38252.1| SET domain-containing protein 4 [Myotis davidii]
Length = 339
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 56/241 (23%), Positives = 100/241 (41%), Gaps = 26/241 (10%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKK 169
H+ A + + G S+P S ++T + V+ + A + S L L +L+ EK
Sbjct: 22 HFRAGASGAREGQVIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVAEKH 81
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G +S W PY+ L + A P+ E E+ L P +A+ E+ ++
Sbjct: 82 AGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVALLPRPLEAKAREQRTRVR---- 129
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL----- 284
EL T S Q + F++ F+ A+ V + V++++ RR L
Sbjct: 130 ELFTSSRGRFSSLQPLLSEAAASVFSYRAFRWAWCTVNTRAVYMER---GRRQGLSAEPD 186
Query: 285 -VPLGPPL-LAYSSKCKAMLAAVDDAV---QLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
L P L L +S + AA ++ ++ + E + + GP + +LL+ Y
Sbjct: 187 TCALAPYLDLLNNSPAVQVKAAFNEETRCYEIRTGSGCRRHEEVFICYGPHDSRRLLLEY 246
Query: 340 G 340
G
Sbjct: 247 G 247
>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 93/433 (21%), Positives = 172/433 (39%), Gaps = 56/433 (12%)
Query: 90 LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL 149
L P +L+ P R Y+ +E+++ G S+P S V+++E + LL
Sbjct: 16 LAPMSTVLQ--PEGFNFGRGTAYIT-TENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLL 71
Query: 150 TTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L
Sbjct: 72 EVNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMKGL 126
Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVA 265
GS + L RA+ + Y+ L + + D P FT + F+ A
Sbjct: 127 EGSQLQRFTLGRAQAVDAFYDAL------VQPVTSREAVDPPIFHKSEFTLDKFRWAMGV 180
Query: 266 VQSCVVHLQK----VSLARRFALVPLGPPLLAYSSK-CKAMLAAVD-DAVQLVV--DRPY 317
V S + V LA + + L ++ C VD D +L V Y
Sbjct: 181 VWSSTFQFGENEDDVILAPVLNTIGICTDLNQEGNEACPETSIKVDTDTQRLTVYASVAY 240
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
G+ + + + +++L++++GF + D+L + L++ D K + Q
Sbjct: 241 SKGQEVRLSMPGKSSTQLMLSHGFARARASKLDKLDLTVTLDSSDTLAPLKNYLLQTQLN 300
Query: 377 LSVQV---FHVHAGREKEAIS-----------DMLPYLRLGYVSDTSEMQSVISSLGPIC 422
S+ F + + E +S ++ Y L ++ E + ++S
Sbjct: 301 ESINATYEFFYGSSKIDEYVSTSLRMKLLSGGELARYKELLTPTEGEEHRPIVSLRNEF- 359
Query: 423 PVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AMLTDYNLHPKKRVA--TQLVRME 477
RAV+ K YP ++ +D+ A L+D + R+A +++ ME
Sbjct: 360 ----VFTRAVISTCTTLLKQ----YPTSIEQDQENLAKLSDKDDVESVRIAHVQRILIME 411
Query: 478 KKMLNACLQVTAD 490
K++LN +++ D
Sbjct: 412 KQILNETMELALD 424
>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
Length = 163
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 36/73 (49%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ + A E ++ G VP L+++ + +E + L+E L L+L+ E+
Sbjct: 77 YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136
Query: 171 GKKSFWLPYIREL 183
G +SFW PY+ L
Sbjct: 137 GSRSFWAPYLDTL 149
>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
Length = 492
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 134/337 (39%), Gaps = 48/337 (14%)
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKR----- 226
+S W PY L + ++S + WS ELA L S ++ ++AE I
Sbjct: 104 ESKWAPYFNVLPTK-------LDSLVFWSPEELAELQASAVLKKVGKDKAEEIFHQSISK 156
Query: 227 ---EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH-LQKVSLARRF 282
E ++D ++ S Y +DIP E + V QK SLA
Sbjct: 157 VTPEGTDVD-IFHRVASTIMAYAFDIPD----IEQEDEEGANEDDLVDDDEQKTSLA--- 208
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++PL +L + A L + +++ P K GE I+ G P S LL YG+V
Sbjct: 209 -MIPLAD-MLNADADNNARLHYDGEELEMRTINPIKTGEEILNDYGQLPRSDLLRRYGYV 266
Query: 343 -DEDNPYDRLVVEAALNT-EDPQYQD---KRMVAQRNGKLSVQVFHV------------- 384
D+ +D V E + +T D YQD + V R G++ ++
Sbjct: 267 TDKYATFD--VAEISTSTITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAHDVG 324
Query: 385 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 444
HA E ISD L L + + ++ SS S V L + R
Sbjct: 325 HATEEWPCISDELVALVYLLLVGEETLAAIQSSKMSFPSRSKMETELVGKALQRILERRE 384
Query: 445 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
Y T+ EDE +L N + ++A Q VRM +K++
Sbjct: 385 REYATTVVEDENLLQSGNHSNRVKMAIQ-VRMGEKVV 420
>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 705
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
+AA+ED+ A +PN ++++L ++ E + +++ N +E +A+Y
Sbjct: 21 IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+++EK +G+KSF+ PY + R +E L E+ Y
Sbjct: 80 VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
++ L+ G+ + GP+PNS+L+++YGF +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255
>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
Length = 321
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)
Query: 57 VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
V++ D A + ++ +ED W NG + K+ +K + V
Sbjct: 50 VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
E ++ G+ S P +L + + + + + + L +K + L++++E K GK
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162
Query: 174 SFWLPYIRELDRQ 186
SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 343 DEDNPYDRLVVEAALNTED 361
D+ N D + + L+ D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397
>gi|41054567|ref|NP_955894.1| N-lysine methyltransferase setd6 [Danio rerio]
gi|82177062|sp|Q803K4.1|SETD6_DANRE RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|27882107|gb|AAH44440.1| SET domain containing 6 [Danio rerio]
Length = 460
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 96/244 (39%), Gaps = 31/244 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----LSELACLALYLMYEKK 169
A ED++ G F++P ++ G + ++L K S L L LMYE
Sbjct: 53 AKEDIEEGHVLFTIPREALLHQ----GTTKVKKVLEEGKKCLESASGWVPLLLSLMYEYT 108
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREY 228
S W PY+ R ++ P+ WSE E L G+ ++ ++ EY
Sbjct: 109 SST-SHWKPYLSLWPDFR-----TLDQPMFWSEEECDKLLKGTGIPESVITDLRKLQDEY 162
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQS----CVVHLQKVSLARRF 282
N + + FM + +P E E++K AFV S + +
Sbjct: 163 NSV-VLPFM-----KSHPDLWDPEKHNLELYKSLVAFVMAYSFQEPVEDDDEDEEDDEKK 216
Query: 283 ALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
+P+ P+ L + SK A L + +++V R GE + G N +LL Y
Sbjct: 217 PNLPMMVPMADMLNHISKHNANLEYTPECLKMVSIRRIGKGEEVFNTYGQMANWQLLHMY 276
Query: 340 GFVD 343
GF +
Sbjct: 277 GFAE 280
>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
Length = 489
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 79/382 (20%), Positives = 149/382 (39%), Gaps = 75/382 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ P + ++T +RV ++ IA ++ ++ LS + L + L+ E
Sbjct: 57 LAAARDLRRGELVLRAPRAALLTSDRVTADDPRIAACVSAHRPRLSSVQILIVCLLAEVG 116
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
+G+ S W PY+ +L T LA T + E L+ + I ++
Sbjct: 117 KGRNSVWYPYLSQLPSYY---------------TILA--TFDDFEVEALQVDDAIWVAQK 159
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
+ + + W L ++ + + F+ + AF V S +H ++ L P
Sbjct: 160 AKSAIKSDWEDVTPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 214
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQL---------------VVDRPY-------------- 317
+G L Y++ +D +L + D Y
Sbjct: 215 VG-DLFNYAAPDDDTSLEAEDTAELTNYQQKNEMINSSERLTDGGYEDSNAYCLYARKNY 273
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-TEDPQYQDKRMVAQRNGK 376
K GE +++ G N +LL +YGF+ +NP ++ +E L+ + M NG
Sbjct: 274 KQGEQVLLGYGTYTNLELLEHYGFLLGENPNEKTFIELDLDICSGGTWPKDSMYIHSNGH 333
Query: 377 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 436
S +L LRL + + T+ ++V + +S E ++ L
Sbjct: 334 PSFA---------------LLCALRL-WSTPTNHRKAVGHQIYSGSMLSTENEMGIMKWL 377
Query: 437 ADYFKARLAGYPATLSEDEAML 458
+ L P T+ DE++L
Sbjct: 378 ISRCEGTLQQLPTTVEFDESLL 399
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
AL+P K + AAV ++ +AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 343 DEDNPYDRLVVEAALNTED 361
D +N D + + L+ D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397
>gi|354502761|ref|XP_003513450.1| PREDICTED: SET domain-containing protein 4 [Cricetulus griseus]
Length = 440
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T V+ ++ + K S L L +L+ E+ G +S W
Sbjct: 66 LQEGQMIISLPESCLLTTNTVI-RSSLGPYMKKWKPPPSPLLALCTFLISERHAGGQSLW 124
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E ++ L P KA+ E+ ++ + +
Sbjct: 125 KSYLDILPK-------SYTCPVCL-EPDVVDLLPQPLKAKAEEQRADVQDFFASSRAFFS 176
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV---SLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ L+ L P L
Sbjct: 177 TLQPLFVE-PVD---GIFSYSAFLWAWCTVNTRAVYLRSTRQECLSAEPDTCALAPYLDL 232
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 233 LNHSPHVQVKAAFSEKTGCYEIRTASRCRKHEQVFICYGPYDNQRLLLEYGFVSVCNPH 291
>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
Length = 546
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437
>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 1 [Galdieria
sulphuraria]
Length = 487
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 47/331 (14%)
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LD 232
S W PYI L G + WS +ELA L P E+ I + Y E L
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEV-----KINQYYREALY 214
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
T F + S P + + +F A VQS + V + +AL+P+ L
Sbjct: 215 TRVFESLS----SPVRVWLQNEKENVFFWALDMVQSRAFGIPDVG-NKTYALLPMMDMLN 269
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+S+ + ++ + ++ G I + GP N LL YGF+ +NP D
Sbjct: 270 HRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYF 329
Query: 352 VVEAALNTEDPQYQDKRMVAQ------------------RNGKLSVQVFHVHAGREKEAI 393
V+ Y+ + AQ NGK + ++H H E + I
Sbjct: 330 QVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGK-TFHLYHDHYDDEIDII 388
Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA--VLDQLADYFKARLAGYPATL 451
LR+ S T Q + + + +E V + K L ++
Sbjct: 389 ------LRVFMASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSV 442
Query: 452 SEDEAMLTDYN-LHPKKRVATQLVRMEKKML 481
EDE +L + + L K ++A Q R+EKK +
Sbjct: 443 EEDEQLLKNKDQLSTKLQLAIQF-RLEKKYI 472
>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 544
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+ LV+ P G ++ GP+PN++L++ YGF +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295
>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
2508]
gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 533
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 83/205 (40%), Gaps = 30/205 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 218
L L LM+E QG S W PY+ L Q ++P+ W+E ELA L S A++
Sbjct: 131 LILILMHEYLQGSSSNWSPYLSILPHQ-------FDTPMFWTEAELAELQASALVAKVGK 183
Query: 219 ERAEGIKRE-----YNELDTVWFMAGSLFQQ-----------YPYDIPTEAFTFEIFKQA 262
+ A+ + R E + V++ AG+ Q + A+ F++ K+
Sbjct: 184 DEADKMIRTKIVKVVQENEDVFYPAGTPKTQRLDEGELLKLGHRMGSAIMAYAFDLAKEE 243
Query: 263 FVAVQSCVVHLQKV-----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
V + +VP+ +L + A + + + R
Sbjct: 244 DDDEDEEEEEDGWVEDKIGGMNDTMGMVPMA-DMLNADAVFNAHINHGEACLTATSLREI 302
Query: 318 KAGESIVVWCGPQPNSKLLINYGFV 342
K GE I+ + GP +++LL YG+V
Sbjct: 303 KEGEEILNYYGPLSSAELLRRYGYV 327
>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
Length = 562
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 66/299 (22%), Positives = 121/299 (40%), Gaps = 47/299 (15%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L D W KNG + I ++ S + + + A+E+ +P+ L++T E
Sbjct: 4 LEDCIQWAVKNGSIVDERIHFKQSSISGISAVVEGILATEE-----PLIQIPSKLLITNE 58
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKK----QGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ + ++ + + A AL +Y K +G S + PYI L L
Sbjct: 59 K--AQESFQ--VDSDVIDKNAPNALVQLYVAKLKFAKGMPSIYQPYIDLLP-------LK 107
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--------------------DT 233
+E P W EL + G+ + +R + E+ L D
Sbjct: 108 LEQPYFWDWKELQVIKGTDLYLVMKQRLPKLLEEWTTLLKKLSLEPSDDLGQLETPGLDL 167
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS---CVVHLQKVSLARRFALVPLGPP 290
V ++A +++ +P +F ++ A ++ ++ Q +S+ F L P+
Sbjct: 168 VDYVAR--YRETNEQLPWNSFAAYVWSAGIFASRAFPKIALNDQCLSINEAF-LYPI-VD 223
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L + + K D + V K+GE + G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282
>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
Length = 253
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 37/205 (18%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+ E+ + SFW PYI L P+ + ++ L +P ++ +
Sbjct: 3 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 54
Query: 220 RA----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
R E K +L TV + + Q D+ + + + A A S L
Sbjct: 55 RCRFLLEFEKEVQQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH- 105
Query: 276 VSLARRFALVPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVW 326
VP+ PL+ +++ + + + ++D +V+++ ++ K E+I +
Sbjct: 106 -------GEVPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLN 158
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRL 351
G PN L++YGFV NPYD++
Sbjct: 159 YGCYPNDFFLLDYGFVITQNPYDQV 183
>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
Length = 854
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
V A E +Q G+ + N V+ L L + + + N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571
Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++ S + P++ +L P+L S+ +L +L GS E+ + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626
Query: 228 Y 228
+
Sbjct: 627 F 627
>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 225
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|407920105|gb|EKG13323.1| hypothetical protein MPH_09605 [Macrophomina phaseolina MS6]
Length = 574
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 86/224 (38%), Gaps = 32/224 (14%)
Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
E + LL N LS +A + L+ G+KS W PYI L + + +P+ ++E
Sbjct: 71 EVVQGLLPNNVLSNIALIKELLL-----GEKSLWAPYINCLPKSE-----QLNTPIYFAE 120
Query: 203 -----------TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ A+L G+ R E + E+ +V G I T
Sbjct: 121 EMTQEAINGRRNDTAWLLGTNLDKSWRPRKEQWEEEWKNAVSVLKRQG---------IAT 171
Query: 252 EAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
E +T++ + A +S + + ++A++ LL + K +
Sbjct: 172 EGYTWDAYAWAATIFTSRSFISDPGLSKESSQYAVLMPVIDLLNHRFPTKVAWFFNEGNF 231
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
Q + + P G I G + N +LL YGF +N D + +
Sbjct: 232 QFITEEPVPKGHEIFNNYGGKGNEELLNGYGFCIPNNHCDEVAI 275
>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
Length = 1017
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 30/52 (57%)
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
AGE ++ G + N++ L+N GFVD DN D + + L+ DP + + ++
Sbjct: 838 AGEQFFIYYGDRTNTEFLVNNGFVDPDNRNDYVNIRLGLSPTDPLAEKRAII 889
>gi|156054286|ref|XP_001593069.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980]
gi|154703771|gb|EDO03510.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 418
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 94/228 (41%), Gaps = 24/228 (10%)
Query: 147 ELLTTNKLSELACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
E L T K + + +LM + + +KS W YIR L + L + P+ W E +
Sbjct: 77 EFLETLKQDDPNIIGHFFLMQQYLKCEKSPWWQYIRLLPQPGDPKSLGI--PIWWPEEDQ 134
Query: 206 AYLTGSPT----------------KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
+L G+ K +L R +EY+ + W A ++F +
Sbjct: 135 KFLAGTNAGPPLQKREQMWRDQWKKGVVLLRELPNHKEYSYILYQW--AATIFDSRSFR- 191
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
P+ E ++ + + H++ + + LV +G + K L++ ++
Sbjct: 192 PSLTICPEALSESSKEMDLNLDHVRNDRFSILYPLVDIGNHNGINQVEWKKDLSS--NSF 249
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
LV G+ I + G + NS+LL+ YGF+ ++ +R VV L
Sbjct: 250 DLVHSAGVSEGDQIYNYYGNKSNSELLLGYGFILPNDIVNRNVVNLKL 297
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
GE +V+ G + N +LL +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233
>gi|406978090|gb|EKE00118.1| hypothetical protein ACD_22C00090G0009 [uncultured bacterium]
Length = 478
Score = 39.3 bits (90), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 74/191 (38%), Gaps = 28/191 (14%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+HY+ ++ G FS P + T + G E + E L SE +Y MY K
Sbjct: 77 PMHYMQKEKEHVKG---FS-PELAIAT---IAGGEKLTEELAIRPTSETI---MYDMYRK 126
Query: 169 KQGKKSFWLPYIRELD----------RQRGRGQLAVE-SPLLWSETELAYLTGSPTKAEI 217
W R+L R R L + S LW E A+LT + +
Sbjct: 127 -------WTNSWRDLPVLINQWCNVVRWEKRTYLFLRTSEFLWQEGHCAHLTHEESTDTV 179
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
+ K+ YN+L +++ + G + + + +TFE ++Q+C H +
Sbjct: 180 IWAINAYKKTYNDLMSIYGIVGVKSESEKFAGAVKTYTFESLMPNGKSLQTCTSHDLGQN 239
Query: 278 LARRFALVPLG 288
++ F G
Sbjct: 240 FSKSFEWTVQG 250
>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
prasinos]
Length = 866
Score = 39.3 bits (90), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 62/269 (23%), Positives = 111/269 (41%), Gaps = 53/269 (19%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC---------LALYL 164
A +ED++ GD +P S +LE +E + + + + +A+++
Sbjct: 28 AVTEDVRRGDVLLEIPLSRCFSLESAQKSEMLTKAMAKAAAAAAGTRFTPTHDQYMAMFI 87
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEILERAE 222
+ E+ GK+S +I + + A + PL WSE E + L G+ T AE L E
Sbjct: 88 LLEQNLGKQSSHYEHILSIPK-------AYDLPLFWSEEERQRSLLFGTTTYAETLALDE 140
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSC----------- 269
+ ++Y L + F++ + T + FK +A + + C
Sbjct: 141 EVIQDYELLKH--HLGEDFFRE-------QNITMDRFKWVRATLWSRQCDLLRPAPETTR 191
Query: 270 -VVHLQKVSLARRFALVPLGPP-LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
V + + + + VPLG L YS + ++ A A + P I
Sbjct: 192 LRVLIPEFDMFNHSSKVPLGSSHKLNYS---RGLVTAFATA-----NVPKGEQAYISYGS 243
Query: 328 GPQPNSKLLINYGFV---DEDNPYDRLVV 353
G +SKLL+ YGF + +NP+++L V
Sbjct: 244 GEASSSKLLLWYGFAPLNEGENPFEQLDV 272
>gi|321257099|ref|XP_003193469.1| nucleus protein [Cryptococcus gattii WM276]
gi|317459939|gb|ADV21682.1| nucleus protein, putative [Cryptococcus gattii WM276]
Length = 491
Score = 39.3 bits (90), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 58/260 (22%), Positives = 99/260 (38%), Gaps = 47/260 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT--LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
A +D++ G F V ++L+++ + + +E NK A L L +M+E +G
Sbjct: 46 AVKDIEEGTPLFHVTDNLILSPYTSDLKDHLDASEWDQLNK--GWAQLILVMMWETIKGS 103
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W Y+ + + E+P+ W+E + L+G+ + A+ I RE E +
Sbjct: 104 KSRWAGYLTNM-------PVMFETPMFWTEQQRDQLSGT-------DIADRIGREDAEAE 149
Query: 233 TVWFMAGSLFQQ---YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------- 281
+A + +P D P + + + +S V L + ++
Sbjct: 150 YTSLLAPFIKAHPDLFPVDSPHTTIDAFHIQGSRILSRSFTVPLHRFGRSQSQSQSDGNE 209
Query: 282 -----------FALVPLGPPLLAYSSKCKAML-------AAVDDAVQLVVDRPYKAGESI 323
++P L A K A L D+ V + R K E I
Sbjct: 210 TESDDEEEEEVVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTRLVKQSEQI 269
Query: 324 VVWCGPQPNSKLLINYGFVD 343
PNS+LL YG VD
Sbjct: 270 YNTYDSPPNSELLRKYGHVD 289
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
D AV L++ P G ++ GP+PN++L++ YGF NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323
>gi|335300684|ref|XP_003358991.1| PREDICTED: SET domain-containing protein 4 [Sus scrofa]
Length = 440
Score = 38.9 bits (89), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 74/332 (22%), Positives = 127/332 (38%), Gaps = 40/332 (12%)
Query: 65 AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDA 124
A SR V + + +LK W+ +I P + + LQ G
Sbjct: 20 AESRGVNESYKPEFIELKKWLKDRNFEDTNLIPARFPGTGRG------LMSKTSLQEGQL 73
Query: 125 AFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIREL 183
++P S ++T + VL + + S L L +L+ EK G +S W PY+ L
Sbjct: 74 VIALPESCLLTTDTVLRSYLGPYIAKWQPPPSPLLALCTFLVSEKHAGDQSPWKPYLEVL 133
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
+ P+ E E+ L P K++ E+ R + + SL
Sbjct: 134 PK-------TYTCPVC-LEPEVVNLLPGPLKSKAREQR---TRVWEFFSSSRDFFSSLQP 182
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----FALVP----LGP--PLLA 293
+P + + IF + + C V+ + V + +R F+ P L P LL
Sbjct: 183 LFPEAVES------IFSYSALLWAWCTVNTRAVYMKQRPRQCFSTEPDTCALAPYLDLLN 236
Query: 294 YSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+S + KA ++ + E + + GP + +LL+ YGFV NP+ +
Sbjct: 237 HSPAVQVKAAFNEESRCYEIRTGTSCRKHEEVFICYGPHGSHRLLLEYGFVSPRNPHACV 296
Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 383
V + + DK+M N K+S+ H
Sbjct: 297 YVPKDILVKYLPSTDKQM----NKKISILKDH 324
>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 241
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ +P+ + +L L GSP L+ I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226
>gi|42820762|emb|CAF32075.1| SET domain protein, putative [Aspergillus fumigatus]
Length = 530
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 117/289 (40%), Gaps = 41/289 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 59 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 118
Query: 138 RVLGNETIAELL-----TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
V + I L + ++ +K G + W YIR +
Sbjct: 119 YVHNHAKIDRHLREVLDAVGDFGRVCYSPDFVNKRQKIGISNPWTEYIRFM-------PA 171
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPY 247
+V P +S E L G+ + + + +++E++ L + W Q++ +
Sbjct: 172 SVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEIPWC------QEHWW 225
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLLAYSSKCKAMLAA 304
D T FTF+ +K +S VV L + A+VP + S K +
Sbjct: 226 DEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANHACEDSVKARYDEEG 281
Query: 305 VDDAV-QLVVDRPYKAGE----SIVVWC---GPQPNSKLLINYGFVDED 345
+AV QL + + GE + V C +P S+++ +YGFV+ +
Sbjct: 282 AGNAVLQLRTGKKLRVGEEKLHADAVACRYGDEKPASEMVFSYGFVENE 330
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 17/109 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEK 168
A +D+ G FS+P L ++L T+ LL ++ E L L +M+E+
Sbjct: 43 ALQDIHEGTTLFSLPRELTLSLR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEE 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+G +S W Y+ L ++P+ WS +L L G+ +I
Sbjct: 99 SRGVESKWDVYLSSLPS-------TFDTPMFWSAEDLEELKGTAVPDKI 140
>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 429
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 373
+ Y+ GE +++ G N +L+ YGFVD DN D E ++ Y KR +
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305
Query: 374 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 422
+LS + H A E ISD +L LR V T E + +
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362
Query: 423 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
ER D + ++RLA YP T+ EDE L N +R A Q +K++L
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420
Query: 483 ACLQV 487
++
Sbjct: 421 DASRI 425
>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
Length = 339
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 99 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155
>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
Length = 634
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 45/203 (22%), Positives = 89/203 (43%), Gaps = 23/203 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ ++ +GK+S W YI L G ++ +PL + + ++A+L G+ ER
Sbjct: 105 LLLIEQRNKGKESPWHAYIACLP-----GAESMTTPLWFDDEDMAFLAGTSLAPAAKERK 159
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSLAR 280
+++ + + AG D + + IF +AF++ H
Sbjct: 160 SLYYQQWEQALGIMKDAGVALAD-EVDFESLLWAATIFTSRAFISTHILPDH-------- 210
Query: 281 RFALVPLGPPLL-----AYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWCGPQPNSK 334
VPL P++ + S+K + + +++L+ + AG+ + P+ N +
Sbjct: 211 --ETVPLLFPIVDILNHSVSAKVEWEFQPLASFSLKLLEGDTFTAGQELFNNYAPKQNDE 268
Query: 335 LLINYGFVDEDNPYDRLVVEAAL 357
LL+ YGF E NP ++ ++ A
Sbjct: 269 LLLGYGFCLEHNPIEQFPLKLAF 291
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+A L D+ +G+ + + GP+ N +LL YGFV+ +NP D V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407
>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
Length = 495
Score = 38.9 bits (89), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
Length = 495
Score = 38.9 bits (89), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
Length = 216
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ ++ + +VP S+ +T E+V N T S +L+L+ +K GK
Sbjct: 22 VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KSF+ PYI L ++ + + L + E ++ Y+ + + + ER ++ ++++L
Sbjct: 77 KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128
>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 147
Score = 38.5 bits (88), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIREL 183
+G S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124
>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
Length = 789
Score = 38.5 bits (88), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 42/194 (21%), Positives = 84/194 (43%), Gaps = 26/194 (13%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T + + + K++E + + W+ NG+ V + P + + +AA +D+
Sbjct: 28 TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84
Query: 122 GDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLMYEKKQGKKS 174
A VP L+++ E + N IA E+ ++ +E + ++ +E +G+ S
Sbjct: 85 QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-NELDT 233
FW PY + ++ + P+LWS+ E+ + +I + K EY NE
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEYENEWKL 191
Query: 234 VW--FMAGSLFQQY 245
V+ F + +Y
Sbjct: 192 VYEAFSKDETYDEY 205
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 38.5 bits (88), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 353
+ DR YKAGE + V G N LL+ YGF+ + N P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250
>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 297
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++AG+ VP LV+T+E N + L + +++ + LA +L+ ++
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170
Query: 172 KKSFWLPYIREL 183
SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182
>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
Length = 429
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 30/59 (50%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
R YK GE I + GP PN L + YGF E N D + ++ + + + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322
>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
Length = 289
Score = 38.5 bits (88), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 47 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103
>gi|443699166|gb|ELT98776.1| hypothetical protein CAPTEDRAFT_151537 [Capitella teleta]
Length = 413
Score = 38.5 bits (88), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 104/262 (39%), Gaps = 42/262 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL-----TTNKLSELACLALYLMYE 167
+ A+ D+ GD F +P SL++T + N TI LL + + S L + LMYE
Sbjct: 1 MVATSDISQGDTIFEIPRSLLLTPQ----NSTIGVLLNEEADSLQEASRWVPLLITLMYE 56
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEILERAEGIKR 226
S W PY D QL + P+ WS E+ L G+ + + I +
Sbjct: 57 YT-SPSSRWKPY---FDLVPDFDQLDL--PMFWSSDEVKRELKGTGIPSLVESDLLNISK 110
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHL----------- 273
E+N+L Q++ E + +K+ AFV S
Sbjct: 111 EFNDL------VLPFIQKHSNVFSDECKCLKFYKKMVAFVMAYSFTEPPPSPDLDDSDDL 164
Query: 274 --QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGP 329
+ L + +VP+ +L + +K A L ++++V + + GE I G
Sbjct: 165 SGDEHDLMPQPMMVPMA-DILNHVAKNSARLDFPKGSSSLKMVATQDIQKGEEIFNTYGE 223
Query: 330 QPNSKLLINYGFVDED--NPYD 349
N LL YGF ++ N YD
Sbjct: 224 LANMNLLHMYGFAEDIGCNEYD 245
>gi|403366800|gb|EJY83208.1| hypothetical protein OXYTRI_19172 [Oxytricha trifallax]
Length = 869
Score = 38.5 bits (88), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 75/170 (44%), Gaps = 18/170 (10%)
Query: 302 LAAVDDAVQLVVDRPYKAG---ESIVVWC-GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
++ V D ++ R Y G S V C G N ++L YGF N Y+ + ++ L
Sbjct: 454 VSQVPDDFNFII-RTYNDGFPKGSQVFLCYGRMSNREMLKRYGFCLTYNKYNYIFIKLRL 512
Query: 358 NTEDPQYQDKRMVAQR-------NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
+DP + ++ V ++ K+ + H +K + +L ++++ Y + +
Sbjct: 513 EQQDPDFIYRKYVLRKFFSIEPETDKMDISSRHFRIYFQK-LNTKVLKFIKILYFNVQED 571
Query: 411 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
S I + S +E +L D ++ L +P T+ ED+ +L++
Sbjct: 572 DISCI-----VETRSLSLEYLAFQRLRDVYETFLKSFPTTIGEDKKILSE 616
>gi|428183324|gb|EKX52182.1| hypothetical protein GUITHDRAFT_150712, partial [Guillardia theta
CCMP2712]
Length = 205
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
++D+++ S+P+ L L R + +KL +AL ++YEK + ++SF
Sbjct: 88 TQDVKSNSVVCSIPSKLF--LSRSTTRAAFGSM--ADKLDVRTAMALQILYEKSKKEESF 143
Query: 176 WLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
W +++ L DR+ + +P LW E + L G+ + E E K Y++
Sbjct: 144 WCEWLKVLPDREN------LGTPCLWPEDDQNLLKGT----SVFEEVEASKSLYSK 189
>gi|424513789|emb|CCO66411.1| predicted protein [Bathycoccus prasinos]
Length = 532
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 75/332 (22%), Positives = 131/332 (39%), Gaps = 53/332 (15%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
E+D DL +W KNG+ K S + R + GDAA + S++
Sbjct: 219 EKDRDDLLNWGVKNGVDFIAASFVRKGSDIDYIRSV----------LGDAAPKI--SIIS 266
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+E + G + +++ + +A L + +Q +L R + R G+
Sbjct: 267 KVENMEGLDNFEDIVDKSDGVMVARGDLGMEIRMEQ----IFLAQKRMIKRCNEAGK--- 319
Query: 195 ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
P++ + L +TG+P T+AE + A I + D V + YP +
Sbjct: 320 --PVITATQMLESMTGAPRPTRAEATDVANAI---LDGTDCVMLSGETAAGDYP--LEAV 372
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ +I ++A + S V Q LLAY S +L ++ +
Sbjct: 373 SCMADICREAEAYIDSAAVFQQ----------------LLAYQSVPMNILESLASSS--- 413
Query: 313 VDRPYKAGESIVVWCGPQPN-SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 371
V K G ++V N S+L+ Y D P + V NT DP+ +RM+A
Sbjct: 414 VRSAQKVGAKLIVTLAKSGNTSRLIAKY---RPDCPVLSVCVNMEENTHDPENTARRMLA 470
Query: 372 QRNGKLSVQ--VFHVHAGREKEAISDMLPYLR 401
R K ++ +H +G +E ++ + Y R
Sbjct: 471 SRGLKPMIEPAEWHAQSGHPQEISANAILYAR 502
>gi|301119251|ref|XP_002907353.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105865|gb|EEY63917.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 109/248 (43%), Gaps = 24/248 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + + + +L+ E LA+ L+YEK
Sbjct: 43 VFIAEDVTPHTEVFSIPLDSVLSVKSLQDISALQSITFFQQLTPEREDDQLAIALLYEKY 102
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QG KS W +I L + + L + E+ L GS + E + +Y
Sbjct: 103 MQGDKSKWAKHIELLPK-------TYHNALYFEAGEIKALEGSNLFFIAQQMEEKVASDY 155
Query: 229 NEL-DTVWF-MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALV 285
L ++V F + ++ + D+ E F+ + +K A + S V+ + K S A+V
Sbjct: 156 AVLKESVLFELFENITEGITVDLFDEIFSLDNYKWALSTIWSRFVLPVAKQSFK---AMV 212
Query: 286 PLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
P+ L + +A ++ D +LV + + AG + + G N KLL YGF
Sbjct: 213 PVFDML---NHDPEAEMSHFFDMETQCFKLVSHQHWNAGAQMFINYGALSNHKLLSLYGF 269
Query: 342 VDEDNPYD 349
V N +D
Sbjct: 270 VIIGNLFD 277
>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 510
Score = 38.1 bits (87), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 92 PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
P +L+ P R Y+ A E+++ G S+P S V+++E + LL
Sbjct: 88 PMSTVLQ--PEGFNFGRGTAYITA-ENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEV 143
Query: 152 N-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L G
Sbjct: 144 NPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMNELEG 198
Query: 211 SPTKAEILERAEGIKREYNEL 231
S + L RA+ ++ Y+ L
Sbjct: 199 SQLQRFTLGRAQAVEAFYDAL 219
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.133 0.392
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,692,010,624
Number of Sequences: 23463169
Number of extensions: 308674708
Number of successful extensions: 825864
Number of sequences better than 100.0: 885
Number of HSP's better than 100.0 without gapping: 331
Number of HSP's successfully gapped in prelim test: 554
Number of HSP's that attempted gapping in prelim test: 824424
Number of HSP's gapped (non-prelim): 1222
length of query: 512
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 365
effective length of database: 8,910,109,524
effective search space: 3252189976260
effective search space used: 3252189976260
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)