BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011632
(481 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/462 (79%), Positives = 393/462 (85%), Gaps = 38/462 (8%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227
Query: 231 LDTVWFMAGSLFQ-------------------------------KVSLARRFALVPLGPP 259
LDTVWFMAGSLFQ KVSLARRFALVPLGPP
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPP 287
Query: 260 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
LLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDEDN YDR
Sbjct: 288 LLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDEDNSYDR 347
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 379
+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGYVSD SE
Sbjct: 348 IVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSE 407
Query: 380 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 439
MQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+PKK VA
Sbjct: 408 MQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVA 467
Query: 440 TQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 481
TQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 468 TQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/499 (74%), Positives = 411/499 (82%), Gaps = 41/499 (8%)
Query: 21 HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
H PL + S IS+S R +F +S +RR N S SSS+TLVAGSR E V+K
Sbjct: 24 HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81
Query: 74 KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
KE+D GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82 KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141
Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ--------- 243
AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQ
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261
Query: 244 ----------------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
KVSLARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELV 321
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
VDRPYKAGESI VWCGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTEDPQYQDKRMVAQ
Sbjct: 322 VDRPYKAGESIAVWCGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQ 381
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 401
RNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGYV+ SEMQSVISS GP+CPVSPCMERA+
Sbjct: 382 RNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPVCPVSPCMERAM 441
Query: 402 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
L+Q+ADYFK RLAGYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L++CL+VT D I
Sbjct: 442 LEQVADYFKRRLAGYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEVTIDFI 501
Query: 462 MLLPDVTVSPCPAPYAPLL 480
LPD TVSPCPAPYAPLL
Sbjct: 502 NQLPDHTVSPCPAPYAPLL 520
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/468 (78%), Positives = 393/468 (83%), Gaps = 44/468 (9%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
HY+AASEDLQ AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227
Query: 225 KREYNELDTVWFMAGSLFQ-------------------------------KVSLARRFAL 253
KREYNELDTVWFMAGSLFQ KVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287
Query: 254 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 313
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 347
Query: 314 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 373
DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV G+E+EA+SDMLPYLRLGY
Sbjct: 348 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 407
Query: 374 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 408 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 467
Query: 434 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 481
PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCPAPY PLL
Sbjct: 468 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/517 (72%), Positives = 409/517 (79%), Gaps = 53/517 (10%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME +C +K ISP L++ S +SIS P S RR F+
Sbjct: 1 MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45
Query: 61 DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
TLV G R EVVSK+ E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46 -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224
Query: 235 WFMAGSLFQ-------------------------------KVSLARRFALVPLGPPLLAY 263
WFMAGSLFQ KVSLARRFALVPLGPPLLAY
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAY 284
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
SS CKAML AVD AV+LVVDRPYKAGE IVVWCGPQPNSKLL+NYGFVDEDNPYDR+ VE
Sbjct: 285 SSNCKAMLTAVDGAVELVVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDEDNPYDRIAVE 344
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 383
AALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLGYVSD SEMQSV
Sbjct: 345 AALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLGYVSDPSEMQSV 404
Query: 384 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 443
ISS GP+CPVSPCME+AVLDQL YF+ RLAGY ++SEDE ML D NL+PKKRVATQLV
Sbjct: 405 ISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNLNPKKRVATQLV 464
Query: 444 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
R+EKKML ACLQ T D+I LPD T+ PCPAPYAPLL
Sbjct: 465 RLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLL 501
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 714 bits (1842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/427 (81%), Positives = 371/427 (86%), Gaps = 31/427 (7%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1 MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61 IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ--------------------- 243
LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQ
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 244 ----------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 293
KVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIV
Sbjct: 181 AIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIV 240
Query: 294 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 353
VWCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV
Sbjct: 241 VWCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHV 300
Query: 354 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 413
G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RL
Sbjct: 301 SVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRL 360
Query: 414 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 473
AGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I LPD TVSPCP
Sbjct: 361 AGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCP 420
Query: 474 APYAPLL 480
APY PLL
Sbjct: 421 APYTPLL 427
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/476 (72%), Positives = 390/476 (81%), Gaps = 33/476 (6%)
Query: 36 RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
R P+F S RR+ R S+SDTLVA + + K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27 RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LK+KPS ++ +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85 VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQK------------------------------- 244
EI++R EGI++EYNELDTVWFM+GSLFQ+
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264
Query: 245 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 304
VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KL
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKL 324
Query: 305 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 364
L NYGFVDEDN DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+ G+E+EA+SD
Sbjct: 325 LTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSD 384
Query: 365 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 424
M+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF RLA YP TL+EDE
Sbjct: 385 MIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDE 444
Query: 425 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
+MLTD +L+PK+RVATQLVR+EKKML+ACLQ D+I LPD +VSPCPAPYAP L
Sbjct: 445 SMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPAPYAPSL 500
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/520 (69%), Positives = 403/520 (77%), Gaps = 47/520 (9%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQK-------------------------------VSLARRFALVPLGPPL 260
DTVWFMAGSLFQ+ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 261 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 354 IVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 413
Query: 381 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 440
QSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT
Sbjct: 414 QSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 473
Query: 441 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 474 RLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSL 513
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 696 bits (1796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/520 (69%), Positives = 402/520 (77%), Gaps = 47/520 (9%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQK-------------------------------VSLARRFALVPLGPPL 260
DTVWFMAGSLFQ+ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 261 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+VEAALNTE PQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 354 IVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 413
Query: 381 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 440
QSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT
Sbjct: 414 QSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 473
Query: 441 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 474 RLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSL 513
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/521 (68%), Positives = 401/521 (76%), Gaps = 45/521 (8%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V ++ + SSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVAG +V SKKE +D DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56 DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235
Query: 232 DTVWFMAGSLFQK-------------------------------VSLARRFALVPLGPPL 260
DTVWFMAGSLFQ+ V LARRFALVPLGPPL
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295
Query: 261 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 296 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRI 355
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 356 IVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 415
Query: 381 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 440
QSVISS GP+C +SPCMERAVLDQLA+YF RL+GYP T ED+A+L D +L P+KRVAT
Sbjct: 416 QSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 475
Query: 441 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 481
+LV++EKK+L ACL T D++ LPD +SPCPAPYAP L
Sbjct: 476 RLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/432 (77%), Positives = 367/432 (84%), Gaps = 31/432 (7%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ---------------- 243
W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQ
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 244 ---------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 288
KVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 348
G+ IVVWCGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSV
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSV 369
Query: 349 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 408
QVFHV+AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCMERA LDQLADY
Sbjct: 370 QVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCMERAALDQLADY 429
Query: 409 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 468
FKARLAGYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD T
Sbjct: 430 FKARLAGYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQATTDFINQLPDHT 489
Query: 469 VSPCPAPYAPLL 480
+SPCPAPYAPLL
Sbjct: 490 ISPCPAPYAPLL 501
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/433 (76%), Positives = 366/433 (84%), Gaps = 32/433 (7%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ---------------- 243
WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQ
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 244 ---------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 288
KVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 348
G+ IVVWCGPQPNSKLLINYGFVDE+N DRL+VEAALNTEDPQYQDKRMVAQRNGKLSV
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSV 369
Query: 349 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 408
QVFHV+AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCMERA LDQLADY
Sbjct: 370 QVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAALDQLADY 429
Query: 409 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 467
FKARLAGYP L+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I LPD
Sbjct: 430 FKARLAGYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQATIDFINQLPDH 489
Query: 468 TVSPCPAPYAPLL 480
T+SPCPAPYAPLL
Sbjct: 490 TISPCPAPYAPLL 502
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/476 (67%), Positives = 373/476 (78%), Gaps = 38/476 (7%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91 MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL Y TGSP K
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210
Query: 216 EILERAEGIKREYNELDTVWFMAGSLF-------------------------------QK 244
+LER GIKREY ELDTVWFMAGSLF Q
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQN 270
Query: 245 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 304
V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++L
Sbjct: 271 VNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARL 330
Query: 305 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 364
L+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++ GREKEA+ D
Sbjct: 331 LLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLD 390
Query: 365 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 424
MLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLAGYP T SEDE
Sbjct: 391 MLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLAGYPTTASEDE 450
Query: 425 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
+L D +PK++VATQLV +EKKMLN+CL ++I LPD+ V+PCP+PY+P+L
Sbjct: 451 IVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPSPYSPIL 506
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 329/489 (67%), Positives = 362/489 (74%), Gaps = 65/489 (13%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQK----------------------------------------------- 244
DTVWFMAGSLFQ+
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293
Query: 245 --VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNA 353
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 362
KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V G+E+EA+
Sbjct: 354 KLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAV 413
Query: 363 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 422
DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF RL+GYP T E
Sbjct: 414 QDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKE 473
Query: 423 DEAMLTDYN 431
D+A+ N
Sbjct: 474 DDALEASCN 482
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 294/432 (68%), Positives = 336/432 (77%), Gaps = 34/432 (7%)
Query: 84 WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ +GLPP KV + E+P S K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85 WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF------------------ 242
+E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264
Query: 243 -------------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 289
QKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAG
Sbjct: 265 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAG 324
Query: 290 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 349
E I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q
Sbjct: 325 EPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQ 384
Query: 350 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 409
F V G+EKE IS+MLPYLRLGY+SD EMQ ++SS G CPVSPC ERAVLDQL Y
Sbjct: 385 KFQVCVGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYL 444
Query: 410 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 469
K+RLAGYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD TV
Sbjct: 445 KSRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTV 504
Query: 470 SPCPAPYAPLLN 481
SPCPA YAP L
Sbjct: 505 SPCPALYAPNLK 516
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 600 bits (1546), Expect = e-169, Method: Compositional matrix adjust.
Identities = 291/434 (67%), Positives = 339/434 (78%), Gaps = 34/434 (7%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ-------------- 243
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQ
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250
Query: 244 -----------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 286
KVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 251 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 310
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL
Sbjct: 311 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKL 370
Query: 347 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 406
++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 371 AIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLV 430
Query: 407 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 466
Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD
Sbjct: 431 GYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPD 490
Query: 467 VTVSPCPAPYAPLL 480
TVSPCPAP+AP L
Sbjct: 491 HTVSPCPAPFAPEL 504
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 600 bits (1546), Expect = e-169, Method: Compositional matrix adjust.
Identities = 309/519 (59%), Positives = 365/519 (70%), Gaps = 54/519 (10%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 14 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 65 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244
Query: 234 VWFMAGSLF-------------------------------QKVSLARRFALVPLGPPLLA 262
+WFMAGSLF QKVSLARRFALVPLGPPLL
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 304
Query: 263 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 305 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 364
Query: 323 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 382
EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRLGY+SD EMQ
Sbjct: 365 EASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQC 424
Query: 383 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 442
++SS G CPVSPC ERAVLDQL Y K+RLAGYP L EDEAML D +L PKK VAT+L
Sbjct: 425 ILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGSLEPKKEVATRL 484
Query: 443 VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 481
VR+EKKML+ CL+ + I LPD TVSPCPA YAP L
Sbjct: 485 VRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 599 bits (1545), Expect = e-169, Method: Compositional matrix adjust.
Identities = 291/434 (67%), Positives = 339/434 (78%), Gaps = 34/434 (7%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ-------------- 243
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQ
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254
Query: 244 -----------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 286
KVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 255 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 314
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL
Sbjct: 315 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKL 374
Query: 347 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 406
++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL
Sbjct: 375 AIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLV 434
Query: 407 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 466
Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ + I LPD
Sbjct: 435 GYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPD 494
Query: 467 VTVSPCPAPYAPLL 480
TVSPCPAP+AP L
Sbjct: 495 HTVSPCPAPFAPEL 508
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 289/431 (67%), Positives = 337/431 (78%), Gaps = 34/431 (7%)
Query: 84 WMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ GLPP KV ++E+P N K P+ YVAA DLQAGD AF VP SLVVTLERVL
Sbjct: 81 WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ----------------- 243
+E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQ
Sbjct: 201 TESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFK 260
Query: 244 --------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 289
KVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAG
Sbjct: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAG 320
Query: 290 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 349
E I++WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q
Sbjct: 321 EPIIIWCGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQ 380
Query: 350 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 409
F+V+ G+EK+ +++MLPYLRLGY+SD EMQS++SS G CP+SPC ERAVLDQL Y
Sbjct: 381 NFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYL 440
Query: 410 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 469
++RLAGYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T + I LPD TV
Sbjct: 441 ESRLAGYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTV 500
Query: 470 SPCPAPYAPLL 480
SPCPAPYAP L
Sbjct: 501 SPCPAPYAPEL 511
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 295/457 (64%), Positives = 344/457 (75%), Gaps = 34/457 (7%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
SSS+ A V E W+ GLPP KV ++E+P + K +P+ YV+
Sbjct: 55 SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A DLQAGD AF V SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234
Query: 235 WFMAGSLFQ-------------------------------KVSLARRFALVPLGPPLLAY 263
WFMAGSLFQ KVSLARRFALVPLGPPLL Y
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTY 294
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDEDNP+DR+ +E
Sbjct: 295 RSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDEDNPFDRVAIE 354
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 383
A+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY+S+ EMQS+
Sbjct: 355 ASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISNPDEMQSI 414
Query: 384 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 443
+SS G CPVSPC ERAVLDQL Y ++RLAGYP TL EDEAML D NL PKK VAT+LV
Sbjct: 415 LSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLV 474
Query: 444 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
R+EKKML+ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 475 RLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 276/393 (70%), Positives = 315/393 (80%), Gaps = 31/393 (7%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262
Query: 239 GSLFQ-------------------------------KVSLARRFALVPLGPPLLAYSSKC 267
GSLFQ KVSLARRFALVPLGPPLL Y S C
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNC 322
Query: 268 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 327
KAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN
Sbjct: 323 KAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLN 382
Query: 328 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 387
EDPQ+Q+KRMVAQRNGKL++Q FHV G+EKE I++MLPYLRLGY+SD EMQS++SS
Sbjct: 383 IEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSE 442
Query: 388 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 447
G CPVSPC ERAVLDQL Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EK
Sbjct: 443 GDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEK 502
Query: 448 KMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
K+L+ CLQ + I LPD TVSPCPAP+AP L
Sbjct: 503 KLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 535
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 282/396 (71%), Positives = 314/396 (79%), Gaps = 49/396 (12%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1 MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61 VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYS 264
LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQ+ P P A+
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQ---------YPYDIPTEAFP 171
Query: 265 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 324
+ + + + A +S VV ++A
Sbjct: 172 FE--------------IFKQAFVAIQSCVVH--------------------------LQA 191
Query: 325 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 384
ALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQSVI
Sbjct: 192 ALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQSVI 251
Query: 385 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 444
SS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQLVR
Sbjct: 252 SSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQLVR 311
Query: 445 MEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
+EKK+LNACLQ TAD+I LPD++VSPCPAPYAP+L
Sbjct: 312 LEKKILNACLQATADLINQLPDLSVSPCPAPYAPIL 347
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 530 bits (1365), Expect = e-148, Method: Compositional matrix adjust.
Identities = 278/481 (57%), Positives = 330/481 (68%), Gaps = 41/481 (8%)
Query: 38 PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
P FG+ V + R S ++ T V E SKK+E DLK WM + GLP
Sbjct: 45 PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104
Query: 93 CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
CKV L E +PS +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLF---------------------------- 242
S K +LER GIKREY ELDTVWFMAGSLF
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284
Query: 243 ---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 299
Q VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VWCGPQ
Sbjct: 285 VHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQ 344
Query: 300 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 359
PNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F ++ G+E
Sbjct: 345 PNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQIYKGKEM 404
Query: 360 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 419
EA+ DMLPY+RL +++D EM++V + GP+CPVS C ERAVL+QL YF+ RLAGY ++
Sbjct: 405 EAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRLAGYKSS 464
Query: 420 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 479
+ + D + KKRVA +L+ +EK +L L ++I LPD +SPC PY P
Sbjct: 465 HATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCIGPYLPN 521
Query: 480 L 480
L
Sbjct: 522 L 522
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 266/432 (61%), Positives = 313/432 (72%), Gaps = 38/432 (8%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3 WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 60 TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF-------------------- 242
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179
Query: 243 -----------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 291
Q VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239
Query: 292 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
I+VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F
Sbjct: 240 ILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNF 299
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 411
+ GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+
Sbjct: 300 QIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQE 359
Query: 412 RLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 469
R+A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V LPD +V
Sbjct: 360 RIAKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSV 419
Query: 470 S-PCPAPYAPLL 480
+ PC + P L
Sbjct: 420 APPCCGDFVPKL 431
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 266/432 (61%), Positives = 313/432 (72%), Gaps = 38/432 (8%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3 WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 60 TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF-------------------- 242
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179
Query: 243 -----------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 291
Q VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239
Query: 292 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
I+VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F
Sbjct: 240 ILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNF 299
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 411
+ GREKEA+ +MLPYLRLG+VSD M+SV S+ GP CPVS C ERAVLDQLA YF+
Sbjct: 300 QIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQE 359
Query: 412 RLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 469
R+A Y T+ ED A+L D +++PK+RVATQL+ +EK++L+ L V LPD +V
Sbjct: 360 RIAKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSV 419
Query: 470 S-PCPAPYAPLL 480
+ PC + P L
Sbjct: 420 APPCCGDFVPKL 431
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 254/429 (59%), Positives = 299/429 (69%), Gaps = 36/429 (8%)
Query: 85 MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
M + GLP C V L E + + +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1 MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS
Sbjct: 61 TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF-------------------- 242
EL Y TGS K +LER GIKREY ELDTVWFMAGSLF
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180
Query: 243 -----------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 291
Q VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD YK G+
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDP 240
Query: 292 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
I VWCGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F
Sbjct: 241 IAVWCGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTF 300
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 411
++ G+E EA+ DMLPY+RLG+++D E+++V + P+C VS C ERAVL+Q+ +F+
Sbjct: 301 QIYKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFER 360
Query: 412 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSP 471
RLAGY S D D K+ VA +L+ +EK +L L ++I LPD +SP
Sbjct: 361 RLAGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISP 417
Query: 472 CPAPYAPLL 480
C PY P L
Sbjct: 418 CIGPYLPNL 426
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 247/426 (57%), Positives = 296/426 (69%), Gaps = 54/426 (12%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 9 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 60 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239
Query: 234 VWFMAGSLF-------------------------------QKVSLARRFALVPLGPPLLA 262
+WFMAGSLF QKVSLARRFALVPLGPPLL
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 299
Query: 263 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 300 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 359
Query: 323 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 382
EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V G+EK+ IS+MLPYLRLGY+SD EMQ
Sbjct: 360 EASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQC 419
Query: 383 VISSLG 388
++SS G
Sbjct: 420 ILSSEG 425
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/238 (72%), Positives = 204/238 (85%)
Query: 243 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
QKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS
Sbjct: 14 QKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNS 73
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 362
+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +
Sbjct: 74 RLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTV 133
Query: 363 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 422
++MLPYLRLGY+S+ EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL E
Sbjct: 134 AEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDE 193
Query: 423 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 480
DEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPAPYAP L
Sbjct: 194 DEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 251
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 166/236 (70%), Positives = 201/236 (85%)
Query: 243 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGPQ NS
Sbjct: 14 QKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGPQTNS 73
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 362
+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+EK+ +
Sbjct: 74 RLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKEKQTV 133
Query: 363 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 422
++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL+E
Sbjct: 134 AEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPTTLNE 193
Query: 423 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 478
DEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I LPD TVSPCPA YAP
Sbjct: 194 DEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 166/265 (62%), Positives = 201/265 (75%), Gaps = 29/265 (10%)
Query: 243 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGPQ NS
Sbjct: 14 QKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGPQTNS 73
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 362
+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+EK+ +
Sbjct: 74 RLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKEKQTV 133
Query: 363 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 422
++MLPYLRLGY+SD EMQS++SS G CPVSPC ERAVLDQL Y ++RLAGYP TL+E
Sbjct: 134 AEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPTTLNE 193
Query: 423 DEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKMLNAC 453
DEAM L D +L PK+ VAT+LVR+EKKML+AC
Sbjct: 194 DEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKMLHAC 253
Query: 454 LQVTADMIMLLPDVTVSPCPAPYAP 478
LQ T + I LPD TVSPCPA YAP
Sbjct: 254 LQATNEFITDLPDHTVSPCPAQYAP 278
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 334 bits (856), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 178/372 (47%), Positives = 238/372 (63%), Gaps = 40/372 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ DLQAG+ A +P+ LV+TL+RV +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49 AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
W +I+ELDR +GRGQ+ +SPLLW E ++ YL GSP AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168
Query: 234 VWFMAGSLF-------------------------------QKVSLARRFALVPLGPPLLA 262
VWFMAGSLF Q V L++RFALVPLGPPLL+
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFALVPLGPPLLS 228
Query: 263 YSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
YSS KAML +A VQL VDR Y GE I WCGPQPN +LL+NYG V ++NP+D++
Sbjct: 229 YSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAWCGPQPNRRLLLNYGIVTDNNPHDKM 288
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+ L DP +Q KR V Q+N + Q F + R+K +LPYLRL + +D + +
Sbjct: 289 ALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYLRLAHCTDAASL 346
Query: 381 QSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 438
+ +++L C P+SP ER VL QLA + + RL Y T EDE ++ P+++V
Sbjct: 347 K--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVIIRSTTAGPRQKV 404
Query: 439 ATQLVRMEKKML 450
A +L+R+EK +L
Sbjct: 405 AARLLRIEKAIL 416
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 173/404 (42%), Positives = 239/404 (59%), Gaps = 40/404 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ W+ ++G P KV L+ + + A+E LQ GD A +P L+VTL+RVL
Sbjct: 9 MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ T+AEL+TT KLSELACL LYL YEKK+GK+ W +I+ELDR +GRG +SPLLW
Sbjct: 66 EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF----------------- 242
E + A L GSP EI R +GI++EY ELDTVW++AGSLF
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185
Query: 243 --------------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 286
Q V+L +RFALVP+GPPLL YSS KAML ++ V+L VDR Y
Sbjct: 186 RQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAY 245
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
+ GE+++ WCGPQPNS+LLINYG VDE NPYD+L + + ++DP Y+ KR G
Sbjct: 246 QPGEAVLAWCGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLS 305
Query: 347 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 406
+ Q F + A A +LPYLRL + + ++++ V PV+P E VL+QL
Sbjct: 306 TQQTFQLQAAASLPA--QLLPYLRLVHSTREADVEGVKWE-EEAGPVAPENELTVLNQLI 362
Query: 407 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+ + R + Y T+ EDEA++ D P+ VA +L+++EK +L
Sbjct: 363 THLRLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 284 bits (727), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 174/408 (42%), Positives = 229/408 (56%), Gaps = 45/408 (11%)
Query: 80 DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+L W+ +NG + +V + PS RP+ V A L AG+ A SVP L +TL+
Sbjct: 43 ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
R+ +E +AELLTT+KLSELACLALYLMYEKK KKSFW PYI+ELD+Q+ RG A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQK------------ 244
LLW + EL + L GSP + +R GI++EY LDTVWFMAGSLF K
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219
Query: 245 -------------------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVD 283
V +A+RFALVPLGPPL+AYSS K M+ +D +V+LVV
Sbjct: 220 ELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVS 279
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQ 341
P +AG + WCGPQPNS+LL+NYG VDE NP+D+L L T DP + KR V
Sbjct: 280 GPVEAGRPVAAWCGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLS 339
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 401
G + Q F V R +LPY+ L + ++ SV S +E A
Sbjct: 340 EAGLATQQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDTAGHDRELEAAA 395
Query: 402 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 449
L L Y + R A Y L D ++ D + P+++VA +L ++EK +
Sbjct: 396 LAALMAYVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 79/98 (80%), Positives = 88/98 (89%)
Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 1 ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQK 244
YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQ+
Sbjct: 61 YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQ 98
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 115/441 (26%), Positives = 196/441 (44%), Gaps = 63/441 (14%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
R S + + +G V ++ W+ +NG+ P ++++ E+ + +E
Sbjct: 58 RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116
Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
+ +S + G+ +P L+ T R L ET + + E + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
YEK +G SF+ PY+ L L +PL LWS +L L GSPT + + + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228
Query: 225 KREYN------------------ELDTVWFMAGSLFQKVSL---ARRFALVPLGPPLLAY 263
REY + + G LF + ++R ALVP LL +
Sbjct: 229 LREYTYLGKNIIPQIPNFASKPIDFKQFQWAFGILFSRAICFPSSKRIALVPYAD-LLNH 287
Query: 264 SSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 315
S C A + V +AV + VDR Y+ E + V GP+ N +LL+ YGF E N
Sbjct: 288 SPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQELLLLYGFSLERN 346
Query: 316 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 375
P+D + + L+ DP Y +K + + GK +Q F ++ R +M +LR +
Sbjct: 347 PFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYMDRYP---VEMAEFLRFCCID 403
Query: 376 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YN 431
+++Q+ ++ VS E + LD+L +Y +L YP +L +DE ++ D
Sbjct: 404 TETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRKYPTSLEDDEKIIRDRAMFQT 458
Query: 432 LHPKKRVATQLVRMEKKMLNA 452
L +R+A + EK++L+A
Sbjct: 459 LEKNQRMAIRQRLGEKRILHA 479
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 111/402 (27%), Positives = 177/402 (44%), Gaps = 46/402 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L L W+ K G P VIL E+ + AA D + G+ A +P + VT
Sbjct: 82 LHQLSEWLSKQGFPTQDVILT---GFGEEGVGL---AAGRDFKEGEVALKIPENYTVTGV 135
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V+ + +A ++ L L+LMYE+ G+KS W PY++ SP
Sbjct: 136 DVVNHPVVAAPAAGR--GDVIGLTLWLMYERSLGEKSVWYPYLQTFPS-------TTLSP 186
Query: 198 LLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQK------------ 244
+LW+ E + L GSP E+ +R+ ++ EY +L + + F +
Sbjct: 187 ILWTAEEQQKLLKGSPALEEVQQRSAALEGEYEDLQSYFTKDPQAFPQEYFSLEAFKSAF 246
Query: 245 -VSLARR--------FALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIV 293
V L+R FALVP L + + +A L + D AV VDR YK GE +
Sbjct: 247 SVILSRAVYLPSADLFALVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVF 305
Query: 294 VWCG-PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 352
G + N+ LLI YGFVDE+N D L +E L D K+ + Q+ S Q F
Sbjct: 306 TSYGRERSNADLLITYGFVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFP 365
Query: 353 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 412
++ R + +L Y+RL + D + ++ + + E L L + +
Sbjct: 366 LYLDRFP---TQLLTYMRLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTK 420
Query: 413 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
L Y + ++ +L + + ++RVA QL EKK+L + +
Sbjct: 421 LGNYEGGVDDEIRLLKNKKISQRERVAAQLRLCEKKILTSTM 462
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 181/427 (42%), Gaps = 58/427 (13%)
Query: 81 LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
LK WM +NG+ P + + NE + A+ +++ GD F++P
Sbjct: 92 LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
L++T + E A+++T + LSE +AL ++EK +GK+SFW YI L
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207
Query: 190 GQLAVESPLLWSETELAYLTGSPTKA-------------------------EILERAEGI 224
V LW+E +LA L GSP A EIL R
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVENDLLDKFPEILPREVHT 264
Query: 225 KREYNELDTVWFMAGSLFQKVSLARRFALVPLG------PPLLAY-SSKCKAMLAAVDDA 277
E+ + F +S ALVP P +Y ++ + + + D
Sbjct: 265 YEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLFFSKTDE 324
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
V + DR YK E + + GP+ NS LL+ YGF + NPY+ + V +L+ D Y+ K+
Sbjct: 325 VVVYADRSYKKMEQVYISYGPKGNSDLLLLYGFSLDRNPYNSVDVTVSLDENDELYERKK 384
Query: 338 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 397
G + F ++ R + ++L YLRL ++ + L +
Sbjct: 385 AFLSEAGLPPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLRGRTLEDLSFEKKQTDVN 441
Query: 398 ERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YNLHPKKRVATQLVRMEKKMLNAC 453
E VLD L + KA +AGYP T +D ++ D L +R+A + R EK +L
Sbjct: 442 ELMVLDSLVEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQRMAVKHRRQEKVILRRT 501
Query: 454 L-QVTAD 459
+ VT D
Sbjct: 502 IAAVTKD 508
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 174/397 (43%), Gaps = 42/397 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153
Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTV-----------WFMAGSLFQKVS 246
W + E L GS A++ ++ +++ ++ L DT+ F A V
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTLKDNKDFPMEKFTFSAFKAAFSVV 213
Query: 247 LARR--------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCG 297
L+R FALVP G + SS+ + V+L VD+ YK G+ + +
Sbjct: 214 LSRAVYLPSAELFALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQ 273
Query: 298 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 357
++ LI YGF+DE + D + +E L + D KR + Q G Q F V+ R
Sbjct: 274 NLTSADFLIRYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNR 333
Query: 358 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
+ +L Y RL + D+ + I V E L L + +L +
Sbjct: 334 FP---TQLLTYTRLARIQDSGLFAKITFEKDLI--VCQTNEYETLMLLMADCRTKLLSFS 388
Query: 418 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
T+ +D L NL K+RVA QL EK++L +
Sbjct: 389 DTMEDDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 140/307 (45%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ TEL YL S + +ER + Y +L +T
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKEVFNDETFK 238
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ RFALVP +L ++ + + L V DRPY+ G
Sbjct: 239 WSFGILFSRLVRLPSMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFTTDRPYQPG 297
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 298 EQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLST 357
Query: 348 VQVFHVH 354
Q F V
Sbjct: 358 PQCFPVR 364
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 140/307 (45%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ TEL YL S + +ER + Y +L +T
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKEVFNDETFK 238
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ RFALVP +L ++ + + L V DRPY+ G
Sbjct: 239 WSFGILFSRLVRLPSMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFTTDRPYQPG 297
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 298 EQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLST 357
Query: 348 VQVFHVH 354
Q F V
Sbjct: 358 PQCFPVR 364
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 140/307 (45%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ TEL YL S + +ER + Y +L +T
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPHLFPKEVFNDETFK 238
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ RFALVP +L ++ + + L V DRPY+ G
Sbjct: 239 WSFGILFSRLVRLPSMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFTTDRPYQPG 297
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 298 EQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDALKKHGLST 357
Query: 348 VQVFHVH 354
Q F V
Sbjct: 358 PQCFPVR 364
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 169/393 (43%), Gaps = 40/393 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + L
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTV-----------WFMAGSLFQKVSL 247
E + L GS A++ ++ +++ ++ L DT+ F A V L
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTLKDNKDFPMEKFTFSAFKTAFSVVL 214
Query: 248 ARR--------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGP 298
+R FALVP G + SS+ + V+L VD+ YK G+ + +
Sbjct: 215 SRAVYLPSAELFALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQN 274
Query: 299 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 358
++ LI YGF+DE + D + +E L + D KR + Q G Q F ++ R
Sbjct: 275 LTSADFLIRYGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR- 333
Query: 359 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 418
+ +L Y RL + D+ + I VS E L L + +L
Sbjct: 334 --FPTQLLTYTRLARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLMADCRTKLLSSSD 389
Query: 419 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
T+ ++ L NL K+RVA QL EK++L
Sbjct: 390 TMEDEMQTLRRKNLSYKQRVAAQLRLKEKRILT 422
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 167/392 (42%), Gaps = 62/392 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DL W+ NGL K+ L+ + + A+E+++ G+A V S ++T+ER
Sbjct: 66 DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVW--FMAGSL-------FQKV- 245
L W E E+ L GSP++ ER E + E+ + + G+L F ++
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRSSFPDITEGALRWAFDILFSRLI 230
Query: 246 ---SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
++ ALVP +L + C A + AV L DR Y AGE + G +P+S
Sbjct: 231 RLDAMGGELALVPWAD-MLNHKPGCAAFIDLNGSAVNLTTDRAYAAGEQVWASYGQRPSS 289
Query: 303 KLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH-AGREK 359
+LLI+YGF E +NP D + ++ DP Q K V +R G V+ F + G +
Sbjct: 290 ELLISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRRMGLSPVETFPLRLNGYPR 349
Query: 360 EAI-------------SDMLPYLRLGYVSDTSEMQSVISSL------------GPICPVS 394
+ + S++ R + + QS+ S+ G I
Sbjct: 350 QLLQYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVRGLAQGQARGKQGVILGGV 409
Query: 395 PCMERAVLDQLADYFKARLAGYPATLSEDEAM 426
P E AV + LAD L+ YP +L +D+ +
Sbjct: 410 PG-EIAVREMLADMCAEALSAYPNSLEKDKGI 440
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 115/453 (25%), Positives = 185/453 (40%), Gaps = 85/453 (18%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+DL L+ W+ KNGL + E ++ + V A +D + G+ VP L+ T
Sbjct: 66 DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122
Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + N + +LL +K S CLAL+L+ EK SFW PYI+ L + G
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-------------------DT 233
+ L ++ EL L GSPT ++ + +Y + +
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIHDLFQIRKDILHINAFTWDEF 233
Query: 234 VWFMA--GSLFQKV-----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 286
+W M+ GS +V + +AL+P + D+ + R Y
Sbjct: 234 IWAMSAVGSRQNQVPQWGHNALSEYALIPAWDMCNHDHGDLQTFWDVNSDSTESHAMRAY 293
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGK 345
K GE + ++ GP+PNS LL++ GFV E+N +D L + L + +DK R++ N K
Sbjct: 294 KKGEQVYIFYGPRPNSDLLLHAGFVYENNRFDALAIRVRLAPDAEHIKDKLRLLHLNNMK 353
Query: 346 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS------------------- 386
+ Q + G D++ +LR+ + + E+Q V+ +
Sbjct: 354 MDSQYYLYGLG----LAVDLMAFLRI-HAMNEQELQQVLGAYDQQEAKVHNGEHPASNGE 408
Query: 387 ------LGPICPVSPCMERAVLDQLADYFKARLAGYPATLS---------EDEAMLTDYN 431
P ++ E A L + L+ YP TL ED+A L +
Sbjct: 409 VVASGVFDPRVKLNDRNELAALQLAEAKCLSLLSLYPTTLQVANGVELKQEDQAALRTTS 468
Query: 432 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 464
L P R T L EK++LN L D I LL
Sbjct: 469 LTPNMRAVTLLRLKEKEILNRTL----DAIRLL 497
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 166/377 (44%), Gaps = 72/377 (19%)
Query: 9 SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
S+ + PP+R P H P S +S+ S R + R IR S++
Sbjct: 4 STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47
Query: 65 AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
RE + + E L+ W+ +GLP ++ + ++ E+ + A
Sbjct: 48 PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101
Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
++++ G+ VP SLV+T + G + E++ N + + +A YL+ E S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL---- 231
YI L RQ S L W+ EL AYL SP + ++R + YN+L
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214
Query: 232 ---------------DTVWFMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLA 272
+T + G LF ++ S+ + ALVP +L +S + + L
Sbjct: 215 FSRHSDLFPEEVYNIETFLWSFGILFSRLVRLPSMDEKVALVPWA-DMLNHSPEVETFL- 272
Query: 273 AVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 327
D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ NP D + + +L+
Sbjct: 273 DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLD 332
Query: 328 TEDPQYQDKRMVAQRNG 344
D Y++K +RNG
Sbjct: 333 KSDKCYKEKLQALKRNG 349
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/417 (24%), Positives = 171/417 (41%), Gaps = 44/417 (10%)
Query: 68 REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
R E+ L L WM + G L P I+++ V A D+ G+
Sbjct: 2 RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54
Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
+P + + TLER +G ++L N + LA +L+ EK +G SFW P++
Sbjct: 55 QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL----------- 231
L + PL +SE E A + GS + + + + ++EY +L
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQLERLVEVQRQSFEQEYAQLREKLPEYERFG 165
Query: 232 --DTVWF---MAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDR 284
+ VW + LF + +LVPL + + + + +D +++ R
Sbjct: 166 FEEYVWARISLYSRLFSLKGGLQGPSLVPLSD-MFNHRQPPDVLWSTSEDGQTFRMIAQR 224
Query: 285 PYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
AG I G + + L++ GFV D + D + + L DP K+ +
Sbjct: 225 AVPAGTEIHTHYGAKSSDVFLLHSGFVPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLA 284
Query: 344 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM---QSVISSLGPICPVSPCMERA 400
+ F V + A + +LR+ + S + ++S I PVS E
Sbjct: 285 SATAKHPFKVSRQGKYLASWSVFSFLRMAHASPDEFLALSNRLLSGTKTIAPVSVACEER 344
Query: 401 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 457
VL LA + RL +P TL EDE +L + L P +R L R EK++L L++T
Sbjct: 345 VLGTLAAACEERLKAFPTTLEEDERLLREGPLSPNERSCVLLRRQEKRLLGDYLELT 401
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 107/417 (25%), Positives = 179/417 (42%), Gaps = 59/417 (14%)
Query: 49 RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
R R R+++ L+ AGS E+ + E + +LK+W+ + GLPP K+ P
Sbjct: 26 RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
+ A++ + G++ S+P LV+T L + LL L + LAL+
Sbjct: 84 GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L ++ G W PY+R L + G L WSE E+ +L GS ++ LE
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190
Query: 224 IKREYNELDTV----------------------WFMA---GSLFQKVSLARRFALVPLGP 258
+ + E+ V W A L + L + AL+P
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHGAFGRAQLQWAFAVLLSRLVRLAGLGDQEALLPWA- 249
Query: 259 PLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 314
LL + + L +A + AV L +R Y+AGE +++ G + + +LL++YGF +
Sbjct: 250 DLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGFCPDLGS 309
Query: 315 NPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY--LR 370
NP+D RL++E L D K +++G + Q+F + R A +++ Y
Sbjct: 310 NPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL---RMAAAPFELVHYTAFS 364
Query: 371 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
V E + + L + P ++ A L+ + KA LA YP + D A L
Sbjct: 365 AAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAALAAYPRSFDGDRAEL 421
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 138/307 (44%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ V SLV+
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ TEL YL S + +ER + Y +L +T
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKEVFNDETFK 238
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ RFALVP +L ++ + + L V DRPY+ G
Sbjct: 239 WSFGILFSRLVRLPSMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPG 297
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV + NP D + + +L D Y++K +++G +
Sbjct: 298 EQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLST 357
Query: 348 VQVFHVH 354
Q F V
Sbjct: 358 PQCFPVR 364
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 136/297 (45%), Gaps = 42/297 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ EL AYL SP + ++R + YN+L +T
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFL 237
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ R ALVP +L +S + + L + DR Y+ G
Sbjct: 238 WSFGILFSRLVRLPSMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
E + + G + + +LL++YGFV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 297 EQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNG 353
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 134/304 (44%), Gaps = 42/304 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + +K E+ + A ++++ G+ VP SLV+
Sbjct: 71 ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ EL YL S + +ER + YN+L +T
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRLRIFSKYPHLFPEEVFNMETFK 237
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ R ALVP +L +SS+ + L V DRPY+ G
Sbjct: 238 WSFGILFSRLVRLPSMDGRVALVPWAD-MLNHSSEVETFLDYDKSSKGVVFTTDRPYQPG 296
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV + NP D + + +L D Y++K +++G
Sbjct: 297 EQVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLSV 356
Query: 348 VQVF 351
Q F
Sbjct: 357 SQCF 360
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 141/320 (44%), Gaps = 48/320 (15%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ K GL K++L S + A++ L+ G+ VP+ L++T +
Sbjct: 16 LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G ++ L E LA++L+ E + + S W PY L + S L W
Sbjct: 70 GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMA-------------------GS 240
+E E+ +LT SP + + LE + Y +L F+ G
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDLRATIFLKHPEVFPSQVYTLAAFKWAFGI 182
Query: 241 LFQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESI 292
LF + L R + ALVP +L +S + + L + +V V DR Y++GE +
Sbjct: 183 LFSR--LVRLPSVGKLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQV 239
Query: 293 VVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
+ G + + +L + YGF+ E N +D + +E ++++DP ++ K A G S Q F
Sbjct: 240 FISYGKRSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQRF 299
Query: 352 HVHAGREKEAISDMLPYLRL 371
V R+ + +L Y RL
Sbjct: 300 PV---RKDGFPAQLLAYARL 316
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 138/307 (44%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 67 ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------DTVW------ 235
S L W++ EL YL S + +ER + YN+L D V+
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRLRIFSKYPDLFPDEVFNIESFK 233
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ ALVP +L +S + L + DRPY+ G
Sbjct: 234 WSFGILFSRLVRLPSMGGNVALVPWAD-MLNHSCDVETFLDYDKTSKGIVFTTDRPYQPG 292
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV ++ NP D + + +L D Y++K + + G +
Sbjct: 293 EQVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLELLKNYGLSA 352
Query: 348 VQVFHVH 354
Q F +
Sbjct: 353 SQCFPIQ 359
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/308 (27%), Positives = 140/308 (45%), Gaps = 44/308 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 65 ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ EL YL S + +ER + YN+L +T
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRLRIFSKHPHLFPEEVFNMETFK 231
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKA 288
+ G LF ++ S+ + ALVP +L +S + + L D + Q VV DR Y+
Sbjct: 232 WSFGILFSRLVRLPSMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRTYQP 289
Query: 289 GESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
E + + G + N +LL++YGFV + NP D++ + +L D Y++K +++G
Sbjct: 290 SEQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKHGLS 349
Query: 347 SVQVFHVH 354
+ Q F +
Sbjct: 350 TSQCFPIQ 357
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 135/297 (45%), Gaps = 42/297 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ EL AYL SP + ++R + YN+L +T
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFL 237
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ R LVP +L +S + + L + DR Y+ G
Sbjct: 238 WSFGILFSRLVRLPSMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
E + + G + + +LL++YGFV ++ NP D + + +L+ D Y++K +RNG
Sbjct: 297 EQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNG 353
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 137/303 (45%), Gaps = 44/303 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ NGLP K+ + +K E+ + A ++++ G+ VP SLV+T +
Sbjct: 81 LQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 134
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
E+L + + LA+YL+ E K S W YI L RQ S L W
Sbjct: 135 SCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYW 187
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNE-------------------LDTVWFMAGS 240
+ EL YL S + +ER + YN+ L+T + G
Sbjct: 188 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNLETFKWSFGI 247
Query: 241 LFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIV 293
LF ++ S+ + ALVP +L +S + + L D + Q VV DR Y+ GE +
Sbjct: 248 LFSRLVRLPSMDGKVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQYEPGEQVF 305
Query: 294 VWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
+ G + N +LL++YGFV + NP D + + +L D Y++K +++G + Q F
Sbjct: 306 ISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKHGFSASQCF 365
Query: 352 HVH 354
V
Sbjct: 366 PVR 368
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 138/307 (44%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 65 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L WS+ EL YL S + +ER + YN++ ++
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFK 231
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAG 289
+ G LF ++ S+ + ALVP ++ +S + + L + +V DRPY+ G
Sbjct: 232 WSFGILFSRMVRLPSMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPG 290
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV ++ NP D + + +L D Y++K + ++ G
Sbjct: 291 EQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSG 350
Query: 348 VQVFHVH 354
Q F +
Sbjct: 351 SQCFPIR 357
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 175/427 (40%), Gaps = 74/427 (17%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
S +E++ W+ N + V ++ H + + + A+ D + G+ ++P
Sbjct: 71 SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+++T + N + L+ N++ + LAL+++ E SFWLPY++ L
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVS 246
+ SPL ++ +L L SPT +E++ + I R+Y +F +LFQ
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQY-----AYFF--NLFQGHE 226
Query: 247 LARRF------------------------------------ALVPLGPPLLAYSSKCKAM 270
LA + AL+PL + +
Sbjct: 227 LASKLPIQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCNHTNGQITTD 286
Query: 271 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
+ +D + AG + ++ G + N++LLI+ GFV N DRL + ++ D
Sbjct: 287 FSLKNDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLTIRLGISKND 346
Query: 331 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 390
P + K V R + ++F +H G SD L +LR+ +++ ++++ ++ I
Sbjct: 347 PLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLRTALACRQQI 404
Query: 391 CP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 443
VS ER LA L YP + ED +L +L R+A QL
Sbjct: 405 SKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLSTHARLAVQLR 464
Query: 444 RMEKKML 450
EK +L
Sbjct: 465 HCEKNIL 471
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 138/307 (44%), Gaps = 42/307 (13%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 62 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L WS+ EL YL S + +ER + YN++ ++
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFK 228
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAG 289
+ G LF ++ S+ + ALVP ++ +S + + L + +V DRPY+ G
Sbjct: 229 WSFGILFSRMVRLPSMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPG 287
Query: 290 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
E + + G + N +LL++YGFV ++ NP D + + +L D Y++K + ++ G
Sbjct: 288 EQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSG 347
Query: 348 VQVFHVH 354
Q F +
Sbjct: 348 SQCFPIR 354
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 162/375 (43%), Gaps = 48/375 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PYI L + S + WSE EL+ L GS L E + E+ +L
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 232 -----------------DTVWFMAGSLFQKVSL---ARRFALVPLG------PPLLA--Y 263
D ++ G L + + L+PL P + A Y
Sbjct: 196 EEVLVPHKQLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLANHSPDITAPKY 255
Query: 264 SSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ + + A L + + L P KAG+ +++ + + N++L ++YG + + +
Sbjct: 256 AWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLTESRSERNAYT 315
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSE 379
+ + D Y DK +A+ NG F + E+ ++MLPYLRL + D
Sbjct: 316 LTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRLVALGGEDAFL 373
Query: 380 MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 436
++S+ S G + P+SP E + + D + L+GY T++EDE +L + ++ P+
Sbjct: 374 LESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKLLAEGDIDPRL 433
Query: 437 RVATQLVRMEKKMLN 451
+A + EKK+L
Sbjct: 434 EIAITIRLGEKKVLQ 448
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 130/291 (44%), Gaps = 35/291 (12%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ LP K+ L+ + + A+E+++ G+A VP + ++T+ER
Sbjct: 83 ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVW--FMAGSL-------FQKV- 245
L W E E+ L GSP++ ER + + +E+ + + G+L F ++
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSYFPEITVGALRWAFDILFSRLI 247
Query: 246 ---SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
++ ALVP +L + C A + DAV L DR Y GE + G +P+S
Sbjct: 248 RLDAMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDRSYVKGEQVWASYGQRPSS 306
Query: 303 KLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
+LLI+YGF E +NP D + ++ DP K V + G V+ F
Sbjct: 307 ELLISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETF 357
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/226 (30%), Positives = 110/226 (48%), Gaps = 20/226 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W + ELA+L SPT E +E + IK E++ L+
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119
Query: 234 VWF-MAGSLFQKVSL-----ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRP 285
F + +VSL A +P + C+ L+ D V D+
Sbjct: 120 NVFPWCRDVLGEVSLTDFMHAYSTDQIPFAD-FFNHDHNCQTRLSYDKEKDCAVAVADQD 178
Query: 286 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 331
YKAG+ I + G PNS L ++YGF NP++++ V ++ DP
Sbjct: 179 YKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQVEVPMGVSLTDP 224
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/448 (22%), Positives = 190/448 (42%), Gaps = 64/448 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 72 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL ++E E+ YL + ++ + + R+Y N+L
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKD 236
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 237 SFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 296
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ ++ ++ D Y K V
Sbjct: 297 VALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVL 356
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ G++ + + +
Sbjct: 357 ARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEF 415
Query: 392 PVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 447
PVS E + +L + +AR L Y T+ +D +L ++ +A +L R+EK
Sbjct: 416 PVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEK 471
Query: 448 KMLNACLQVTADMIMLLPDVTVSPCPAP 475
++L L+ +D L + P P
Sbjct: 472 EILEKALKSASDNRKLYSKNSEEGTPLP 499
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/448 (22%), Positives = 190/448 (42%), Gaps = 64/448 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 73 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL ++E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ G++ + + +
Sbjct: 358 ARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 447
PVS E + +L + +AR L Y T+ +D +L ++ +A +L R+EK
Sbjct: 417 PVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEK 472
Query: 448 KMLNACLQVTADMIMLLPDVTVSPCPAP 475
++L L+ +D L + P P
Sbjct: 473 EILEKALKSASDNRKLYSKNSEEGTPLP 500
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 162/379 (42%), Gaps = 52/379 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PYI L + S + WSE EL+ L GS L E + E+ +L
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 232 -----------------DTVWFMAGSLFQKVSL---ARRFALVPLG----------PPLL 261
D ++ G L + + L+PL P +
Sbjct: 196 EEVLVPHKQLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPDIT 255
Query: 262 A--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPY 317
A Y+ + + A L + + L P KAG+ +++ + + N++L ++YG + +
Sbjct: 256 APKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLTESRSER 315
Query: 318 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS-- 375
+ + + D Y DK +A+ NG F + E+ ++MLPYLRL +
Sbjct: 316 NAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRLVALGGE 373
Query: 376 DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
D ++S+ S G + P+SP E + + D + L+GY T++EDE +L + ++
Sbjct: 374 DAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKLLAEGDI 433
Query: 433 HPKKRVATQLVRMEKKMLN 451
P+ +A + EKK+L
Sbjct: 434 DPRLEIAITIRLGEKKVLQ 452
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/308 (27%), Positives = 140/308 (45%), Gaps = 44/308 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ ++ N R + A ++++ G+ VP SLV++
Sbjct: 65 ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E E+L N + + +A YL+ E K S W YI L RQ
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ EL YL S + +ER + YN+L +T
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSIRVFSKHPELFPEEVFNIETFK 231
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKA 288
+ G LF ++ S+ + ALVP +L ++ + + L D A Q VV DR Y+
Sbjct: 232 WSFGILFSRLVRLPSMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVFTTDRAYQP 289
Query: 289 GESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
GE + + G + N +LL++YGFV ++ NP D + + +L D Y++K +++G
Sbjct: 290 GEQVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLR 349
Query: 347 SVQVFHVH 354
+ Q F +
Sbjct: 350 ASQCFPIQ 357
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 137/298 (45%), Gaps = 44/298 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ L EK E+ + A ++++ G+ VP +LV+T
Sbjct: 74 ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N + +++ + + LA YL+ E S W YI L RQ
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------DTVWFMA--- 238
S L W+ TE+ AYL SP + + R + YN+L + V+ M
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRDRIFSKHPDLFPEKVYNMENFR 240
Query: 239 ---GSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKA 288
G LF ++ S+ + ALVP +L +S + A L D + Q +V DR Y+
Sbjct: 241 WSFGILFSRLVRLESMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVFTTDRSYQP 298
Query: 289 GESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
GE + + G + + +LL++YGFV ++ NP D + +L D Y++K +++G
Sbjct: 299 GEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQALKKHG 356
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 127/291 (43%), Gaps = 41/291 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMA-------------------GS 240
+ ++ A+L+ + K L+ ++ +N+L+ FM G
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNREEFPPEVFNLESFKWAFGI 207
Query: 241 LFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVV 294
LF ++ SL ++ AL+P G +L + ++ L + ++ +DR Y++ + + +
Sbjct: 208 LFSRLVRLPSLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNKEVFI 266
Query: 295 WCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
G + N +LL+ YGFV N D + + L+ D Y+ K + +G
Sbjct: 267 SYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 159/381 (41%), Gaps = 58/381 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+ +
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFESVE 355
Query: 232 ------------------DTVWFMAGSLFQKVSLARR---FALVPLGPPLLAYSS----- 265
D +W G L +V R AL+P L+ +S
Sbjct: 356 AEIISENRELFPGTVTFNDFLWAF-GILRSRVFAELRGDKLALIPFAD-LVNHSDDITSK 413
Query: 266 ------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 318
K K + D L K+GE I + + + N++L ++YGF + ++ D
Sbjct: 414 ESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFTESNSSRD 472
Query: 319 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVSDT 377
+ ++ DP Y DK +A+ NG F + G E++ MLPYLRL + T
Sbjct: 473 AYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG---ESLPPQMLPYLRLLCLGGT 529
Query: 378 SEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 430
+V L PVS E A+ + + K+ L Y T+ EDE +L
Sbjct: 530 DAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEELLGSE 587
Query: 431 NLHPKKRVATQLVRMEKKMLN 451
NL P+ ++A ++ EKK+L
Sbjct: 588 NLQPRLQIAVEVRAGEKKVLQ 608
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 157/379 (41%), Gaps = 54/379 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 90 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+ +
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFESVE 201
Query: 232 ------------------DTVW---FMAGSLFQKVSLARRFALVPLGPPL---------- 260
D +W + +F ++ + AL+P +
Sbjct: 202 AEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR-GDKLALIPFADLVNHSDDITSKE 260
Query: 261 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDR 319
++ K K + D L K+GE I + + + N++L ++YGF + ++ D
Sbjct: 261 SSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFTESNSSRDA 319
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 379
+ ++ DP Y DK +A+ NG F + G + MLPYLRL + T
Sbjct: 320 YTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG--ESLPPQMLPYLRLLCLGGTDA 377
Query: 380 M-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
+V L PVS E A+ + + K+ L Y T+ EDE +L NL
Sbjct: 378 FLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEELLGSENL 435
Query: 433 HPKKRVATQLVRMEKKMLN 451
P+ ++A ++ EKK+L
Sbjct: 436 QPRLQIAVEVRAGEKKVLQ 454
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 99/445 (22%), Positives = 189/445 (42%), Gaps = 64/445 (14%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
E+ +L W +NG L E P + A+ +++A + VP L++
Sbjct: 1 EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54
Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 55 TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL------- 231
++PL ++E E+ YL + ++ + + R+Y N+L
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKDSFT 165
Query: 232 --DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 283
D W ++ + ++ + AL+PL +S DD + V
Sbjct: 166 FDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTGYNLEDDRCECVAL 225
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
+ +K+GE I ++ G + N++ +I+ GF E+N +DR+ ++ ++ D Y K V R
Sbjct: 226 QDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARA 285
Query: 344 GKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVS 394
G + VF +H E + +L +LR+ G++ + + + PVS
Sbjct: 286 GIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS 344
Query: 395 PCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
E + +L + +AR L Y T+ +D +L ++ +A +L R+EK++L
Sbjct: 345 --WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEIL 400
Query: 451 NACLQVTADMIMLLPDVTVSPCPAP 475
L+ +D L + P P
Sbjct: 401 EKALKSASDNRKLYSKNSEEGTPLP 425
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 135/299 (45%), Gaps = 42/299 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNE-------------------LDTVWFMAGS 240
+ EL AYL SP + ++R + YN+ L+T + G
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSFGI 241
Query: 241 LFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVV 294
LF ++ S+ R ALVP +L +S + + L + DR Y+ GE + +
Sbjct: 242 LFSRLVRLPSMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVFI 300
Query: 295 WCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
G + + +LL++YGFV ++ NP D + + +LN D Y++K +RNG + F
Sbjct: 301 SYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 135/299 (45%), Gaps = 42/299 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNE-------------------LDTVWFMAGS 240
+ EL AYL SP + ++R + YN+ L+T + G
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSFGI 241
Query: 241 LFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVV 294
LF ++ S+ R ALVP +L +S + + L + DR Y+ GE + +
Sbjct: 242 LFSRLVRLPSMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVFI 300
Query: 295 WCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
G + + +LL++YGFV ++ NP D + + +LN D Y++K +RNG + F
Sbjct: 301 SYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 154/379 (40%), Gaps = 53/379 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ L +AL L+ E +G
Sbjct: 84 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E++ +
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTVGVKEYVQSEFDSVQ 196
Query: 232 ------------------DTVWFMAGSLFQKVS---LARRFALVPLG------PPLLAYS 264
D +W G L +V + AL+P P + +
Sbjct: 197 AEIISTNKDLFPGSITFDDFLWAF-GMLRSRVFPELRGDKLALIPFADLVNHSPNITSEG 255
Query: 265 S----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDR 319
S K K + + L K+G+ I + + + N++L ++YGFV+ + D
Sbjct: 256 SSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLDKSNAELALDYGFVESNPSRDS 314
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 379
V ++ DP Y DK +A+ NG F V + MLPYLRL + T
Sbjct: 315 FTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPLPPQMLPYLRLLCIGGTDA 372
Query: 380 M-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
SV L P+SP E ++ + D K+ LA Y T+ EDE + NL
Sbjct: 373 FLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALADYHTTIEEDEELSGRENL 430
Query: 433 HPKKRVATQLVRMEKKMLN 451
P+ +A + EKK+L
Sbjct: 431 QPRLAIAIGVRAGEKKVLQ 449
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 185/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSKFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L +++L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 101/424 (23%), Positives = 181/424 (42%), Gaps = 49/424 (11%)
Query: 70 VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
S + L +L W+ + G P +++ +E V A + AG+
Sbjct: 10 AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61
Query: 128 VPNSLVVTLERVLGNET-----IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
VP + ++TLE L E+ IAE L + +E LA +L+ EK + + SFW PYI
Sbjct: 62 VPRTHMLTLE--LARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDS 116
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFM 237
L + PL + E A L G + +A+ ++ +Y L F
Sbjct: 117 LPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYLSLCQNVPGYERFT 170
Query: 238 AGSL-FQKVSLARRF-----------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--D 283
G + ++S++ R LVP+ +L + + +D V+ +
Sbjct: 171 PGEFVWARLSVSSRLFSLKKGGFLGQTLVPMAD-MLNHRRPPDVLWETTEDGESFVMKAN 229
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
AG+ + G + N +L+++GFV +DN +D + + DP K+M+
Sbjct: 230 NAVAAGDEVHDSYGAKSNDLMLLHFGFVTDDNEHDEAFLGLRILDGDPLAATKQMLLMLP 289
Query: 344 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQS-VISSLGPICPVSPCMERA 400
+ + F + +LR+ +D ++ S V+S + P+S E
Sbjct: 290 SPTAARPFKISRPYVHTTTRMAFSFLRIAAAVPNDIEDISSRVMSGERALGPLSVENEEN 349
Query: 401 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADM 460
VL+ LA +ARL+ +P +L++DE +L +L P R + R EK+++ L++T
Sbjct: 350 VLELLAATCQARLSIFPTSLAQDEELLRGESLSPNARNCVLVRRAEKQLIEDYLEMTRVC 409
Query: 461 IMLL 464
+ LL
Sbjct: 410 LKLL 413
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 185/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L +++L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 178/391 (45%), Gaps = 63/391 (16%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
+++ +R +H A + + +P S ++TLE + ET +A+ + KL+ L+
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227
Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L+ +L+ E+K K+S W PY+ L + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280
Query: 218 LERAEGIKREYNEL-------------DTVWF-MAGS---LFQKVSLARRFALVPLGPPL 260
E+ IKR+Y+++ D W M S +++ + A VPL L
Sbjct: 281 REKKADIKRDYDDICSVAPEFAEYTFEDFCWARMTASSRVFGLQINEQKTDAFVPLADML 340
Query: 261 LAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 315
+ K DD +Q + D P GE + G + NS+ +NYGF++ DN
Sbjct: 341 --NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCNSRFFLNYGFINLDN 396
Query: 316 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKEAISDMLPYLRLGYV 374
+ + + + EDP + K+ + G + +V+ + +++ +S+ + YLR +
Sbjct: 397 DANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQNVSEFMSYLRFILI 454
Query: 375 SDTSEMQSVISSL----------------GPICPVSPCMERAVLDQLADYFKARLAGYPA 418
D S++ ++SSL P+S E + ++++ + ++ Y
Sbjct: 455 RDNSKL-LMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRISNMCQTSISLYNT 513
Query: 419 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 449
TL ED+ +L NL +R L EK++
Sbjct: 514 TLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/427 (22%), Positives = 181/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 72 GKREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L+ T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 126 LLXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 182 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 236
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ ++ + AL+PL + DD +
Sbjct: 237 SFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLITTGYNLEDDRCEC 296
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 297 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAXKAEVL 356
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ ++ S + + +
Sbjct: 357 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCXTEEELKEHLLGDSAIDRIFTLGNSEF 415
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+++L +++L + + A +L EK++L
Sbjct: 416 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKXAIKLRLGEKEILE 475
Query: 452 ACLQVTA 458
++ A
Sbjct: 476 KAVKSAA 482
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/417 (24%), Positives = 181/417 (43%), Gaps = 46/417 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---AGSLFQKV 245
++PL + E E+ L + ++ + + R+Y V A L K
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 246 SLAR---RFALVPLGPPLLAYSSKCKAMLAAV------------DDAVQLVVDRPYKAGE 290
+ R LV L A +C +A DD + V + ++AGE
Sbjct: 238 AFTYEDYRLGLVSLALGRWALGLECGVGIARCGKPQITTGYNLEDDRCECVALQDFRAGE 297
Query: 291 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 350
I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V R G + V
Sbjct: 298 QIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSV 357
Query: 351 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAV 401
F +H E + +L +LR+ +++ + + I +LG PVS E +
Sbjct: 358 FALHF-TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKL 416
Query: 402 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
L D L Y T+ ED+A L ++NL + +A +L EK++L ++ A
Sbjct: 417 WTFLEDRASLLLKTYKTTIEEDKAFLKNHNLSVRATMAIKLRLGEKEILEKAVKSAA 473
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 145/332 (43%), Gaps = 43/332 (12%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ +E +AL L+ E+ +G +SFW YI L G + W ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216
Query: 211 SPT--------------KAEILERAEGIKREYNELDTVWFMAGSLFQK---VSLARRFAL 253
SP A +LE + E + + +LF + + +R L
Sbjct: 217 SPAASATASMMAKLRAEHAAVLEGNSALDPEIFTFEAWQWAFTNLFSRAIRLKASRAGEL 276
Query: 254 VPLGP--PLLAYSSKCKAMLAA-----------VDDAVQLVVDRPYKAGESIVVWCGPQP 300
+ + P + +S + + A +D V L DR YK E + + GP+
Sbjct: 277 LAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKFEQVFISYGPKS 336
Query: 301 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSVQVFHVHAGREK 359
N+ LL+ YGF + NP++ + + + +D Y K A+ G+ +S F ++A R
Sbjct: 337 NADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSSAAFPLYADRFP 396
Query: 360 EAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 418
+ +++ +LR+ + D + + + +S E AVLD + D A +A YPA
Sbjct: 397 D---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRDACDAAVAAYPA 453
Query: 419 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+D + D L +R+A +LV EK++L
Sbjct: 454 KSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/427 (22%), Positives = 183/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E E+ + A+ D++A + VP
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 416
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 417 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 476
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 477 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 536
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 537 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEY 595
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED + L +++L + +A +L EK++L
Sbjct: 596 PVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLRNHDLSVRAAMAVKLRLGEKEILE 655
Query: 452 ACLQVTA 458
++ A
Sbjct: 656 RAVKSAA 662
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/405 (22%), Positives = 178/405 (43%), Gaps = 55/405 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--- 228
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYF 138
Query: 229 ----------NEL---------DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAY 263
N+L D W ++ + ++ + AL+PL
Sbjct: 139 YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT 198
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 199 NGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIK 258
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD------- 376
++ D Y K V R G + VF +H E + +L +LR+ +++
Sbjct: 259 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 317
Query: 377 --TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
S + + + PVS E + L D L Y T+ ED+++L +++L
Sbjct: 318 LGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSV 377
Query: 435 KKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 479
+ ++A +L EK++L ++ A + P + +SP APL
Sbjct: 378 RAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGKAPL 417
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 106/425 (24%), Positives = 177/425 (41%), Gaps = 65/425 (15%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ + KE + V D +AG A VP S V
Sbjct: 48 EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++EL + EL LAL+L E+ +G S W PY++ L +
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152
Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVW-------------FMAGS 240
++PL W++ + A L GSP A+ +ER++ + EY + V F+ +
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEVIKSDPSSYPPEAYEFLTEA 212
Query: 241 LFQK-----------VSLARRFALVPL----------GPPLLAYSS-----KCKAMLAAV 274
F + A+ +ALVPL P +L S+ +C V
Sbjct: 213 RFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDV 272
Query: 275 DDAVQLVVDRPYKAGESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 332
D A ++ A S V+ + N +L +N G+VD+ +P D + ++ + T D
Sbjct: 273 DTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTDIQTSDRL 332
Query: 333 YQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 391
+ K+ V + G + Q F V+ R + + YLR V D EM +V I
Sbjct: 333 FTAKKQVLEGMGFTAADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVSFEEDKI- 388
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKML 450
VS E +L L + +A Y ++ +L + P + + +RM EKK++
Sbjct: 389 -VSVMNEYEILQILMGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLRMSEKKLI 447
Query: 451 NACLQ 455
A +
Sbjct: 448 GATMN 452
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/291 (25%), Positives = 125/291 (42%), Gaps = 41/291 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMA-------------------GS 240
+ ++ +L+ + K L+ ++ +N+L+ FM G
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNREEFPPEVFNLKSFKWAFGI 207
Query: 241 LFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVV 294
LF ++ SL ++ AL+P G +L + ++ L + ++ +DR Y++ + +
Sbjct: 208 LFSRLVRLPSLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNREVFI 266
Query: 295 WCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
G + N +LL+ YGFV N D + + L+ D Y+ K + +G
Sbjct: 267 SYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 176/401 (43%), Gaps = 68/401 (16%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
+R +H A + ++ G+ +P ++TLE L E I +L+ + + L+ L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+Y++ EKK KSFW P++ L + P+L+++ EL +L GSP ++ ER
Sbjct: 428 IYIIIEKK-NHKSFWKPFLDILPVE------YTTFPILYTDEELFWLKGSPFLNQVKERR 480
Query: 222 EGIKREYNEL--------------DTVW--FMAGS----LFQKVSLARRFALVPLGPPLL 261
E I ++Y + + W MA S LF ++ R A VPL
Sbjct: 481 ECITQDYQAIVSKIPEFAKLCTLDEFAWARMMAASRIYGLF--INKKRTDAFVPLADMFN 538
Query: 262 ----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AY++ K ML A +D + G+ I CG + NS+ L+NYGFV
Sbjct: 539 HRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIYYSCGRKCNSRFLLNYGFV 589
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 370
++N + + + + +D K +M+ +R K +F +H E++++ + +LR
Sbjct: 590 VKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFRIHINYEEKSVLEFFGFLR 647
Query: 371 LGYVSDTSEMQ-----SVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSED 423
+ D ++ S P+ P S E+ + ++ + YP TL ED
Sbjct: 648 FVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIENEKQMWTEIHKICAEIMIQYPTTLDED 707
Query: 424 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 464
+ +L L ++ L EK++L + + M LL
Sbjct: 708 KKILETSKLTINQKNCVILRMGEKEILMYYITMADRMKKLL 748
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/423 (21%), Positives = 179/423 (42%), Gaps = 56/423 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 81 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 135 LMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------------ 231
++PL + E E+ YL + ++ + + R+Y
Sbjct: 190 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHAHKLPLKDS 245
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL S DD + V
Sbjct: 246 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECV 305
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 306 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 365
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 366 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYP 424
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 425 VSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILER 484
Query: 453 CLQ 455
++
Sbjct: 485 AVK 487
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/427 (22%), Positives = 180/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL----------------- 231
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHAHKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL S DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 RAVKSAA 483
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 94/426 (22%), Positives = 180/426 (42%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 81 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 135 LMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------------ 231
++PL + E E+ YL + ++ + + R+Y
Sbjct: 190 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRVIQTHPHAHKLPLKDS 245
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL S DD + V
Sbjct: 246 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECV 305
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 306 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 365
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 366 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYP 424
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 425 VSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAVKLRLGEKEILER 484
Query: 453 CLQVTA 458
++ A
Sbjct: 485 AVKSAA 490
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 TVKSAA 483
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 98/427 (22%), Positives = 185/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 AFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H+ E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L D++L + +A +L EK++L
Sbjct: 417 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKDHDLSVRAAMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 167/401 (41%), Gaps = 75/401 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A D+ GD +P +L +T + + + + + ++++E +A +L+YE+ +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W PY+ ++D V W + +LA+L GSP A ++REY+
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240
Query: 231 L----------------------------DTVWFMAGSLFQKVSLARRFALVPLGPPLLA 262
L T+ F + + ALVP L+
Sbjct: 241 LLGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYAD-LIN 299
Query: 263 YSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 312
+S +A + A D+ V L DR Y+ E I + GP+ N++LL+ YGF
Sbjct: 300 HSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYGPKSNAELLLLYGFAV 359
Query: 313 EDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQRNGKLSVQVFHVHAGR 357
E NP++ + V ++ DP ++K ++ G+ + F +A R
Sbjct: 360 ERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQVGRDATVDFPCYADR 419
Query: 358 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
+ML YLRL ++ ++ +S E AVL + +L+ YP
Sbjct: 420 YP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAVLTSVITAVSRQLSNYP 476
Query: 418 ATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 454
+ +D A++ D +L +R+A + R EK++L +
Sbjct: 477 QSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 96/427 (22%), Positives = 178/427 (41%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL----------------- 231
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPQAHKLPLKE 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEGELKEHLLGENAIDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+ L ++ L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLKNHGLSVRATMAVKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVESAA 483
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/427 (22%), Positives = 180/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 80 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL----------------- 231
++PL + E E+ YL + ++ + + R+Y
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHAHKLPLKD 244
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL S DD +
Sbjct: 245 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCEC 304
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 305 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 364
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 365 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEY 423
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 424 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILE 483
Query: 452 ACLQVTA 458
++ A
Sbjct: 484 RAVKSAA 490
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICP 392
R G + VF +H E + +L +LR+ +++ S + + + P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 186/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 89 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 142
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 143 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 197
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 198 ----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 253
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 254 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 313
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 314 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 373
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICP 392
R G + VF +H E + +L +LR+ +++ + + I +LG P
Sbjct: 374 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFP 432
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 433 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 492
Query: 453 CLQVTA 458
++ A
Sbjct: 493 AVKSAA 498
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 186/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICP 392
R G + VF +H E + +L +LR+ +++ + + I +LG P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 186/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 76 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 129
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 130 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 184
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 185 ----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 240
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 241 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 300
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 301 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 360
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICP 392
R G + VF +H E + +L +LR+ +++ + + I +LG P
Sbjct: 361 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFP 419
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 420 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 479
Query: 453 CLQVTA 458
++ A
Sbjct: 480 AVKSAA 485
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 117/242 (48%), Gaps = 38/242 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ + V
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116
Query: 235 WFMAGSLFQKVSL----------------------ARRFALVPLGPPLLAYSSKCKAMLA 272
L Q+V L + ALVP + + C+AML+
Sbjct: 117 CM----LMQQVKLFVCSRILTDYITVCSRAWGIETIKSLALVPF-VDFFNHDANCRAMLS 171
Query: 273 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTE 329
+D ++V DR Y G+ +V+ G N+ L +++GF NP+D++ + +L+ +
Sbjct: 172 YDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEK 231
Query: 330 DP 331
DP
Sbjct: 232 DP 233
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 115/240 (47%), Gaps = 32/240 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ +
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116
Query: 235 WFMAGSLFQK--------------------VSLARRFALVPLGPPLLAYSSKCKAMLAAV 274
+ +F + + ALVP + + C+AML+
Sbjct: 117 LLLCPHIFGSRVSALEFKHAYATVCSRAWGIETIKSLALVPF-VDFFNHDANCRAMLSYD 175
Query: 275 DD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDP 331
+D ++V DR Y G+ +V+ G N+ L +++GF NP+D++ + +L+ +DP
Sbjct: 176 EDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDP 235
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/229 (29%), Positives = 108/229 (47%), Gaps = 23/229 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-- 231
W PY+R L QL +ES + W ELA+L SPT E +E + IK E++ L
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLTL 119
Query: 232 --DTVWFMAGSLFQKVSL-----ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVV 282
V+ +VSL A +P + C+ L+ D V
Sbjct: 120 ANKQVFPWCRDALGEVSLTDFMHAYSTDQIPFAD-FFNHDHNCQTRLSYDKEKDCAVAVA 178
Query: 283 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 331
D+ YKAG+ I + G PNS L ++YGF NP++++ V ++ DP
Sbjct: 179 DQDYKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQVEVPMGVSLTDP 227
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 186/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 186/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 75 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 185 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 239
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 240 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 299
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 300 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 359
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 360 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEF 418
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+++L + +L + ++A +L EK++L
Sbjct: 419 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILE 478
Query: 452 ACLQVTA 458
++ A
Sbjct: 479 KAVKSAA 485
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 185/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQSLPGE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PICP 392
R G + VF +H E + +L +LR+ +++ + + I +LG P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E S E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y + ED++ L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLKNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 99/427 (23%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+A+L L + ++A +L EK++L
Sbjct: 417 PVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLKGPELPTRMKMAVKLRLGEKEILE 476
Query: 452 ACLQVTA 458
+Q A
Sbjct: 477 RTVQSAA 483
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 122/261 (46%), Gaps = 38/261 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +++ G+ VP SLV++ + N + +++ + + + LA YL+ E
Sbjct: 13 LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 73 SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125
Query: 232 -------------DTVWFMA------GSLFQKV----SLARRFALVPLGPPLLAYSSKCK 268
+ V+ M G LF ++ S+ + ALVP +L ++ +
Sbjct: 126 RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLPSMDGKVALVPWA-DMLNHNPEVD 184
Query: 269 AMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVE 323
A L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ NP D +
Sbjct: 185 AFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFS 243
Query: 324 AALNTEDPQYQDKRMVAQRNG 344
+LN D Y++K +R+G
Sbjct: 244 VSLNKSDDCYREKLQALKRHG 264
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ +L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+A L ++L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKGHDLSIRATMAVKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 RAVKSAA 483
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 163/368 (44%), Gaps = 52/368 (14%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + ++ V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDVDTVRRSE-IGECCA--GLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF------MAGS 240
++SPL WSE ELA L G+ + I E +KREY+++ T S
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMTEVIEPRPDVFDRS 145
Query: 241 LFQKVSLARRFALVP--LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY----KAG----- 289
L+ + F ++ PPL+ + + V+ L + PY AG
Sbjct: 146 LYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDPYWHVKSAGVFARQ 205
Query: 290 ---------------ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 332
E ++ + + N++L +YGFVD D N D + ++ +
Sbjct: 206 ETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERF 265
Query: 333 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQSVISSL--- 387
DK +AQ G S F+++ R + DM+ YLRL + SD+ ++++ +
Sbjct: 266 ADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWD 323
Query: 388 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 447
P+S E A+ + + + +A L Y +T+ ED +L L +K++A + EK
Sbjct: 324 HLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEK 383
Query: 448 KMLNACLQ 455
++L LQ
Sbjct: 384 RILQEQLQ 391
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+ +L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+ +L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/411 (22%), Positives = 176/411 (42%), Gaps = 56/411 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 442
PVS E + L D L Y T+ ED+++L + +L + ++A +L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKL 467
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 111/257 (43%), Gaps = 15/257 (5%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPP 259
W + EL + S E + R I+ E+NE+ +V F S A G
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSVPFADFMNHDWSSNAMLTYDTDNGST 203
Query: 260 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
+ L QL D+ Y AGE + + GP N+ L +++GF NP+D+
Sbjct: 204 EVEEVKVYSDCLYIALFCAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDK 263
Query: 320 LVVEAALNTEDPQYQDK 336
+ + ++ D ++K
Sbjct: 264 VQLWLGISRRDSLRKEK 280
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 125/318 (39%), Gaps = 70/318 (22%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF----------------- 242
W + ELA + S E + R I+ E+NE+ ++ +F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPIFQRYEHVFGGPVSYASFKHAYVTAT 203
Query: 243 -----QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--------------------- 276
++ + A+VP + + AML D
Sbjct: 204 VCSRAWRIDGLEKLAMVPFA-DFMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALF 262
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 336
QL D+ Y AGE + + GP N+ L +++GF NP+D++ + ++ D ++K
Sbjct: 263 CAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 322
Query: 337 RMVAQRNGKLSVQVFHVH 354
+Q H H
Sbjct: 323 -----------LQYLHAH 329
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/427 (22%), Positives = 181/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 87 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQYAYFYRVIQTHPHANKLPLKD 251
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 252 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLITTGYNLEDDRCEC 311
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 312 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 371
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ + + + +
Sbjct: 372 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDRIFTLGNSEY 430
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED A L + +L + +A +L EK++L
Sbjct: 431 PVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLKNSDLSVRATMAIKLRLGEKEILE 490
Query: 452 ACLQVTA 458
++ A
Sbjct: 491 KAVKSAA 497
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 161/377 (42%), Gaps = 53/377 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
+S W Y+ L S + WSE EL + G+ L E ++ E+ ++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 232 ------------------DTVWFMAGSLFQKVSLAR----RFALVPLG------PPLLA- 262
D +W A + + + +R L+PL P +
Sbjct: 190 EEVILPHSQLFPFPVTLDDFLW--AFGILRSRAFSRLRGQNLVLIPLADLINHSPSITTE 247
Query: 263 -YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDR 319
Y+ + K A L + D L KAGE +++ + + N++L ++YGF++ +
Sbjct: 248 EYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNS 307
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DT 377
+ ++ DP + DK +A+ NG + F + G+ A MLPYLRL + D
Sbjct: 308 YTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDA 365
Query: 378 SEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
++S+ + G + PVS E + + D K+ L+GY T+ EDE + + NLHP
Sbjct: 366 FLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHP 425
Query: 435 KKRVATQLVRMEKKMLN 451
+ +A + EKK+L
Sbjct: 426 RLEIAVGVRTGEKKVLQ 442
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/386 (21%), Positives = 169/386 (43%), Gaps = 50/386 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQYA 218
Query: 230 EL----------------------DTVWFMAGSLFQKVSLARR------FALVPLGPPLL 261
D W ++ + ++ + AL+PL
Sbjct: 219 YFYKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCN 278
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+
Sbjct: 279 HTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVK 338
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 381
++ ++ D Y K V R G + VF +H+ E + +L +LR+ +S+ +
Sbjct: 339 IKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISAQLLAFLRVFCMSEEELKE 397
Query: 382 SVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LAGYPATLSEDEAMLTDYNL 432
+I ++G I + D +L + +AR L Y T+ D++ L ++L
Sbjct: 398 HLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVDKSFLETHDL 457
Query: 433 HPKKRVATQLVRMEKKMLNACLQVTA 458
P +A +L EK++L ++ A
Sbjct: 458 TPHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 161/377 (42%), Gaps = 53/377 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ +A VP + + V +E + L +AL+L+ EK +
Sbjct: 81 LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
+S W Y+ L S + WSE EL + G+ L E ++ E+ ++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189
Query: 232 ------------------DTVWFMAGSLFQKVSLAR----RFALVPLG------PPLLA- 262
D +W A + + + +R L+PL P +
Sbjct: 190 EEVILPHSQLFPFPVTLDDFLW--AFGILRSRAFSRLRGQNLVLIPLADLINHSPSITTE 247
Query: 263 -YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDR 319
Y+ + K A L + D L KAGE +++ + + N++L ++YGF++ +
Sbjct: 248 EYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNS 307
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DT 377
+ ++ DP + DK +A+ NG + F + G+ A MLPYLRL + D
Sbjct: 308 YTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDA 365
Query: 378 SEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
++S+ + G + PVS E + + D K+ L+GY T+ EDE + + NLHP
Sbjct: 366 FLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHP 425
Query: 435 KKRVATQLVRMEKKMLN 451
+ +A + EKK+L
Sbjct: 426 RLEIAVGVRTGEKKVLQ 442
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 96/430 (22%), Positives = 180/430 (41%), Gaps = 62/430 (14%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL-- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRVIQTHPHANKLPL 235
Query: 232 -------DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAV 278
D W ++ + ++ + AL+PL + DD
Sbjct: 236 KDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRC 295
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 338
+ V R + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 296 ECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAE 355
Query: 339 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI-------- 390
V R G + VF +H E + +L +LR+ + E++ + G I
Sbjct: 356 VLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTEEELREHLLGDGAIDRIFTLGN 413
Query: 391 --CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 448
PVS E + L D L Y T+ ED+A+L L + +A +L EK+
Sbjct: 414 SEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVLRSPALSARAAMAVKLRLGEKE 473
Query: 449 MLNACLQVTA 458
+L ++ A
Sbjct: 474 ILEKAVRSAA 483
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 98/433 (22%), Positives = 176/433 (40%), Gaps = 58/433 (13%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
VAG S +E W++ NG+ V + K + + A++D++
Sbjct: 66 VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
++P +++T + + + TI +L+ ++L + LA++++ EK + SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--------- 231
L + PL ++ E+ GS E L++ + I R+Y L
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYLFKLLNLPEN 230
Query: 232 ------------------DTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 273
TV + K +L+PL + + K
Sbjct: 231 SKLHIREYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCNHANGEMKTDFIE 290
Query: 274 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 333
D+ + R + GE I + G + ++ LL+ GFV N YD + ++ L++ D Y
Sbjct: 291 ERDSCVNMALRDFSVGEQIFICYGRRSSADLLLYSGFVYPGNVYDGMAIQLGLSSSDRLY 350
Query: 334 QDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQS 382
K + KL V Q +H+ AG+E + ++L +LR+ + D + Q+
Sbjct: 351 AMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQDLELRDRLLGDNRAQA 408
Query: 383 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 442
+ S + +S E LA Y ++ EDE L D NL ++R A QL
Sbjct: 409 LFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEKLKDGNLSAQERSALQL 468
Query: 443 VRMEKKMLNACLQ 455
+ +EK L L+
Sbjct: 469 LLIEKCTLENVLE 481
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 96/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 AFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H + + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 87/384 (22%), Positives = 169/384 (44%), Gaps = 50/384 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--- 228
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQYAYF 138
Query: 229 ----------NEL---------DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAY 263
N+L D W ++ + ++ + AL+PL
Sbjct: 139 YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHT 198
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++
Sbjct: 199 NGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIK 258
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 383
++ D Y K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 259 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 317
Query: 384 --------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
I +LG PVS E + L D L Y T+ ED+ +L + +L
Sbjct: 318 LGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNQDLSV 377
Query: 435 KKRVATQLVRMEKKMLNACLQVTA 458
+ ++A +L EK++L ++ A
Sbjct: 378 RAKMAIKLRLGEKEILEKAVKSAA 401
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 96/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 AFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H + + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 AFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H + + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/427 (22%), Positives = 181/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG V E + E+ + A+ D++A + VP
Sbjct: 62 GKREDYFPNLMKWASENG---ASVEGFEMFNFKEEGFGLR---ATRDIKAEELFLWVPRK 115
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 116 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 171
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 172 -----YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 226
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 227 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 286
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 287 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 346
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ + + + +
Sbjct: 347 ARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDKIFTLGNSEY 405
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T ED++ L +++L + R+A +L EK++L
Sbjct: 406 PVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKNHDLSVRARMAIKLRLGEKEILE 465
Query: 452 ACLQVTA 458
+ A
Sbjct: 466 KAVTSAA 472
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 AFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG-PIC 391
R G + VF +H + + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEY 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED++ L +++L + +A +L EK++L
Sbjct: 417 PVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 96/408 (23%), Positives = 174/408 (42%), Gaps = 75/408 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + A + +P S ++TLE + ET +A+ + +L L+ L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
K + SFW PY+ L Q P+ ++ +L +L GSP +I ++ +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275
Query: 228 YNELDTVW--FMAGSLFQ----KVSLARRF-----------ALVPLGPPL---------L 261
YN++ V F S ++ +++ + R A VPL L
Sbjct: 276 YNDICNVAPEFSQYSFYEFCWARMTASSRIFGINIKGVKTDAFVPLADMLNHKRPKLTSW 335
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
YS + + + D+ + DR G+ I G + NS+ L+NYGFV +DN + +
Sbjct: 336 CYSEEKQGFIIETDEKI----DR----GQMIFDSYGRKCNSRFLLNYGFVVDDNDANEVN 387
Query: 322 VEAALNTEDPQYQDK--------------RMVAQRNGKLSVQVFHVHAGREKEAISDMLP 367
V A DP Q K R++ +G + F + + + +
Sbjct: 388 VTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGINEITHFL------EATVMEFMS 441
Query: 368 YLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCMERAVLDQLADYFKARLAGYPATL 420
Y+R + D +++Q +++ + P I P+ E + D + L+ YP TL
Sbjct: 442 YIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNELDMWDLIRRICYVSLSRYPTTL 501
Query: 421 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 468
+D+ +L +L +R L EK++L Q + M LL +
Sbjct: 502 EQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSEKMKQLLSNFN 549
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/428 (22%), Positives = 175/428 (40%), Gaps = 63/428 (14%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW +NG C+ + + + A+ D++A + +P ++
Sbjct: 75 REDHFPELMSWAKENG-ASCECFTVANFG-----KEGYGLRATRDIKAEELFLWIPRKML 128
Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+T+E N + L + +++ + LAL+L+ E+ SFWLPYIR L ++
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----------NELDTVWF---- 236
+ PL + + ++ L G+ ++L + + R+Y L +V
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFA 239
Query: 237 -----MAGSLFQKVSLAR----------------RFALVPLGPPLLAYSSKCKAMLAAVD 275
M G LF + +++ AL+PL + D
Sbjct: 240 SLTPVMGGKLFDQWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLED 299
Query: 276 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 335
D + V + YK E I ++ G + N++ +I+ GF +D+ +DR+ ++ ++ + Y
Sbjct: 300 DRCECVALQDYKENEQIYIFYGTRSNAEFVIHNGFFFQDDAHDRVKIKLGVSKSERLYAM 359
Query: 336 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG---------YVSDTSEMQSVISS 386
K V R G + VF +H E + +L +LR+ Y+ + + +
Sbjct: 360 KAEVLARAGIPASYVFALHCN-EPPISAQLLAFLRVFCMTEDELKYYLLGDRAINKIFTL 418
Query: 387 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 446
PVS E + L L Y T ED +ML +L R+A QL E
Sbjct: 419 GNSEFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSMLEKPDLSLHSRMAIQLRLAE 478
Query: 447 KKMLNACL 454
K++L L
Sbjct: 479 KQILEKAL 486
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 160/383 (41%), Gaps = 56/383 (14%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S+ + G+ F+VP + +T + ++ + L L +AL+L++E+ G S
Sbjct: 4 SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL---- 231
W PYI L G SP+ W E +LA L GS + ++ +++L
Sbjct: 61 WAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEV 113
Query: 232 --------DTVWFMAGSLFQKVSLAR----------RFALVPLG----------PPLLAY 263
D + F + R ALVPL P +
Sbjct: 114 FGPNSQAFDPIVFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWPPDSAGW 173
Query: 264 SSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWCGPQ-PNSKLLINYGFVDE--DNPYD 318
K L LV++ AG++I + GPQ + +LL+++G +D + P
Sbjct: 174 QLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPLVNQPSY 233
Query: 319 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDT 377
L +E L+ ED Y DK + + N +L+ H+ A R +A + L +D
Sbjct: 234 ALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPDAGLLPVLRLLNLSGTDA 290
Query: 378 SEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
++S+ + PVS ER QL D A LA YP ++ ED A++ +L P
Sbjct: 291 FLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYPTSIDEDLALMASGSLQP 350
Query: 435 KKRVATQL-VRM-EKKMLNACLQ 455
R + + VR+ EK+ L+A L+
Sbjct: 351 GSRRQSAVRVRLGEKEALDATLR 373
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/429 (21%), Positives = 174/429 (40%), Gaps = 62/429 (14%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T+ GSRE + DL SW +NG + + R + D++A
Sbjct: 69 TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
+ VP +++T+E N + + + +++ + LAL+L+ E+ SFW P
Sbjct: 117 EELFLWVPRKMLMTVESA-QNSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------- 231
YIR L ++ ++PL + + ++ L G+ ++L + + R+Y
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQYAYFYKLVQTH 227
Query: 232 ---------------DTVWFMAGSLFQK-----VSLAR-RFALVPLGPPLLAYSSKCKAM 270
D W ++ + ++ V +R AL+PL +
Sbjct: 228 PAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHTNGLITTG 287
Query: 271 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
DD + V + YK E I ++ G + N++ +I+ GF +DN +DR+ ++ ++ +
Sbjct: 288 YNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHDRVKIKLGVSKSE 347
Query: 331 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQ 381
Y K V R G + VF +H + + +L +LR+ Y+ +
Sbjct: 348 RLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVFCMTEEELKDYLLGERAIN 406
Query: 382 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 441
+ + PVS E + L L Y T ED ++L +L R+A Q
Sbjct: 407 KIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSILEKPDLSLHTRLAVQ 466
Query: 442 LVRMEKKML 450
L EK++L
Sbjct: 467 LRLAEKQIL 475
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/261 (25%), Positives = 118/261 (45%), Gaps = 33/261 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++ G+ V L++T ++ + ELL T ++E A LAL+++ E+ G+ S
Sbjct: 5 AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L A+ S + W + EL + + E ++R I E+ + V
Sbjct: 62 QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116
Query: 235 WFMAGSLFQKVSLARRFA------------------LVPLGPPLLAYSSKCKAMLAAVDD 276
+F + L +F VP + S C+A+L+ ++
Sbjct: 117 LQKCPHIFGERVLHSKFKQAYATGKSLRRSSNTRILTVPF-VDFFNHDSNCRALLSYDEE 175
Query: 277 --AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
+++ D+ Y GE +V+ G PN+ L +++GF NPYD++ V AL+ DP +
Sbjct: 176 RACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFTISCNPYDQVEVWMALSHRDPLRK 235
Query: 335 DKRMVAQRNGKLSVQVFHVHA 355
K + +G +V VHA
Sbjct: 236 MKLALLHAHGMPTV----VHA 252
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 105/231 (45%), Gaps = 12/231 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 63 ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI+ L + + + +LW EL + S E +E E K+E+ L
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEFLALKPA 174
Query: 235 WFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD--DAVQLVVDRPYKAGESI 292
LF +V L L L + ++L + D +++ DR Y GE +
Sbjct: 175 LDHFPHLFGEVKLGDFMHASAL--DFLNHDGVFGSVLIYDEQKDVCEIIADRNYAVGEQV 232
Query: 293 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
++ G N+ L +N+GF N YD+ ++ + +DP Y+ K + Q++
Sbjct: 233 MIRYGKYSNATLALNFGFTLARNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 160/377 (42%), Gaps = 49/377 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ +L G+ VP L + + V ++ + L ++L ++ E +G
Sbjct: 86 LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E++ +
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198
Query: 232 ------------DTVWF-----MAGSLFQKVS---LARRFALVPLGPPLLAYSS------ 265
DT+ F G L +V + AL+P L+ +S+
Sbjct: 199 AKIIGPNKDLFPDTITFDDFLWAFGILRSRVFPELRGDKLALIPFAD-LINHSADITSKQ 257
Query: 266 -----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDR 319
+ K L D L K+GE + V + + N++L ++YGF + ++ D
Sbjct: 258 SCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLDKSNAELALDYGFTETNSTRDS 316
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 379
+ ++ DP Y DK +A+ NG F V G + M+ YLRL + T
Sbjct: 317 YTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLGGTDA 374
Query: 380 --MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
++++ + G + PVS E ++ + K+ L Y T+ EDE +L +L
Sbjct: 375 FLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALTAYHTTIEEDEELLKREDLQS 434
Query: 435 KKRVATQLVRMEKKMLN 451
+ ++A ++ EKK+L
Sbjct: 435 RHQIAVEVRAGEKKVLQ 451
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 159/368 (43%), Gaps = 52/368 (14%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + + V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE---------------- 230
++SPL WSE ELA L G+ + + E +KREY++
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMTEVIEPRPDVFDRS 145
Query: 231 ---LDTVWFMAGSLFQKVS---LARRFALVPL------GPPLL----AYSSKCKAMLAAV 274
L+ + G L + + ALVPL G L + K + A
Sbjct: 146 LYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDPGWKVKSAGVFARQ 205
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 332
+ + E ++ + + N++L +YGFVD D N D + ++ +
Sbjct: 206 ETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERF 265
Query: 333 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQSVISSL--- 387
DK +AQ G S F+++ R + DM+ YLRL + SD+ ++++ +
Sbjct: 266 ADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWD 323
Query: 388 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 447
P+S E A+ + + + +A L Y +T+ ED +L L +K++A + EK
Sbjct: 324 HLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEK 383
Query: 448 KMLNACLQ 455
++L LQ
Sbjct: 384 RILQEQLQ 391
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 163/409 (39%), Gaps = 79/409 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
A + GD +P L +T + R LG + + E ++E +A L++EK +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264
Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
G+ SF+ Y+ E+D V W + +LA+L GSP A ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316
Query: 228 YNEL----------------------------DTVWFMAGSLFQKVSLARRFALVPLGPP 259
Y++L T+ F + + + R A+VP
Sbjct: 317 YDDLLGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMVPYAD- 375
Query: 260 LLAYSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
L+ +S+ +A + A ++ V L DR Y+ E + + G + N++LL+ YG
Sbjct: 376 LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYG 435
Query: 310 FVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVAQRNGKLSVQVFHVHAG 356
F E NPY+ + V ++ EDP +K G+ F +A
Sbjct: 436 FALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDFPCYAD 495
Query: 357 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 416
R +ML YLRL ++ +S +S E +VL + K +L +
Sbjct: 496 RYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEASVLRSVVAAVKYQLGLF 552
Query: 417 PATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQVTADMI 461
P T +D A++ D + +R+A + R EK++L L I
Sbjct: 553 PQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAALEKQI 601
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 94/427 (22%), Positives = 181/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+ +L + +L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 95/427 (22%), Positives = 184/427 (43%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKE 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI-C 391
R G + VF +H+ E + +L +LR+ +++ + + I +LG
Sbjct: 358 ARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+ +L + +L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 87/390 (22%), Positives = 169/390 (43%), Gaps = 58/390 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + +PL + E E+ YL + ++ + + R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYA 218
Query: 230 EL----------------------DTVWFMAGSLFQKVSLARR------FALVPLGPPLL 261
D W ++ + ++ + AL+PL
Sbjct: 219 YFYKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCN 278
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+
Sbjct: 279 HTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVK 338
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 381
++ ++ D Y K V R G + VF +H+ E + +L +LR+ +S+ +
Sbjct: 339 IKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMSEEELKE 397
Query: 382 SVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLT 428
+I +LG PVS E +L + +AR L Y T+ +D++ L
Sbjct: 398 HLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVEDDKSFLE 453
Query: 429 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
++L +A +L EK++L ++ A
Sbjct: 454 THDLTSHAIMAIKLRLGEKEILEKAVKSAA 483
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 77 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 130
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 131 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 185
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 186 ----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKES 241
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 242 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 301
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 302 ALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 361
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CP 392
R G + VF +H+ E + +L +LR+ +++ + + I +LG P
Sbjct: 362 RAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFP 420
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+ +L + +L + +A +L EK++L
Sbjct: 421 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEK 480
Query: 453 CLQVTA 458
++ A
Sbjct: 481 AVKSAA 486
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 89/399 (22%), Positives = 171/399 (42%), Gaps = 58/399 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLA 248
++PL + E E+ YL + ++ + + R+Y + F KV
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----------AYFYKV--- 223
Query: 249 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 308
+ G L DD + V + ++ GE I ++ G + N++ +I+
Sbjct: 224 ---IQITTGYNL-------------EDDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHS 267
Query: 309 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 368
GF ++N +DR+ ++ ++ D Y K V R G + VF +H + + +L +
Sbjct: 268 GFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPISAQLLAF 326
Query: 369 LRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 419
LR+ +++ S + + + PVS E + L D L Y T
Sbjct: 327 LRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTT 386
Query: 420 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
+ ED+++L +L + +A +L EK++L ++ A
Sbjct: 387 IEEDKSVLKSRDLSVRATMAIKLRLGEKEILERAVRSAA 425
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 85/381 (22%), Positives = 159/381 (41%), Gaps = 52/381 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L +++ + LAL+L+ E+
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERA-N 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + E+ L G+ ++L + R+Y
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQYAYF 220
Query: 232 ----------------------DTVWFMAGSLFQKVSL----ARR--FALVPLGPPLLAY 263
D W ++ + ++ + R+ AL+PL
Sbjct: 221 YKLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLALIPLWDMCNHR 280
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ DD + V + YK E I ++ G + N++ +I+ GF ++N +D++ ++
Sbjct: 281 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQENAHDQVKIK 340
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 383
++ + Y K V R G +F ++ E+ + +L +LR+ + E++
Sbjct: 341 LGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPISAQLLAFLRV-FCMKEEELRDY 398
Query: 384 ---------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
I +LG + PVS E + L L Y T ED + L L
Sbjct: 399 LLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEEDSSTLEKSELS 458
Query: 434 PKKRVATQLVRMEKKMLNACL 454
P R+A QL EK +L L
Sbjct: 459 PHSRMAIQLRLAEKWILEKAL 479
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 88/399 (22%), Positives = 173/399 (43%), Gaps = 58/399 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLA 248
++PL + E E+ L + ++ + + R+Y ++ + +
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----------AYFYKVIQIT 227
Query: 249 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 308
+ L DD + V + ++AG+ I ++ G + N++ +I+
Sbjct: 228 TGYNLE--------------------DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHS 267
Query: 309 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 368
GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +
Sbjct: 268 GFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAF 326
Query: 369 LRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPAT 419
LR+ +++ + + I +LG PVS E + L D L Y T
Sbjct: 327 LRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTT 386
Query: 420 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 387 IEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 425
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 93/427 (21%), Positives = 181/427 (42%), Gaps = 56/427 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 237
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 238 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 297
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 298 VALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 357
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPIC 391
R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 358 ARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEF 416
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
PVS E + L D L Y T+ ED+ +L + +L + +A +L EK++L
Sbjct: 417 PVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILE 476
Query: 452 ACLQVTA 458
++ A
Sbjct: 477 KAVKSAA 483
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 93/399 (23%), Positives = 158/399 (39%), Gaps = 73/399 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEKKQ- 170
A + GD +P L +T + + LG + + + ++E +A L++EK
Sbjct: 50 ARRSINDGDELLKIPMDLCITRKSARKALGKDAL-----QDGINEYLAIACQLIHEKYVL 104
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +S W Y+ L V W + +LA+L GSP A ++REY+
Sbjct: 105 GDESEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDA 159
Query: 231 L-----------------------DTVW-----FMAGSLFQKVSLARRFALVPLGPPLLA 262
L + VW F + + + R A+VP L+
Sbjct: 160 LLGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYA-DLIN 218
Query: 263 YSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 312
+S+ A + A ++ V L DR Y+ E + + G + N++LL+ YGF
Sbjct: 219 HSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGFAL 278
Query: 313 EDNPYDRLVVEAALNTE-------------DPQYQDKRMVAQRNGKLSVQVFHVHAGREK 359
E NPY+ + V ++ DP Q+K G+ F +A R
Sbjct: 279 ERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVEFLASVGRDQTVDFPCYADRYP 338
Query: 360 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 419
+ML +LRL ++ ++ +SP E AVL + + K +L YP +
Sbjct: 339 ---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTISPANEAAVLSSVVEAVKYQLNLYPQS 395
Query: 420 LSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 454
+D ++ D L +R+A + R EK++L L
Sbjct: 396 EEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLLKRTL 434
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/390 (21%), Positives = 170/390 (43%), Gaps = 58/390 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYA 218
Query: 230 EL----------------------DTVWFMAGSLFQKVSLARR------FALVPLGPPLL 261
D W ++ + ++ + AL+PL
Sbjct: 219 YFYKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCN 278
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+
Sbjct: 279 HTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVK 338
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 381
++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++ +
Sbjct: 339 IKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEEELKE 397
Query: 382 SVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLT 428
+I +LG P+S E +L + +AR L Y T+ +D++ L
Sbjct: 398 HLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDKSFLE 453
Query: 429 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
++L +A +L EK++L ++ A
Sbjct: 454 THDLTSHATMAIKLRLGEKEILEKAVKSAA 483
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/380 (21%), Positives = 155/380 (40%), Gaps = 50/380 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L T +++ + LAL+L+ E+
Sbjct: 28 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + ++ L G+ ++L + R+Y
Sbjct: 86 PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQYAYF 138
Query: 232 ----------------------DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAY 263
D W ++ + ++ + AL+PL
Sbjct: 139 YKLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLALIPLWDMCNHR 198
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ DD + V + YK E I ++ G + N++ +I+ GF ++N +D++ ++
Sbjct: 199 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQENAHDQVKIK 258
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYV 374
++ + Y K V R G VF ++ E + +L +LR+ Y+
Sbjct: 259 LGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPISAQLLAFLRVFCMMEEELKDYL 317
Query: 375 SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
+ +++ PVS E + L L Y T ED + L +L P
Sbjct: 318 FGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALLLKAYKTTAEEDSSTLDKTDLSP 377
Query: 435 KKRVATQLVRMEKKMLNACL 454
R+A QL EK +L L
Sbjct: 378 HSRMAVQLRLAEKAILEKAL 397
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 184/426 (43%), Gaps = 56/426 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKES 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFQAGDQIYIFYGTRSNAESVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CP 392
R G + VF +H+ E + +L +LR+ +++ + + I +LG P
Sbjct: 359 RAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFP 417
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E + L D L Y T+ ED+ +L + +L + +A +L EK++L
Sbjct: 418 VSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEK 477
Query: 453 CLQVTA 458
++ A
Sbjct: 478 AVKSAA 483
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/435 (21%), Positives = 176/435 (40%), Gaps = 88/435 (20%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W +NG V +K++P + + + A E ++ +P LV+T ++
Sbjct: 50 WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTV----------------------WFM 237
++ EL L+GS E L+ I R+Y + W +
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKIFRTHPLAKSLPFKDCFTYDLYRWAV 214
Query: 238 AGSLFQKVSL------------------ARRFALVPL--------GPPLLAYSS-----K 266
+ + ++ ++ A ALVPL G L Y S +
Sbjct: 215 SAVMTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSASMVR 274
Query: 267 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
C AM R + GE + ++ G + N++ I+ GFV EDN YD + ++ +
Sbjct: 275 CYAM-------------RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDIKLGV 321
Query: 327 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 386
+ +DP + K + + + LS+ R++ D+ +LR+ + D S+ ++ S
Sbjct: 322 SKKDPLFAVKSKLCE-DHDLSLSGTFALVARDRPVSEDLSTFLRILVLKDASQPEAF--S 378
Query: 387 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 446
I S R L L + L +P + E E ++ D + + ++A +L +E
Sbjct: 379 AEHILTSSDSNARDALTFLVVRIELLLKAFPKSDEEYEDIIKDGASNARVKMAARLRLLE 438
Query: 447 KKMLNACLQVTADMI 461
K+L + L+ + +
Sbjct: 439 SKVLASVLETLGNHV 453
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/378 (21%), Positives = 157/378 (41%), Gaps = 50/378 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQYA 218
Query: 230 EL----------------------DTVWFMAGSLFQKVSLAR------RFALVPLGPPLL 261
D W ++ + ++ + AL+PL
Sbjct: 219 YFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCN 278
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN +DR+
Sbjct: 279 HTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVK 338
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------G 372
++ ++ + Y K V R G + +F +H E + +L +LR+
Sbjct: 339 IKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRD 397
Query: 373 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
Y+ + + + PVS E + L L Y ED +ML +L
Sbjct: 398 YLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDL 457
Query: 433 HPKKRVATQLVRMEKKML 450
R+A +L EK++L
Sbjct: 458 SLHSRIAIKLRLAEKEIL 475
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 148/387 (38%), Gaps = 65/387 (16%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 83 DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL----- 231
PY+ L + PLLW+ EL L GSP + + + R E EY +
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASIADQIR 194
Query: 232 ----------------------------DTVWFMAGSLFQKVSLARRFALV--------- 254
VW A + + V L LV
Sbjct: 195 SNPNDFPPDAYEFLTRDAFVDALATVLARAVWLNAANCYAMVPLVDLLPLVGSPPPGVSP 254
Query: 255 -------PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 307
+G P LA ++ AA + + + + + V + L +
Sbjct: 255 AAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLARNAGDLFLA 314
Query: 308 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDML 366
G VDE + D L A+ D Y+ KR + + G Q F V A R +L
Sbjct: 315 TGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADRMP---MQLL 371
Query: 367 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 426
Y+R V D E+ SV I VSP E VL L + LA Y ++ E E +
Sbjct: 372 AYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQDAREMLAEYESSSEEFELL 429
Query: 427 -LTDYNLHPKKRVATQLVRMEKKMLNA 452
L + L ++RVA +L EK+++NA
Sbjct: 430 QLKEKGLSARQRVAAKLRLAEKRLINA 456
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/399 (21%), Positives = 170/399 (42%), Gaps = 58/399 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLA 248
++PL + E E+ L + ++ + + R+Y ++ + +
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----------AYFYKVIQIT 227
Query: 249 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 308
+ L DD + V + ++AG+ I ++ G + N++ +I+
Sbjct: 228 TGYNLE--------------------DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHS 267
Query: 309 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 368
GF ++N +DR+ ++ ++ D Y K V R G + VF +H E + +L +
Sbjct: 268 GFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAF 326
Query: 369 LRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 419
LR+ +++ S + + + PVS E + L D L Y T
Sbjct: 327 LRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTT 386
Query: 420 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 387 IEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 425
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 168/384 (43%), Gaps = 51/384 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 232 ------------------DTVW---FMAGSLFQKVSLARRFALVPLG------PPLLA-- 262
D +W + F ++ + L+PL P +
Sbjct: 188 QEILLPNKDLFSSRITLDDFIWAFGILKSRAFSRLR-GQNLVLIPLADLINHNPAITTED 246
Query: 263 YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
Y+ + K A L + D L KAGE + + + + N++L ++YGFV+ + +
Sbjct: 247 YAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPNRNSY 306
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTS 378
+ + DP + DK +A+ N F V G+ A ML YLRL + SD
Sbjct: 307 TLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQTLPA--GMLQYLRLVALGGSDAF 364
Query: 379 EMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
++S+ ++ G + PVS E + + D K+ L+G+ T+ EDE +L + L P+
Sbjct: 365 LLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSALSGFSTTIEEDEKLLEEGKLDPR 424
Query: 436 KRVATQLVRMEKKMLNACLQVTAD 459
+A ++ EK++L Q+ D
Sbjct: 425 LEMALKIRIGEKRVLQQIDQIFKD 448
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 96/430 (22%), Positives = 183/430 (42%), Gaps = 64/430 (14%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG L N K + A+ +++A + VP L
Sbjct: 81 KREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 135 LMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ +L + ++ + + R+Y N+L
Sbjct: 190 ----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKDS 245
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 246 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 305
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ + GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 306 ALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 365
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG-PICP 392
R G + VF +H E + +L +LR+ +++ + +I +LG P
Sbjct: 366 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFP 424
Query: 393 VSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 448
VS E +L + +AR L Y T+ ED++ L ++L +A +L EK+
Sbjct: 425 VSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKE 480
Query: 449 MLNACLQVTA 458
+L ++ A
Sbjct: 481 ILEKAVKSAA 490
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 168/421 (39%), Gaps = 63/421 (14%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ ++ KE R VA D+ AG +VP V
Sbjct: 47 EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++ L EL LAL+L E+ +G S W PY++ L
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTLAAN-------P 151
Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNEL------DTVWFMAGS--LFQK- 244
++PL W+E E A L GSP + +ER+ + EY + D F A + F +
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEVIKGDPTAFPAEAYEFFTEE 211
Query: 245 ---------------VSLARRFALVPL-------GPPLLAY---SSKCKAMLAAVD---D 276
+ A +ALVPL G P+ S+K A D D
Sbjct: 212 RFVDALATVCAKATWLPTASCYALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVD 271
Query: 277 AVQLVVDRPYKA-GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 333
+ +V+ KA S VV P + N +L +N G VD+ +P D L + + D +
Sbjct: 272 SACVVLSAVVKAPANSRVVQLDPLQRNNGELFLNTGRVDQKHPGDYLYMRTEIQPSDRLF 331
Query: 334 QDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 392
K+ V + G Q F V+ E + + YLR V D EM +V I
Sbjct: 332 SAKKQVLEGMGFTAENQYFPVY---EDRMPTQLYSYLRFARVQDPGEMMAVSFEEDKI-- 386
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
VS E +L L + ++ Y ++ +L + + + +RM +K L
Sbjct: 387 VSVMNEYEILQLLMGDCRELMSEYDTNEEDELNLLKLSDTMRVREIEAAKLRMSEKKLIG 446
Query: 453 C 453
C
Sbjct: 447 C 447
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 96/408 (23%), Positives = 159/408 (38%), Gaps = 65/408 (15%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A L +G VP L + E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +SF+ P+I + + P+ W+E EL L G+ + ++R++
Sbjct: 95 GGESFFAPFIASMPT-------TFDLPVFWTEAELNELKGTNVLLLTQLMKQHLERDFEN 147
Query: 231 L---------------------DTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKA 269
+ D +W M+ V +R F + G L C A
Sbjct: 148 IHQAVAADFPDIFASLPTLTIDDYMWAMS------VIWSRAFGVSKGGKYLHVL---CPA 198
Query: 270 M---------------LAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
M + ++ Q++ V AG ++ + G N+KLL +YGFV
Sbjct: 199 MDMFNHDVTVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQYSNAKLLYSYGFV 258
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 370
+N + + DP ++ K+ V N Q + H + + +L LR
Sbjct: 259 SPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLLSNDVDERLLATLR 318
Query: 371 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 430
+ +++ Q + I V E AV + L + +L+ Y TL EDEA+L +
Sbjct: 319 VILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYATTLEEDEAILAET 376
Query: 431 NLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYA 477
K R+A + VRME K QVT +I L S P A
Sbjct: 377 ETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPDA 418
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/378 (21%), Positives = 156/378 (41%), Gaps = 50/378 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQYA 218
Query: 230 EL----------------------DTVWFMAGSLFQKVSLAR------RFALVPLGPPLL 261
D W ++ + ++ + AL+PL
Sbjct: 219 YFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCN 278
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN +DR+
Sbjct: 279 HTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVK 338
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------G 372
++ ++ + Y K V R G + +F +H E + +L +LR+
Sbjct: 339 IKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRD 397
Query: 373 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
Y+ + + + PVS E + L L Y ED +ML +L
Sbjct: 398 YLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDL 457
Query: 433 HPKKRVATQLVRMEKKML 450
R+ +L EK++L
Sbjct: 458 SLHSRITIKLRLAEKEIL 475
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/390 (21%), Positives = 169/390 (43%), Gaps = 58/390 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E + + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYA 218
Query: 230 EL----------------------DTVWFMAGSLFQKVSLARR------FALVPLGPPLL 261
D W ++ + ++ + AL+PL
Sbjct: 219 YFYKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCN 278
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+
Sbjct: 279 HTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVK 338
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 381
++ ++ D Y K V R G + VF +H+ E + +L +LR+ +++ +
Sbjct: 339 IKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEEELKE 397
Query: 382 SVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLT 428
+I +LG P+S E +L + +AR L Y T+ +D+ L
Sbjct: 398 HLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDKLFLE 453
Query: 429 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
++L +A +L EKK+L ++ A
Sbjct: 454 THDLTSHATMAIKLRLGEKKILEKTVKSAA 483
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 96/432 (22%), Positives = 181/432 (41%), Gaps = 68/432 (15%)
Query: 73 KKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPN 130
K+E+ +L W +NG +V E+ K A+ D++A + VP
Sbjct: 74 KREDYFSELIKWAGENGAFTDGFEVANFEEEGFGLK--------ATRDIKAEELFLWVPR 125
Query: 131 SLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQR 187
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 KLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER-YNPNSFWLPYIQTLPNE- 182
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL---------------- 231
+ L + E E+ YL + +I + + R+Y
Sbjct: 183 ------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQYAYFYKVVQTHPNASKLPLK 236
Query: 232 ------DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQ 279
D W ++ + ++ + AL+PL + DD +
Sbjct: 237 DSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALIPLWDMCNHTNGLITTGYNLKDDRCE 296
Query: 280 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 339
V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 297 CVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 356
Query: 340 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG-PI 390
R G + VF +H+ E + +L +LR+ +++ + +I +LG
Sbjct: 357 LARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHTIDRIFTLGNSE 415
Query: 391 CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 446
PVS E +L + +AR L Y T+ +D+ +L +L +A +L E
Sbjct: 416 FPVSWDNEV----KLWTFLEARASLLLKTYKTTIHDDKFILETQDLTHNATMAIKLRLGE 471
Query: 447 KKMLNACLQVTA 458
K++L ++ A
Sbjct: 472 KEILEKAIKSAA 483
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 163/378 (43%), Gaps = 47/378 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 232 ------------------DTVWFMAGSLFQKVSLARRFAL--VPLGPPL----LAYSSKC 267
D +W A + + SL F + P + AY K
Sbjct: 188 QEILLPNKDLFSSRITLDDFIW--AFGILNRESLTSMFEFEQINHNPAIKTEDYAYEIKG 245
Query: 268 KAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
A L + D L KAGE + + + + N++L ++YGFV+ + + + +
Sbjct: 246 -AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEI 304
Query: 327 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI 384
DP + DK +A+ N F + G+ A ML YLRL + D ++S+
Sbjct: 305 PESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIF 362
Query: 385 SS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 441
++ G + PVS E + + D K+ L+G+ T+ EDE +L L P+ +A +
Sbjct: 363 NNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALK 422
Query: 442 LVRMEKKMLNACLQVTAD 459
+ EK++L Q+ D
Sbjct: 423 IRIGEKRVLQQIDQIFKD 440
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 114/252 (45%), Gaps = 44/252 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +T + +L L ++ +A LA ++ EKK G+ S
Sbjct: 65 ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L +Q G+L + + W+E+EL + S E +++ I++++ + +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176
Query: 235 W--------------FMAGS---LFQKVSL-----------ARRFALVPLGPPLLAYSSK 266
+ FM LF ++ AL+P L +
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVELPVGSRAWGSTNGLALIPFA-DFLNHDGV 235
Query: 267 CKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
+A++ + DD ++Q++ DR Y GE +++ G N+ L++++GF N YD+
Sbjct: 236 SEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQ 295
Query: 320 LVVEAALNTEDP 331
+ ++ + DP
Sbjct: 296 VQIQFDIPKHDP 307
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 132/294 (44%), Gaps = 48/294 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEK----HRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+ SW+ + G + + S + + HRP + AG+ +LV+T
Sbjct: 41 EFMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRP---------MCAGECMIKFSQNLVLT 91
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E+ L E IA L N+ + ++ L +M EK++G+ S W PYI L G+ +
Sbjct: 92 PEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYIECLP---SFGE--IH 142
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ-KVSLAR---RF 251
S + W ELA L SP ER ++ EY E+ V L+ VSL + +
Sbjct: 143 STIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKVVESCPHLYDPDVSLEQFKHEY 202
Query: 252 ALVP-----LGP-------PLLAYSS---KCKAMLAAVDDAVQLVV-DRPYKAGES---- 291
A V GP PL+ +++ + + + + DD +VV R Y+ G+
Sbjct: 203 ATVSSRAWGQGPHSDMTMIPLVDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHL 262
Query: 292 -IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 343
+ + G N+ L ++YGFV DNP+D + + +EDP + K + +AQ N
Sbjct: 263 KVHICYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 120/263 (45%), Gaps = 44/263 (16%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
HRP + AG+ LV+T E+ L E IA L N+ + ++ L +M
Sbjct: 72 HRP---------MCAGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
EK++G+ S W PYI L G+ + S + W ELA L SP ER ++
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173
Query: 227 EYNELDTVWFMAGSLFQ-KVSLAR---RFALVP-----LGP-------PLLAYSS---KC 267
EY E+ V L+ VSL + +A V GP PL+ +++ +
Sbjct: 174 EYREVKKVVESCPHLYDPDVSLEQFKHEYATVSSRAWGQGPHSDMTMIPLVDFANHDPRS 233
Query: 268 KAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ + + DD +VV R Y+ G+ + + G N+ L ++YGFV DNP+D
Sbjct: 234 RTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALDYGFVVPDNPFDEAE 293
Query: 322 VEAALNTEDPQYQDK-RMVAQRN 343
+ + +EDP + K + +AQ N
Sbjct: 294 IFLEIPSEDPLREIKLQYMAQNN 316
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 114/259 (44%), Gaps = 30/259 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
+ A+E+++ G++ +P S ++T+ER + + L E + LA +L +
Sbjct: 24 LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
S + Y+R L R+ G L W E ++ L GSP++ +ER +
Sbjct: 82 AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134
Query: 228 YNELDTVW--FMAGSL-------FQK-VSLARR---FALVPLGPPLLAYSSKCKAMLAAV 274
+E+ + G+L F + + L R ALVP +L + C A +
Sbjct: 135 IDEIRASFPQLTPGALRWAFDVLFSRLIRLPNRGGALALVPWAD-MLNHRPGCDAYIDDT 193
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQ 332
AV L DR YK GE + GP+P+S+LLI+YGF +NP D V ++ D
Sbjct: 194 GGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGENPDDEFEVVLGIDPNDRH 253
Query: 333 YQDKRMVAQRNGKLSVQVF 351
K +R G V+ F
Sbjct: 254 ADAKADALRRIGLSPVEAF 272
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 96/430 (22%), Positives = 182/430 (42%), Gaps = 64/430 (14%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W NG L N K + A+ +++A + VP L
Sbjct: 74 KREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 128 LMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL----- 231
++PL + E E+ +L + ++ + + R+Y N+L
Sbjct: 183 ----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPNANKLPLKDS 238
Query: 232 ----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 281
D W ++ + ++ + AL+PL + DD + V
Sbjct: 239 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECV 298
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
+ + GE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 299 ALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLA 358
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG-PICP 392
R G + VF +H E + +L +LR+ +++ + +I +LG P
Sbjct: 359 RAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFP 417
Query: 393 VSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 448
VS E +L + +AR L Y T+ ED++ L ++L +A +L EK+
Sbjct: 418 VSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKE 473
Query: 449 MLNACLQVTA 458
+L ++ A
Sbjct: 474 ILEKAVKSAA 483
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 125/277 (45%), Gaps = 36/277 (12%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 93 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF-MAGSLFQKVSL----- 247
+ S + WSE EL + S E + + I++++ + V + +LF+ +SL
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFKDISLKEFMH 204
Query: 248 ------------ARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIV 293
+ +L+P + + ++L +D +++ DR Y GE ++
Sbjct: 205 ACALVGSRAWGSTKGLSLIPFAD-FVNHDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVL 263
Query: 294 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
+ G PN+ LL+++GF N YD++ ++ + D
Sbjct: 264 IRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 300
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 127/283 (44%), Gaps = 41/283 (14%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 94 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQKVSL----- 247
+ S + WSE EL + S E + + I++++ + V + +LF+ +SL
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFKDISLKEFMH 205
Query: 248 ------------ARRFALVPLGPPLLAYSSKCKAMLAAVDD--------AVQLVVDRPYK 287
+ +L+P + + ++L +D ++++ DR Y
Sbjct: 206 ACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLSESSSTLEVIADRNYA 264
Query: 288 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
GE +++ G PN+ LL+++GF N YD++ ++ + D
Sbjct: 265 PGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 307
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 165/397 (41%), Gaps = 75/397 (18%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S++++ GD S+P VT + +A L+ + EL LAL+L EK + K S
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNEL--- 231
W PY++ L+ S L ++E E L G+ E +R + K EY L
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232
Query: 232 ----------DTVWFMAGSLF-----------QKVSLARRFALVPL--------GPPLLA 262
D F+ S F Q ++ A +A+VPL PP ++
Sbjct: 233 IAEDPGKYPLDVYAFLTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPICGAPPPVS 292
Query: 263 -------------------YSSKCKAMLAAVDDA-VQLVVDRPYKAGESIV-VWCGPQPN 301
+ +C VD A V L + AG I+ + + N
Sbjct: 293 PEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQIDHSVRNN 352
Query: 302 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKE 360
S+L +++G VD+ +P D L+ DP Y K+ V + G Q F V+ R
Sbjct: 353 SELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQGFADKGQTFPVYKDRMPR 412
Query: 361 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 420
+ L YLR V+++ E+ +V + + VSP E L L + R++ Y T
Sbjct: 413 ---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQLLMADCRDRMSAYD-TN 466
Query: 421 SEDEAMLTDY-NLHPKKRVATQLVRMEK----KMLNA 452
EDE +L ++ K R A++L R EK +M+NA
Sbjct: 467 EEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 103/244 (42%), Gaps = 33/244 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
+ A+E ++ G+ +P ++T+E L + E +L E + LA +L
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
E K + Y++ L R G S L W E+++ L GSP+ LER +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230
Query: 225 KR----------EYNELDTVW---FMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAM- 270
E NE W + L + SL ALVP +L + C+A
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDILFSRLIRLESLGGNLALVPWAD-MLNHQPGCEAFI 289
Query: 271 -LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALN 327
L V L DR Y+ GE + G +P+S+LLI+YGF DNP D + ++
Sbjct: 290 DLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFAPAVGDNPDDEYALNLQID 349
Query: 328 TEDP 331
EDP
Sbjct: 350 EEDP 353
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 100/422 (23%), Positives = 174/422 (41%), Gaps = 72/422 (17%)
Query: 93 CKVILKEKPSHNEKHRPIHYVAASEDL--------QAGDAAFSVPNSLVVTLERVLGNET 144
+ I SH + + SE L AGD +VP S+ + L V N +
Sbjct: 43 VQTIWSWAQSHGIQGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ-NSS 101
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ + L +AL+L++E S W PY+ L + +++SPL WS+ E
Sbjct: 102 LGK--ACEGLKPWVAVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEE 151
Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL--------DTVWFMAGSLFQKVSLA------RR 250
LA L G+ + E ++ EYN L + ++ A F A R
Sbjct: 152 LAELVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVYTFDGFKWAFGILRSRT 211
Query: 251 F--------ALVPL------------GPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKA 288
F ALVP+ G P + +S+ + D + + + A
Sbjct: 212 FSPLTGEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSA 271
Query: 289 GESIVVWCGP-QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMV 339
GE +++ G + N+ L ++YGFV+ D D L + ++ +D DK +
Sbjct: 272 GEQVLMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADI 331
Query: 340 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVS 394
+ NG F + G+ +M+ +LRL +S D+ ++++ + G + PVS
Sbjct: 332 LEINGFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVS 389
Query: 395 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
E A+ + + KA L GY T+ +D +L +L + +A +VR+ +K + L
Sbjct: 390 RDNEEALCTSMLEGLKAALDGYSTTVEQDMELLARGDLSTRMEIAV-VVRLGEKRVMQEL 448
Query: 455 QV 456
Q
Sbjct: 449 QT 450
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 90/395 (22%), Positives = 166/395 (42%), Gaps = 56/395 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + + + VP S ++TLE + +T +A+ + +L L+ L+ +L+ E
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE--MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQE 247
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
KK + SFW PY+ L + P+ +++++L +L GSP ++ ++ +K++
Sbjct: 248 KK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKD 300
Query: 228 YNELDTVW--FMAGSLFQKVSLARRFA----------------LVPLGPPL--------- 260
Y ++ V F+ S F + AR A VPL L
Sbjct: 301 YCDICQVAPEFLQNS-FDEFCWARMTASSRIFGINIKGVKTDAFVPLADMLNHKRPKLTS 359
Query: 261 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
YS + + + D+ ++ G+ I G + NS+ L+NYGFV +DN + +
Sbjct: 360 WCYSDERQGFIIETDENIE--------KGQMIFDSYGSKCNSRFLLNYGFVVDDNNANEV 411
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
V + Q K +++ + + F + + SD + ++R + + E
Sbjct: 412 NVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPNDVSFSDFMSFIRFILIQEEKEF 470
Query: 381 QSVISSLGPICP-----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
+++ I P +S E A + + + L YP TL +D +L L
Sbjct: 471 ANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALNQYPTTLEQDLEILKICELTTN 530
Query: 436 KRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 470
+R L EKK+LN Q + M L + S
Sbjct: 531 QRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 165/386 (42%), Gaps = 55/386 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 232 ------------------DTVWFMAGSLFQKVSLAR----RFALVPLGPPL--------- 260
D +W A + + + +R L+PL +
Sbjct: 188 QEILLPNKDLFSSRITLDDFIW--AFGILKSRAFSRLRGQNLVLIPLADLINHNPAIKTE 245
Query: 261 -LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 318
AY K A L + D L KAGE + + + + N++L ++YGFV+ + +
Sbjct: 246 DYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRN 304
Query: 319 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--D 376
+ + DP + DK +A+ N F + G+ A ML YLRL + D
Sbjct: 305 SYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPD 362
Query: 377 TSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
++S+ ++ G + PVS E + + D K+ L+G+ T+ EDE +L L
Sbjct: 363 AFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLE 422
Query: 434 PKKRVATQLVRMEKKMLNACLQVTAD 459
P+ +A ++ EK++L Q+ D
Sbjct: 423 PRLEMALKIRIGEKRVLQQIDQIFKD 448
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 156/377 (41%), Gaps = 54/377 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ G+ VP + + V +E I + + L +AL+L+ EK +
Sbjct: 87 LVAKRDIAKGETVLQVPKRFWINPDAVAESE-IGNVCS--GLKPWISVALFLLREKWR-D 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L + + +S + WSE EL+ + G+ + + + ++ E+ +++
Sbjct: 143 DSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVE 195
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R LVP
Sbjct: 196 EEVILRNKQLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTEDH 255
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
A+ + A L + D L KAG+ + + + + N+ + ++YGF++ + D
Sbjct: 256 AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSNADMALDYGFIEPSSARDAF 315
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+ ++ D Y DK +A+ NG F + G+ M+PYLRL + T
Sbjct: 316 TLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT--MIPYLRLVALGGTDAF 373
Query: 381 -------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
SV LG PVS E + + D K+ L+GY T+ EDE ++ + NL
Sbjct: 374 LLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSGYHTTIEEDEKLMEEGNLS 431
Query: 434 PKKRVATQLVRMEKKML 450
+ ++A + EK++L
Sbjct: 432 TRLQIAVGIRLGEKRVL 448
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/393 (21%), Positives = 163/393 (41%), Gaps = 54/393 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A + + A + +P S ++TLE + T+A+ + +L L+ L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ SFW PYI L P+ ++ ++L +L GSP +I ++ ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYPSF------PIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276
Query: 229 NEL-DTVWFMAGSLFQKVSLARRFA----------------LVPLG-------PPLLA-- 262
N++ + V F + AR A VPL P L +
Sbjct: 277 NDICNVVPEFTQYQFHEFCWARMTASSRIFGININGVKTDAFVPLADMLNHKRPKLTSWC 336
Query: 263 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
YS + + + D+ ++ G+ I G + NS+ +NYGFV E N + + +
Sbjct: 337 YSDEKQGFIIETDEKIE--------RGQMIFDSYGRKCNSRFFLNYGFVVEGNDANEVNL 388
Query: 323 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 382
+ DP Q K + + + + F + ++ A+ D + ++R + D ++++
Sbjct: 389 AVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDETAVIDFMSHIRFLVIRDEAQLKL 447
Query: 383 VISSLG-------PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
+++ P+ E + + K L YP T +D+ +L L
Sbjct: 448 LLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKTLKQYPTTFEQDQEILQICELTTN 507
Query: 436 KRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 468
+R L EK++L Q + M LL +
Sbjct: 508 QRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540
>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
Length = 106
Score = 70.1 bits (170), Expect = 2e-09, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPI 110
+LKE+PS + KH+PI
Sbjct: 91 MLKERPSPDGKHKPI 105
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 110/247 (44%), Gaps = 29/247 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ G+ V L++T R+ + E + ++E + LAL+ + K GK S W P
Sbjct: 70 IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L RG +++ + W + EL L S + R I +++ + V
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFDLVQAVVNKY 181
Query: 239 GSLFQK------------VSLARRFALVPLGP-------PLLAYSSKCKAMLAAVDDA-- 277
LF + V+ +R + + LG + + S +A+LA ++
Sbjct: 182 PELFGETVTLESFKHAYCVASSRSWGVEALGSITMVPFVDMFNHDSSARALLAYYEEEGY 241
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
++V D+ Y G +V+ G PNS L +++GF DNP+D + + + DP +K
Sbjct: 242 AEVVADKDYNQGSQVVITYGTLPNSSLALDFGFTLPDNPHDEVQIWMEAPSGDPLRAEKL 301
Query: 338 MVAQRNG 344
+ + +G
Sbjct: 302 KLLRDHG 308
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 142/333 (42%), Gaps = 51/333 (15%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL-- 231
++PL + E E+ YL + ++ + + R+Y N+L
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRVIQTHPHANKLPL 235
Query: 232 -------DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAV 278
D W ++ + ++ + AL+PL + DD
Sbjct: 236 KDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRC 295
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 338
+ V R + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 296 ECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAE 355
Query: 339 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 371
V R G + VF +H E + +L +LR+
Sbjct: 356 VLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/384 (22%), Positives = 162/384 (42%), Gaps = 51/384 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R L+P +
Sbjct: 209 EEILLRHKDLFPSRITLDDFFWAFGILRSRAFSRLRGQNLVLIPFADLVNHSANVTTEEH 268
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
A+ K A L + D L KAG+ + + + + N+ L ++YGF+++ + +
Sbjct: 269 AWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAY 328
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 379
+ + D + DK +A+ NG F + E+ MLP+LRL + T
Sbjct: 329 TLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAF 386
Query: 380 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
++S+ S G + PVS E + + + +A L+GY T+ EDE L + NL +
Sbjct: 387 LLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSR 445
Query: 436 KRVATQLVRMEKKMLNACLQVTAD 459
R+A + EK++L +Q+ D
Sbjct: 446 LRIAVGIREGEKRVLQQIIQIFKD 469
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 29/193 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQKVSL-----------ARRFAL----VPLGPPLLAY- 263
R E I E+ ++ + LF KVSL +R A+ + P+L +
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGKVSLEDFKHVYATVFSRSLAIGEDSTLVMIPMLDFF 228
Query: 264 ----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
+S K + + + DR Y + I + G N++L ++YGF +NPYD
Sbjct: 229 NHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAELALDYGFTVPENPYD- 287
Query: 320 LVVEAALNTEDPQ 332
E L T+ P+
Sbjct: 288 ---ETDLLTQFPE 297
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 87/384 (22%), Positives = 162/384 (42%), Gaps = 51/384 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+A +++L + VP + + V +E + L +AL+L+ E +G
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y+ L ++ +S + WSE ELA + G+ + L E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R L+P +
Sbjct: 209 EEILLRHKDLFPSRITLDDFFWAFGILRSRAFSRLRGQNLVLIPFADLVNHSANVTTEEH 268
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
A+ K A L + D L KAG+ + + + + N+ L ++YGF+++ + +
Sbjct: 269 AWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAY 328
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 379
+ + D + DK +A+ NG F + E+ MLP+LRL + T
Sbjct: 329 TLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAF 386
Query: 380 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
++S+ S G + PVS E + + + +A L+GY T+ EDE L + NL +
Sbjct: 387 LLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSR 445
Query: 436 KRVATQLVRMEKKMLNACLQVTAD 459
R+A + EK++L +Q+ D
Sbjct: 446 LRIAVGIREGEKRVLQQIIQIFKD 469
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/266 (21%), Positives = 118/266 (44%), Gaps = 42/266 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQKVSL------------------------ARRFALVPLGPPLLAYSSKCKAM 270
F+A + Q + ++R +L+P + +
Sbjct: 182 -FVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIPFADFMNHDGLSASIV 240
Query: 271 LAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
L D+ + ++ DR Y G+ + + G N+ L++++GF N +D + ++ + +
Sbjct: 241 LRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPND 300
Query: 330 DPQYQDKRMVAQRNGKLSVQ---VFH 352
DP K + Q + +V+ +FH
Sbjct: 301 DPLRNMKLGLLQTHHTRTVKDINIFH 326
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 100/431 (23%), Positives = 166/431 (38%), Gaps = 109/431 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ F++P ++ +E GN+ E + L L L
Sbjct: 44 IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++YE QG S W Y L + ++ + W +L YL GS ++I +
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156
Query: 223 GIKREYNELDTVWFMAGSLF-QKVSLARRFALVPL----GPPLLAYS------------- 264
L + AG F Q VS L+ L G ++AY+
Sbjct: 157 DEMFRSRILPVISANAGIFFPQGVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQED 216
Query: 265 ------SKCKAMLAAV-------------------DDAVQLVVDRPYKAGESIVVWCGPQ 299
+ K ML V +D + + RP KAGE I+ + GP
Sbjct: 217 EEWVEDREGKTMLGMVPMADILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYYGPH 276
Query: 300 PNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK--RMVAQRNGKLSVQ 349
PNS+LL YG+V + P+D +V++ L TE + D+ + VA+ ++
Sbjct: 277 PNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVWKQVAEHVDPEDLE 333
Query: 350 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---------------GPICP-- 392
V E S+ G++ +++Q V + L G + P
Sbjct: 334 DVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLKAIKKVRGDLIPDK 386
Query: 393 -----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 447
V C+ + L +L LA YP T EDEAML N+ ++++A ++ EK
Sbjct: 387 RKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTSRQKLAVEVRLGEK 439
Query: 448 KMLNACLQVTA 458
+++ LQV
Sbjct: 440 RLIKEALQVAG 450
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 151/343 (44%), Gaps = 52/343 (15%)
Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
VP L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L
Sbjct: 45 VPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLP 102
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQK 244
+ ++PL + E E+ L + ++ + + R+Y + F K
Sbjct: 103 SE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----------AYFYK 144
Query: 245 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 304
V + G L DD + V + ++AG+ I ++ G + N++
Sbjct: 145 V------IQITTGYNL-------------EDDRCECVALQDFQAGDQIYIFYGTRSNAEF 185
Query: 305 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 364
+I+ GF ++N +DR+ ++ ++ D Y K V R G + VF +H+ E +
Sbjct: 186 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQ 244
Query: 365 MLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAG 415
+L +LR+ +++ + + I +LG PVS E + L D L
Sbjct: 245 LLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKT 304
Query: 416 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
Y T+ ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 305 YKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 347
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 146/345 (42%), Gaps = 61/345 (17%)
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
A +P +L VT V +E +A L EL LAL+LM E+++G++S W P++ L
Sbjct: 2 ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59
Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
A SP+LW E + L SPT E R +++E++ +
Sbjct: 60 PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD----------VIA 102
Query: 243 QKVSL--ARRFA-------LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 293
Q+++ ARRF+ + LG P + +L A+ D
Sbjct: 103 QRIATGDARRFSGGDELKLWITLGSP--GWGGTSDKLLMAIYDG---------------- 144
Query: 294 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 353
+PN +L + G V++DN D L V L D + K+ + + G VQ F +
Sbjct: 145 -----RPNGELAMATGRVEDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPI 199
Query: 354 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 413
R + +L YLRL ++D + + V S ++P E VL L + RL
Sbjct: 200 FRDR---MPTQLLAYLRLARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRL 254
Query: 414 AGYP----ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
Y + ED +L L ++R+A +L + EK +L L
Sbjct: 255 TSYAGMHMGSAEEDVKLLQRPGLTAQERLAARLRKAEKAILQGTL 299
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 105/447 (23%), Positives = 181/447 (40%), Gaps = 55/447 (12%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V+ R +SK++ +L W+ NG K+ L+E + + R +H + + L
Sbjct: 18 TPVSPPRNGMSKEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVP 71
Query: 122 GDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFW 176
G+ +P ++T+E +G +T I L + +A L ++L+ + + + SF+
Sbjct: 72 GERILVIPKKCLITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFF 129
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV-- 234
Y L P+ WSE EL++L GS +I ER I+++Y+ + V
Sbjct: 130 RNYYSTLPSTLS------NMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRVDP 183
Query: 235 WFMAGSLFQKVSLARRF----------------ALVPLGPPLLAYSSK-CKAMLAAVDDA 277
F SL + S AR ALVP L Y + DA
Sbjct: 184 SFARFSL-DRFSWARMIVCSRNFGLTIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDA 242
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDP 331
+ G + G + N + L+NYGF EDN + ++++ L+ D
Sbjct: 243 FTITSLGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSPADG 302
Query: 332 Q-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 390
Q + DKR +G ++ + + + RL + T E S + P
Sbjct: 303 QLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEEEFSTMKMKSPA 359
Query: 391 C---PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRM 445
P+S E L+ L + +L+ Y T+ ED +L Y L + A +R
Sbjct: 360 HSSPPISFDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRG 419
Query: 446 EKKMLNACLQVTADMIMLLPDVTVSPC 472
EK++ Q AD ++ L + ++ C
Sbjct: 420 EKQVCRY-FQELADKVIPLFSLPLAEC 445
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/413 (21%), Positives = 161/413 (38%), Gaps = 58/413 (14%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W NG V +K+ P + A E ++ + VP L++T
Sbjct: 80 WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF---MAGSLFQKVSLARRF----- 251
+S EL L GS E L+ I R+Y+ ++ +A SL K
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKIFRTHPLAKSLPYKDCFTYDLYRWAV 244
Query: 252 ----------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 283
A+VPL K + ++
Sbjct: 245 SAVMTRQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLRCYAM 304
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
R ++ G+ + ++ G + N++ I+ GFV +N +D + ++ ++ +DP Y K + +
Sbjct: 305 RDFEKGQEVTIFYGRRTNAEFFIHNGFVFPENRHDSVDIKLGISKQDPLYAVKAKLCDDH 364
Query: 344 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 403
+F + RE+ D+ +LR+ + D S+ S I + R L+
Sbjct: 365 ELTPSGIFAL-VPRERPVCEDLSTFLRILVLKDASQAASFTDE--HIMVATDDNAREALN 421
Query: 404 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
L + L +P + E E ++ D + + ++A QL +E+K+L A L+
Sbjct: 422 FLIVRIQLLLRAFPKSDQEYENIIADEGSNGRLKMAAQLRLLERKILTAVLET 474
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 91/196 (46%), Gaps = 23/196 (11%)
Query: 280 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 337
+ R + AGE +++ GP+ N LL YGFV++DNP D + ++ D +D R
Sbjct: 40 VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99
Query: 338 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 382
++ + GKL V V GR EKE ++P RL V D E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158
Query: 383 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN--LHPKKRVA 439
SL +SP E A D L G+ TL+EDEA L+ L +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218
Query: 440 TQLVRMEKK-MLNACL 454
RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 161/379 (42%), Gaps = 50/379 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A +D++A + ++P L++T E ++ L+ +++ ++ LAL+++ EK
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTETA-RESSLGPLIKKDRILQVMANVSLALHVLCEK- 178
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFW PYI +PL + E E+ +L GS +++L + + I R+Y
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQYA 231
Query: 230 EL----------------------DTVWFMAGSLFQK----VSLARRF--ALVPLGPPLL 261
+ W ++ + ++ S R AL+P+
Sbjct: 232 YFYKLFQTQPEAAGLPLKECFTFDEYRWAVSTVMTRQNQVPTSDGRHLITALIPMWDMCN 291
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ + D+ + + R + + ++ G + N++ LI+ GFV +N +DR+
Sbjct: 292 HSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAEFLIHNGFVYPENVHDRVN 351
Query: 322 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL-----GYVSD 376
V+ ++ D + K V R G + F VH G++ ++L +LR+ G + D
Sbjct: 352 VKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPPELLVFLRVFTMVEGDLRD 410
Query: 377 --TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
TSE QS +S LG C V+ E L + Y ++ + E L ++
Sbjct: 411 LLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIRSYRTSIKDVETELQAPDM 470
Query: 433 HPKKRVATQLVRMEKKMLN 451
R A QL E ++L+
Sbjct: 471 TYHSRAALQLKLAEMQILS 489
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 99/429 (23%), Positives = 171/429 (39%), Gaps = 55/429 (12%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ NG K+ L+E + + R +H + + L G+ +P ++T+E
Sbjct: 44 ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95
Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+G +T I L + +A L ++L+ + ++ + SF+ Y L
Sbjct: 96 MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV--WFMAGSLFQKVSLARRF- 251
P+ WS+ EL +L GS +I ER I+++Y+ + V F SL + S AR
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRVDPAFARFSL-DRFSWARMIV 208
Query: 252 ---------------ALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVW 295
ALVP L Y + DA + G +
Sbjct: 209 CSRNFGLTIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDS 268
Query: 296 CGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSV 348
G + N + L+NYGF EDN + ++++ L+ D Q + DKR +G ++
Sbjct: 269 YGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKRAYLHESGIYTM 328
Query: 349 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---PVSPCMERAVLDQL 405
+ + + RL + T + S + P P+S E L L
Sbjct: 329 DA-RLSCSHSDANTREGFSFARL--IVATEDEFSSMKMKSPAHSSPPISFDNEIRALQYL 385
Query: 406 ADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIML 463
D +L+ Y T+ ED +L Y L + A +R EK++ Q AD ++
Sbjct: 386 RDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCRY-FQELADKVIQ 444
Query: 464 LPDVTVSPC 472
L + ++ C
Sbjct: 445 LFSLPLAEC 453
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 101/226 (44%), Gaps = 26/226 (11%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 303
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 503
Query: 304 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 363
L ++ GFV E+N YD + ++ DP Q + + GKLS+ + R+ +
Sbjct: 504 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 560
Query: 364 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 415
D +L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 561 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 619
Query: 416 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 620 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 665
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 154/382 (40%), Gaps = 53/382 (13%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+ + A +L G+ VP L + + V + + L ++L ++ E
Sbjct: 39 PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+
Sbjct: 99 ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151
Query: 229 NEL-------------------DTVWFMAGSLFQKVS---LARRFALVPLGPPL------ 260
+ + D +W G L +V + AL+P +
Sbjct: 152 DNVEAGIINVNKDLFPGTITFDDFLWAF-GVLRSRVFPELRGDKLALIPFADLINHNGDI 210
Query: 261 ----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDN 315
+ K K L D L K+GE I V + + N++L ++YGF + ++
Sbjct: 211 TSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNS 269
Query: 316 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 375
D + ++ DP Y+DK +A+ NG F V G + M+ YLRL +
Sbjct: 270 SRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLG 327
Query: 376 DTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 428
T V L PVS E ++ + + K+ LA Y T+ EDE +L
Sbjct: 328 GTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLE 385
Query: 429 DYNLHPKKRVATQLVRMEKKML 450
+L + ++A ++ EKK+L
Sbjct: 386 REDLQSRHQIAVEVRVGEKKVL 407
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 75/337 (22%), Positives = 145/337 (43%), Gaps = 46/337 (13%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREY-------------NEL---------DTVWFMAGSLFQKVSLARR------ 250
+ + R+Y N+L D W ++ + ++ +
Sbjct: 80 SQYKNTARQYAYFYKVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVT 139
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
AL+PL + DD + V + ++AG+ I ++ G + N++ +I+ GF
Sbjct: 140 LALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGF 199
Query: 311 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 370
++N +DR+ ++ ++ D Y K V R G + VF +H+ E + +L +LR
Sbjct: 200 FFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLR 258
Query: 371 LGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 421
+ +++ + + I +LG PVS E + L D L Y T+
Sbjct: 259 VFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 318
Query: 422 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 319 EDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 91/430 (21%), Positives = 170/430 (39%), Gaps = 65/430 (15%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW----FMAGSLF 242
+ L + ++ L GSPT L++ I R+Y + ++ ++
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQYAYFNKLFQKNNNAVSAIL 236
Query: 243 QKVSLARRF------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAV 278
+ V ++ AL+P+ SK A +
Sbjct: 237 RDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIHALIPMWDMCNHEDSKITTDFNATLNCC 296
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 338
+ R +K E I + GP+ NS ++ GFV DN D + ++ DP ++++
Sbjct: 297 ECYALRDFKKAEQIFISYGPRTNSDFFVHSGFVYMDNEQDGFKLRLGISKADPLHKERVE 356
Query: 339 VAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSP 395
+ + +V F + G E ISD +L +LR+ + E+ I S + + +
Sbjct: 357 LLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDC 413
Query: 396 CMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
+E V + + + RL A YP TL ED +L + L K++A QL EK++L
Sbjct: 414 ALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQ 472
Query: 452 ACLQVTADMI 461
L+ I
Sbjct: 473 GALEYVQQWI 482
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 101/226 (44%), Gaps = 26/226 (11%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 303
+AL+PL C + A V+DR +KAGE + ++ G + N+
Sbjct: 14 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 66
Query: 304 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 363
L ++ GFV E+N YD + ++ DP Q + + GKLS+ + R+ +
Sbjct: 67 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 123
Query: 364 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 415
D +L +LR+ + + ++ I+S LG I C + +E L K L+
Sbjct: 124 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 182
Query: 416 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
Y TL EDE +L + P + +A ++ EK+++ L+ I
Sbjct: 183 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/238 (21%), Positives = 109/238 (45%), Gaps = 31/238 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP ++ +T + + + ++ T+++ + LA L+ EKK+G+KS
Sbjct: 75 ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN----- 229
W+PYI L + + S + W E E + + S E +++ I++E++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFSFVAQA 186
Query: 230 ---------------ELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 274
+ + + GS + S + +L+P + +L+
Sbjct: 187 FKQHYPMVIERPYLEDFMYAYALVGSRAWETS--KGISLIPFADFMNHDGLSASIVLSDE 244
Query: 275 DDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 331
D+ + ++ DR Y G+ + + G N+ L++++GF N +D + ++ + +DP
Sbjct: 245 DNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTVPYNIHDEVQIQMDVPNDDP 302
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/337 (21%), Positives = 142/337 (42%), Gaps = 46/337 (13%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREY-------------NEL---------DTVWFMAGSLFQKVSLARR------ 250
+ + R+Y N+L D W ++ + ++ +
Sbjct: 80 SQYKNTARQYAYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVT 139
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
AL+PL + DD + V + ++AG+ I ++ G + N++ +I+ GF
Sbjct: 140 LALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGF 199
Query: 311 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 370
++N +DR+ ++ ++ D Y K V R G + VF +H E + +L +LR
Sbjct: 200 FFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLR 258
Query: 371 LGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 421
+ +++ S + + + PVS E + L D L Y T+
Sbjct: 259 VFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 318
Query: 422 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 319 EDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 102/438 (23%), Positives = 169/438 (38%), Gaps = 106/438 (24%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
+ A +D+ A F++P ++ +E + + ++ + +L + L L
Sbjct: 610 IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------A 215
LMYE QG+KS W PY L + ++P+ WSE+EL L S + A
Sbjct: 670 VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADA 722
Query: 216 EILERAE-------------GIKREYNELDTVWFMAGSLFQKVSLARR------------ 250
E + R G R ++L + GS +
Sbjct: 723 ESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDG 782
Query: 251 ----------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 300
+VP+ +L ++ A + ++++ + RP KAGE I + GP P
Sbjct: 783 WVEDREGKSMMGMVPMA-DILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIFNYYGPHP 841
Query: 301 NSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV--HAGR 357
NS+LL YG+V E + YD VVE + + V + N +S QV H
Sbjct: 842 NSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VMRLNFGISGQVLEKLRHGLE 892
Query: 358 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-----------------ICPVSPCMERA 400
E+E D R +T E+ S + GP + V A
Sbjct: 893 EEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPEDLQEQLKTFLKGVKKAQPEA 947
Query: 401 VLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+ D+ LA +A + YP + SED +L +L + R+A ++ EKK+L
Sbjct: 948 IPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQDLSQRTRMAIEVRLGEKKLL 1007
Query: 451 NACLQVTA--DMIMLLPD 466
+ T+ D+ M + D
Sbjct: 1008 QEAIASTSSVDVEMTVDD 1025
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 113/273 (41%), Gaps = 44/273 (16%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
+E + K E +L SW+ N + LK +HN + + +QA +
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
S+P V+TL+ V + ++LT K ++L A AL+L+ E+K+ S + YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKILT-EKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW--FMA 238
L P L+SE EL YL G+ + E+ E IK +Y + V F +
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQVIPEFKS 303
Query: 239 GSLFQKV------SLARRFA----------LVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 282
F++ S +R F +VPL L S + DDA
Sbjct: 304 EFSFEQFRWAFLCSHSRVFGIKVKGVKTSVMVPLADMLNHKHSGQEDSEWVFDDATNCFT 363
Query: 283 DRPYKA---GESIVVWCGPQPNSKLLINYGFVD 312
+ K + I G + NSKL +NYGFVD
Sbjct: 364 VKALKKIQRNQQIHFSYGSKCNSKLFLNYGFVD 396
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/331 (22%), Positives = 142/331 (42%), Gaps = 47/331 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 12 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 66 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-------------NEL---- 231
++PL + E E+ L + ++ + + R+Y N+L
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKD 176
Query: 232 -----DTVWFMAGSLFQKVSLARR------FALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
D W ++ + ++ + AL+PL + DD +
Sbjct: 177 SFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCEC 236
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K V
Sbjct: 237 VALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVL 296
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 371
R G + VF +H E + +L +LR+
Sbjct: 297 ARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/337 (21%), Positives = 142/337 (42%), Gaps = 46/337 (13%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 10 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61
Query: 219 ERAEGIKREY-------------NEL---------DTVWFMAGSLFQKVSLARR------ 250
+ + R+Y N+L D W ++ + ++ +
Sbjct: 62 SQYKNTARQYAYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVT 121
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
AL+PL + DD + V + ++AG+ I ++ G + N++ +I+ GF
Sbjct: 122 LALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGF 181
Query: 311 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 370
++N +DR+ ++ ++ D Y K V R G + VF +H E + +L +LR
Sbjct: 182 FFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLR 240
Query: 371 LGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 421
+ +++ S + + + PVS E + L D L Y T+
Sbjct: 241 VFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 300
Query: 422 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
ED+ +L + +L + +A +L EK++L ++ A
Sbjct: 301 EDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 108/247 (43%), Gaps = 28/247 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
Query: 235 W---------FMAGSLFQKVSL--------ARRFALVPLGPPLLAYSSKCKAMLAAVDD- 276
+ F +L + +L+P L + +A++ + DD
Sbjct: 178 FQPFCQSFGDFTWKDFMHACTLVGSRAWGSTKGLSLIPFAD-FLNHDGISEAIVMSDDDN 236
Query: 277 -AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 335
++ DR Y GE +++ G N+ L++++GF N YD++ ++ + DP
Sbjct: 237 KCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQVQIQYDIPKYDPLRHT 296
Query: 336 KRMVAQR 342
K + Q+
Sbjct: 297 KLELLQQ 303
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 123/292 (42%), Gaps = 55/292 (18%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+ D +L WM +NG K L + + E R + A++ Q GD S+P L++
Sbjct: 28 DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81
Query: 135 TLERVLGNE--TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + VL + + T KLS + + YL+ E+ + K SFW YI+ L +
Sbjct: 82 TTQNVLSSYLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------- 134
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ--------- 243
+ +P+ ++ E+ +L K ++ + E I Y EL ++ + S F
Sbjct: 135 SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFKGIFEYS 193
Query: 244 ----------------------KVSLAR-RFALVPLGPPLLAYSS--KCKAMLAAVDDAV 278
+S+ R +AL P LL +++ + KA V
Sbjct: 194 AFRWAWCTVNTRSVYMLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYNPVSKCY 252
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
++ K + + ++ GP N KL I YGFV N ++ VVE L+ ED
Sbjct: 253 EIFTCTACKKYDQMFIYYGPHDNVKLFIEYGFVLPQNQHN--VVE--LDFED 300
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 91/421 (21%), Positives = 173/421 (41%), Gaps = 58/421 (13%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW+ +NG V +K NE + + A++DL+ + ++P +++
Sbjct: 81 REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134
Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+TL+ V + L N + +A LAL+L+ E + + SFW YI L
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--------NELDTVWFMAGSL 241
+ L + + + SPT + + I R+Y N D + ++
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQYAYFNNLFQNSNDEASLILRNV 245
Query: 242 FQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV---------------QLVVDR-- 284
F + V + SS + ++ + Q VDR
Sbjct: 246 FTYQLYRWAVSTVMTRQNFIPSSSTSNDVENGINGLIPLWDMCNHTNGYLSTQYKVDRSE 305
Query: 285 -----PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 339
P+K GE ++++ G + NS L++ GFV ++NP+D + ++ D + + +
Sbjct: 306 CLACKPFKKGEQVLIFYGERSNSDFLVHNGFVYDENPHDSFRLRLGISKSDKLHGLRCEL 365
Query: 340 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY--VSDTSEMQSVISSLGPI----CPV 393
+ G F++++G E ++L +LR+ V + + +S S L + C +
Sbjct: 366 LKDLGIPDSGDFYLYSGSEP-VRENLLAFLRIFNMDVENLNHWKSHSSRLSDLMWKDCAL 424
Query: 394 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 453
+E V + L D L Y E E + D N +R+ ++ EKK L++
Sbjct: 425 DTKIESKVWNFLYDRINLLLKTYKG--DEVEVRVEDSNSTECRRLVRAQLKCEKKFLSSI 482
Query: 454 L 454
L
Sbjct: 483 L 483
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 68/309 (22%), Positives = 134/309 (43%), Gaps = 48/309 (15%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 278
+ + R+Y ++ + + + L DD
Sbjct: 80 SQYKNTARQY----------AYFYKVIQITTGYNLE--------------------DDRC 109
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 338
+ V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y K
Sbjct: 110 ECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAE 169
Query: 339 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI 390
V R G + VF +H+ E + +L +LR+ +++ + + I +LG
Sbjct: 170 VLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNA 228
Query: 391 -CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 449
PVS E + L D L Y T+ ED+ +L + +L + +A +L EK++
Sbjct: 229 EFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEI 288
Query: 450 LNACLQVTA 458
L ++ A
Sbjct: 289 LEKAVKSAA 297
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 114/306 (37%), Gaps = 47/306 (15%)
Query: 52 RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
RF R S+ VS + L++W+ K C + L+ P
Sbjct: 49 RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
A + G VP ++T E + + L+ + L+ + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIKRE--- 227
+ SFW PYI L ++ +L P+LWS+ +L GSP ++ ER I RE
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223
Query: 228 ---------------------YNELDTVWFMAGSLFQKVSLARR-------------FAL 253
E W L + SL + AL
Sbjct: 224 AMLLAGAGRLTPRGNEGEPISITETAVQWAATMLLSRSFSLNLQTQKLRPGSFAEDTIAL 283
Query: 254 VPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPN-SKLLINYG 309
VP L SS + D L R Y GE + GP + S+LL++YG
Sbjct: 284 VPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQVFDSYGPSCSPSRLLLDYG 343
Query: 310 FVDEDN 315
FVDE+N
Sbjct: 344 FVDEEN 349
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 99/427 (23%), Positives = 176/427 (41%), Gaps = 70/427 (16%)
Query: 68 REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
RE V+ E D+ +L W+ G K++L++ + + R +H +L G+
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158
Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
+P + ++T+E +G +T I + + + + +A L LYL+ + ++ +F+ Y
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P+ WS+ EL++L GS +I ER I+++Y+ + A
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAI----CRADP 266
Query: 241 LFQKVSL-----------ARRF----------ALVPLGPPLLAYSSKCKA-MLAAVDDAV 278
F + SL +R F ALVP L Y + + D
Sbjct: 267 SFSRFSLERFSWARMIVCSRNFGLTIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGF 326
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ 332
+ +G + G + N + L+NYGF EDN + ++V+ L+ D Q
Sbjct: 327 TITSLESICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQLDPGDGQ 386
Query: 333 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDTSEMQSVISSL 387
++ + L + R + SD P R G+ ++ T + S +
Sbjct: 387 -----LLYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAATEDEFSSMKMR 439
Query: 388 GPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNLHPKKRVATQL 442
P P+S E A L+ L +L YP +L E EA+L +Y L+ + A
Sbjct: 440 SPAHASPPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFF 499
Query: 443 VRMEKKM 449
+R EK++
Sbjct: 500 IRGEKQV 506
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/422 (22%), Positives = 170/422 (40%), Gaps = 89/422 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ + E + EL + L + L L +M
Sbjct: 694 IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 754 YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 806
Query: 225 KRE-----YNELDTVWFMAGSLFQK--VSLARRFALVPLGPPLLAYS------------- 264
R +V+ +G + + LA R +G ++AY+
Sbjct: 807 FRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHR-----MGSTIMAYAFDLENEEAENDDE 861
Query: 265 -------SKCKAMLAAV-------------------DDAVQLVVDRPYKAGESIVVWCGP 298
+ K+M+ V D+ + + R KAGE I+ + GP
Sbjct: 862 SDGWVEDREGKSMMGMVAMADILNADAEFNAHVNHGDEELTVTSIRDIKAGEEILNYYGP 921
Query: 299 QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKL---- 346
PNS+LL YG++ E + P+D V+ +L +E QD ++ + K+
Sbjct: 922 HPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQD--IMTETMDKMDQDD 977
Query: 347 --SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPVSPCMER 399
+ V +G + P + G D E +++ I L G + +
Sbjct: 978 LEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKVDGNLISDKRKRDD 1037
Query: 400 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 459
+ + + + + Y T++EDE +L +L ++R+A ++ EKK+L ++
Sbjct: 1038 ILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVRLGEKKLLQEAFDHFSE 1097
Query: 460 MI 461
M+
Sbjct: 1098 MV 1099
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 95/193 (49%), Gaps = 10/193 (5%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 78 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 137
Query: 335 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVIS 385
K V R G + VF +H E + +L +LR+ +++ S + + +
Sbjct: 138 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 196
Query: 386 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 445
PVS E + L D L Y T+ ED+++L +++L + ++A +L
Sbjct: 197 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLG 256
Query: 446 EKKMLNACLQVTA 458
EK++L ++ A
Sbjct: 257 EKEILEKAVKSAA 269
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 121/260 (46%), Gaps = 32/260 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
+ A ED++ G+ +P++ ++T+ER + + +L E + LA +L + +
Sbjct: 25 LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82
Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERA------ 221
G +S + Y++ L R+ G L W E ++ L GSP++ ER
Sbjct: 83 NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135
Query: 222 -EGIKREYNEL--DTVWFMAGSLFQK-VSLARR---FALVPLGPPLLAYSSKCKAMLAAV 274
E I+ E+ +L + + LF + + L R ALVP +L + C A +
Sbjct: 136 IEEIRAEFPQLTPGALRWAFDVLFSRLIRLPNRGGELALVPWAD-MLNHKPGCNAYIDDS 194
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQ 332
V L DR YK GE + G +P+++LLI+YGF E +NP D + ++ D +
Sbjct: 195 GGKVCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGENPDDEYEITLGIDPND-R 253
Query: 333 YQDKRMVA-QRNGKLSVQVF 351
Y D + A ++ G V+ F
Sbjct: 254 YADAKAAALEKIGLRPVESF 273
>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
Length = 206
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 78/160 (48%), Gaps = 13/160 (8%)
Query: 299 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 358
+ N++L ++YGF + ++ D + ++ DP Y DK +A+ NG F + G
Sbjct: 10 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67
Query: 359 KEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 410
E++ MLPYLRL + T +V L PVS E A+ + + K
Sbjct: 68 -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACK 124
Query: 411 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+ L Y T+ EDE +L NL P+ ++A ++ EKK+L
Sbjct: 125 SALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 151/371 (40%), Gaps = 37/371 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI-AELLTTNKLSELACLALYLMYEKKQG 171
+ A+ DL G+ VP L + + V ++ A L +AL L+ E +G
Sbjct: 83 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGGLRPWVAVALLLLSEVARG 142
Query: 172 KKSFWLPYIRELDRQR-------GRGQLAVESPLLWSETELAYLTGS---PTKAEILERA 221
S W PY+ L RQ G + ++ LL + + S +AEI+ R
Sbjct: 143 ADSPWAPYLAILPRQTDSTIFCAGLKKSSLRYKLLSTTVGVKEYVQSEFDSVQAEIISRN 202
Query: 222 EGIKREYNELDTVWFMAGSLFQKVS---LARRFALVPLG------PPLLAYSS----KCK 268
+ + D + G L +V + ALVP P + + S K K
Sbjct: 203 KDLFPGSITFDDFLWAFGILRSRVFPELRGDKLALVPFADLVNHSPDITSEGSSWEIKGK 262
Query: 269 AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 327
+ + L K+G+ I + + + N++L ++YGFV+ + D V ++
Sbjct: 263 GLFGR-EPMFSLRTPVDVKSGQQIYIQYDLDKSNAELALDYGFVESNPSRDSYTVTLEIS 321
Query: 328 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM------- 380
DP Y DK +A+ N F + ++ MLPYLRL + T
Sbjct: 322 ESDPFYGDKLDIAELNELGETAYFDIIL--DEPLPPQMLPYLRLLCIGGTDAFILEALFR 379
Query: 381 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 440
SV L P+SP E ++ + D K+ LA Y T+ EDE + NL P+ +A
Sbjct: 380 NSVWGHLE--LPLSPDNEESICQVMRDACKSALAAYHTTIEEDEELSERENLQPRLTIAI 437
Query: 441 QLVRMEKKMLN 451
+ EKK+L
Sbjct: 438 GVRAGEKKVLQ 448
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/401 (21%), Positives = 165/401 (41%), Gaps = 49/401 (12%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A + L +G +P L + +E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SF+ P+I L + P+ WSE+EL L G+ + ++R++
Sbjct: 95 RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQLQRDFEN 147
Query: 231 L---------------------DTVWFMAGSLFQKVSLARRFALVPLGPP---LLAYSSK 266
+ D W M+ + + R + + P + +
Sbjct: 148 IHQAVVEDFPEVFALLPTLTLEDYTWAMSVIWSRAFGVTREKKYLRVLCPAMDMFNHDVS 207
Query: 267 CKAML---AAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
+ +L + D+ Q++ V + AG ++ + G N+KLL +YGFV ++N +
Sbjct: 208 LRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKLLFSYGFVAKENSRRAV 267
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG-REKEAISDMLPYLRLGYVSDTSE 379
+ DP + K+ V N Q + E + +L LR+ +++ E
Sbjct: 268 DFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFENDVDERLLATLRVILMNE-QE 326
Query: 380 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 439
++ + +S E AV + L + + +LA + TL EDEA+L + R++
Sbjct: 327 IR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFATTLEEDEAILAEMATESSPRLS 385
Query: 440 TQL-VRME-KKMLNACL-------QVTADMIMLLPDVTVSP 471
+ VR+E K++L + QV A + + P T P
Sbjct: 386 FAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTTRP 426
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 101/235 (42%), Gaps = 42/235 (17%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 77 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181
Query: 225 KREYNELDTVWFMAGSLFQKVSL-----------------ARRFALVPLGPPLLAYSSKC 267
KRE+ + + ++S +L+P L +
Sbjct: 182 KREFLAIRKALEAFPEIIDRISCDDFMHAYALVTSRAWRSTEGVSLIPFAD-FLNHDGAS 240
Query: 268 KAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
+AML DD ++V DR + GE +++ G N+ L++++GF N +D++
Sbjct: 241 EAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQV 295
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/431 (20%), Positives = 173/431 (40%), Gaps = 65/431 (15%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
SK+ + +G +W+ +NG + E P ++ + A + + +P
Sbjct: 75 SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128
Query: 132 LVVTLERVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
L+ ++ + EL+T + ++A LA+ L+ E+ + + S W PY+ L
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW----FMAGSL 241
+ L + ++ L GSPT L++ I R+Y+ + V+ ++
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234
Query: 242 FQKVSLARRF------------------------ALVPLGPPLLAYSSKCKAMLAAVDDA 277
+ V R+ AL+P+ + + A +
Sbjct: 235 LRDVFTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNATSNY 294
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ R +K GE I + GP+ NS ++ GFV +N D + ++ D +++
Sbjct: 295 CECYALRDFKKGEQIFISYGPRTNSDFFVHSGFVYMENKQDGFKLRLGISKADSLQKERI 354
Query: 338 MVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPICPVS 394
+ + +V F + G E ISD +L +LR+ + +E+ I S + + +
Sbjct: 355 ELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FSMRKAELAHWIRSDRVNDLKHMD 411
Query: 395 PCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+E V + + + RL A YP TL ED +L + L K++ QL EK++L
Sbjct: 412 CALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPQIKKLTIQLRVTEKRIL 470
Query: 451 NACLQVTADMI 461
L+ I
Sbjct: 471 QGALEYVEQWI 481
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 9 DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68
Query: 335 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 386
K V R G + VF +HA E + +L +LR+ +++ + +I +
Sbjct: 69 MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127
Query: 387 LG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 441
LG PVS E +L + +AR L Y T+SED+A L +L +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183
Query: 442 LVRMEKKMLNACLQVTA 458
L EK++L ++ A
Sbjct: 184 LRLGEKEILEKAIKSAA 200
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 112/435 (25%), Positives = 165/435 (37%), Gaps = 100/435 (22%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
A D+QAG+ F VP L T + R + EL + LA L LY
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179
Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPT-- 213
+ +G SFW PY+ L +G ES PL LW+E E+ +L GSPT
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239
Query: 214 KAEIL------ERAEGIKREYNE------------LDTVWFMAGSLFQK-VSLARRFALV 254
A L E AE +R Y L+ + G LF + VSL ++
Sbjct: 240 SARALRSKVEREYAEACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLPAENGML 299
Query: 255 PLGP--PLLAYSSKCKAMLAA----------------------------VDDA------- 277
L P L +S+ C + + A DDA
Sbjct: 300 ALVPYADLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSDDAGAVANTD 359
Query: 278 -----------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
V DR Y E + V G + N++LL+ YGFV + NPY+ + V +L
Sbjct: 360 SSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFVSDRNPYNSVEVCVSL 419
Query: 327 NTEDPQ----YQDKRMVAQRNGK--LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+ + KR G+ + F ++A R + +L + L +
Sbjct: 420 SGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELMQLLRFASL--TEQDAAG 477
Query: 381 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN----LHPKK 436
S + + PV+ E A L K L YP + ED+A L D + L K+
Sbjct: 478 YSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSADEDDAALKDKSMAQLLSRKQ 537
Query: 437 RVATQLVRMEKKMLN 451
R++ +L R EK++L
Sbjct: 538 RLSVRLRRSEKRILE 552
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 111/271 (40%), Gaps = 52/271 (19%)
Query: 196 SPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALV 254
SPLLWS+ EL L GSP E R K +LA +A
Sbjct: 114 SPLLWSDAELEELMAGSPVLPEARSR-----------------------KKALADTWA-- 148
Query: 255 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP---------QPNSKLL 305
L P L A ++ A R A + +VVW G +PN +LL
Sbjct: 149 ALAPKLAADPARFPAG------------RRAAGARKGVVVWDGAGSEMLLNDGRPNGELL 196
Query: 306 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 365
+ G + ++N D L A L D Y K V + G + + F V+A R +
Sbjct: 197 LATGTLQDNNSSDFLSWPAGLVPADRYYMMKSQVLESMGYSAAEEFPVYADRMP---IQL 253
Query: 366 LPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 425
L YLRL V+D + + + +S E +L L + RLA Y + ED
Sbjct: 254 LAYLRLSRVADPALLAKC--TFEADVELSQMNEYEILQILMGDCRERLASYTKSYEEDVK 311
Query: 426 MLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
+ +L PK+R+A +L EK+++NA ++
Sbjct: 312 IAQQSDLSPKERLAVKLRLGEKRIINATMEA 342
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 93/193 (48%), Gaps = 10/193 (5%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
DD + V + ++AGE I ++ G + N++ +++ GF ++N +DR+ ++ ++ D Y
Sbjct: 67 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYA 126
Query: 335 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 386
K V R G + VF +H E + +L +LR+ +++ + + I +
Sbjct: 127 MKAEVLARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFT 185
Query: 387 LG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 445
LG PVS E + L D L Y T+ ED+A L L P+ +A +L
Sbjct: 186 LGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLG 245
Query: 446 EKKMLNACLQVTA 458
EK++L ++ A
Sbjct: 246 EKEILEKAVRSAA 258
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 150/373 (40%), Gaps = 53/373 (14%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
+L G+ VP L + + V + + L ++L ++ E +G S W
Sbjct: 52 NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------ 231
PY+ L RQ +S + WSE EL + G+ + + E ++ E++ +
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVEAGIIN 164
Query: 232 -------------DTVWFMAGSLFQKVS---LARRFALVPLGPPL----------LAYSS 265
D +W G L +V + AL+P + +
Sbjct: 165 VNKDLFPGTITFDDFLWAF-GVLRSRVFPELRGDKLALIPFADLINHDGDITSKESCWEI 223
Query: 266 KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEA 324
K K L D L K+GE I V + + N++L ++YGF + ++ D +
Sbjct: 224 KGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLTL 282
Query: 325 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM---- 380
++ DP Y+DK +A+ NG F V G + M+ YLRL + T
Sbjct: 283 EISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLGGTDAFLLEA 340
Query: 381 ---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 437
V L PVS E ++ + + K+ LA Y T+ EDE +L +L + +
Sbjct: 341 LFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQ 398
Query: 438 VATQLVRMEKKML 450
+A ++ EKK+L
Sbjct: 399 IAVEVRVGEKKVL 411
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 95/445 (21%), Positives = 175/445 (39%), Gaps = 63/445 (14%)
Query: 67 SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
+++ K+E D + + W+ +G K +K + E + ++ D++ G+
Sbjct: 56 GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111
Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
+P L ++T + G LL N + + LALYL+ E S P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY---------- 228
Y++ L + + W + L GSP + G R+Y
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQYCYFYQLFDNN 223
Query: 229 -NELDT--------VWFMAG--SLFQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVD 275
L T +W +A S V + AL+P ++ K + V
Sbjct: 224 PGILQTSNFTYEAFIWAVATVQSRQNPVGGGQEMALIPFWDFCNHSSHGGKITTFIDPVK 283
Query: 276 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ--Y 333
+ + YK GE + ++ GP+PNS+ + GF + N D + L+ ED +
Sbjct: 284 HVLTCSAAKSYKKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIA 343
Query: 334 QDK-RMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVS--DTSEM--------- 380
DK ++ +R G Q + E + ++++P+ R+ +S +T ++
Sbjct: 344 HDKIHILEERCGLRVGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHH 403
Query: 381 --QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 436
+ P +S E+ L D KARL+GYP TL++DE + + N ++
Sbjct: 404 HHHQGPMDMKPEAFNIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQEMKN-NPTTQR 462
Query: 437 RVATQLVRMEKKMLNACLQVTADMI 461
R ++ EKK+L ++ +I
Sbjct: 463 RYVLYILINEKKILERNIKYVEQLI 487
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 96/422 (22%), Positives = 170/422 (40%), Gaps = 89/422 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ E + + +L + L + L L +M
Sbjct: 43 ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 155
Query: 225 KRE-----YNELDTVWFMAGSLFQK--VSLARRFALVPLGPPLLAYS------------- 264
R +V+ +G + + LA R +G ++AY+
Sbjct: 156 FRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHR-----MGSTIMAYAFDLENEEAENDEE 210
Query: 265 -------SKCKAMLAAV-------------------DDAVQLVVDRPYKAGESIVVWCGP 298
+ K+M+ V D+ + + R KAGE I+ + GP
Sbjct: 211 SDGWVEDREGKSMMGMVAMADILNADAEFNAHVNHGDEELTVTSIRDIKAGEEILNYYGP 270
Query: 299 QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQD------KRMVAQRNG 344
PNS+LL YG++ E + P+D V+ +L +E QD +RM ++
Sbjct: 271 HPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQDIMAETMERM--DQDD 326
Query: 345 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPVSPCMER 399
+ V +G + P + G D E +++ I L G + +
Sbjct: 327 LEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKLDGNLISDKRKRDD 386
Query: 400 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 459
+ + + + + Y T++EDE +L +L ++R+A Q+ EKK+L ++
Sbjct: 387 ILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVRLGEKKLLQEACDHFSE 446
Query: 460 MI 461
M+
Sbjct: 447 MV 448
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 135/339 (39%), Gaps = 63/339 (18%)
Query: 160 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L +YL++++K G SF+ PY L P+ WS EL L GS ++I
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169
Query: 219 ERAEGIKREYNELDTVWFMAGSL-----FQ-------------KVSLARRFALVPLGPPL 260
+R + I+ +Y + V G+L F+ ++ R ALVP L
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLCTLDEFKWARMCVCSRNFGLQIDGHRTSALVPHADML 229
Query: 261 LAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-- 317
Y + K V + + +AG + G + N + L+NYGF EDN
Sbjct: 230 NHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLLNYGFAVEDNRELD 289
Query: 318 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 373
+ + +E ++ D +QDK R E IS G
Sbjct: 290 GFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETNQIS--------GA 327
Query: 374 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
V+ Q+V S+G P S E + + RLA YP T+S+D A L D +
Sbjct: 328 VTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTISQDMADLQDEASY 384
Query: 434 PK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 466
P+ +R A VR EK++L+ + DM+ + D
Sbjct: 385 PQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 97/438 (22%), Positives = 172/438 (39%), Gaps = 100/438 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A +D+ A F++P ++ E + I ++ +K L + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE QG S W Y L + ++P+ WSE EL L S + +I +A+
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKI-GKADAE 689
Query: 225 KREYNELDTVWFMAGSLFQK--------VSLARRFALVPLGPPLLAYS------------ 264
L + S+F V +A R +G ++AY+
Sbjct: 690 DMFKKTLVPIIRSNPSIFNAENRSDYELVEIAHR-----MGSTIMAYAFDLENDEEEEEE 744
Query: 265 -------SKCKAMLAAVDDAVQLVVD-------------------RPYKAGESIVVWCGP 298
+ K+M+ V A L D RP KAGE I+ + GP
Sbjct: 745 TEEWVEDREGKSMMGMVPMADILNADAEFNAHVNHEEESLTVTSLRPIKAGEEILNYYGP 804
Query: 299 QPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------EDPQYQ 334
PNS+LL YG+V E + P+D +VE+ L E+ +++
Sbjct: 805 HPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILEQIRGEFEEEEEFE 862
Query: 335 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 394
D ++ + G+++ + + D+ L+ ++ ++Q S P
Sbjct: 863 DTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIKKLQ---SDTIPDKRKR 918
Query: 395 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
+ +AVL + + AR YP ++SED+ +L NL + R+AT + EKK+L +
Sbjct: 919 DEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARMATVVRLGEKKLLQEAI 975
Query: 455 QVTADMIMLLPDVTVSPC 472
++ + + D P
Sbjct: 976 ATFSEDVEMTMDDESGPA 993
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 154/385 (40%), Gaps = 62/385 (16%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 98 DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE--------- 227
PY+ L V+ PL W E+E+ + L GSPT + + RA + E
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREEYASIRAAIA 210
Query: 228 -------------YNELDTVWFMAGSLFQKVSL--ARRFALVPL--------------GP 258
EL +A L + V L A +ALVPL P
Sbjct: 211 ADADAYPADAYEFLTELAFTDALATVLARAVWLNAANVYALVPLVDLLPVVGAPPPGVNP 270
Query: 259 PLLAYSSKCKAMLAAV-----DDAVQLV-VDRPYKAGESIVVWCGP---QPNSKLLINYG 309
A + + + AAV D A + V V A ++ V C + L ++ G
Sbjct: 271 AAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPVVCADALGRNAGDLFLSTG 330
Query: 310 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPY 368
V+ + D L + D Y K+ + + G Q F V A R + Y
Sbjct: 331 RVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQAFPVFADRMP---LQLFAY 387
Query: 369 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM-L 427
+R V + SE+ +V I VSP E +L L + LA Y + E E + L
Sbjct: 388 MRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAREMLAEYENSSEEFELLQL 445
Query: 428 TDYNLHPKKRVATQLVRMEKKMLNA 452
+ N+ ++ A +L EK+++NA
Sbjct: 446 KETNISERQMTAAKLRLGEKRLINA 470
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 100/416 (24%), Positives = 167/416 (40%), Gaps = 65/416 (15%)
Query: 83 SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+W K G K IL +KP + AS D+Q G++ VP++ V++
Sbjct: 46 AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V T+ +L ++ L LAL L+ E+ KS Y L G +P
Sbjct: 96 NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG------------SLFQKV 245
LLWSE E L G+ + + + +L F A + V
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQAGLFTANPAAFPPAVFTLPNFVWAV 207
Query: 246 SLAR----------RFALVPLGPPLLAYSSKCKAMLAAVDDAV----QLVV---DRPYKA 288
+ R + AL PL L+++ L+ + Q+ V R +
Sbjct: 208 AAVRSRSHPPLEGDKIALAPL-VDLVSHRRAANTKLSVRSSGLFGRGQVAVVEATRAIRK 266
Query: 289 GESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 347
GE++ + P + + +L++YG +D +P + L+ D DK + + G
Sbjct: 267 GEALGMDYAPGKLDGPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKADIVEGAGLRP 326
Query: 348 VQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQL 405
+ + +++ +M+ +LRL + D ++S+ + VS E AV L
Sbjct: 327 SMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSEGNEEAVCAML 379
Query: 406 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNACLQVTAD 459
A+ +A LAGYP TL +D A L + R A LVR+ EK+ L+A + D
Sbjct: 380 AEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAVARFFED 435
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 72/373 (19%), Positives = 159/373 (42%), Gaps = 53/373 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
+ A + +Q G+ +P +L+++++R + NE + L+E L ++L+ + G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KS W Y L R+ + W+ ++ +L GS T L E IK ++ L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211
Query: 232 DTVWFMAGSLFQKVSLA----------------------RRFALVPLGP-----PLLAYS 264
+ F L VS+ ++ +LVP P
Sbjct: 212 EKTIFSKNRLKYPVSIFNLAQWEWALSILLSRAIFLQNLKKVSLVPYADFMNHNPFSTSY 271
Query: 265 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 324
K + + + + + D+ Y + I G + N +LL+ YGF+ E NP+D + +
Sbjct: 272 INSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELLLLYGFILERNPFDSIELRI 331
Query: 325 ALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 381
+L+ +D ++ K+ ++ +++ +F+ +E + +LR +S+ E+
Sbjct: 332 SLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE------LYEFLRFC-ISNQEELG 384
Query: 382 SV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AMLTDYNLHPKKR 437
S +S + +E+ + + + L Y +SE++ ++ +++ + ++
Sbjct: 385 STDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKVSEEKILNSLNSNFLISKNQK 444
Query: 438 VATQLVRMEKKML 450
+A + + EKK++
Sbjct: 445 MALKQSKCEKKII 457
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 104/244 (42%), Gaps = 40/244 (16%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-------------------DTVW 235
S L W+ EL AYL SP + ++R + YN+L +T
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFL 237
Query: 236 FMAGSLFQKV----SLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 289
+ G LF ++ S+ R ALVP +L +S + + L + DR Y+ G
Sbjct: 238 WSFGILFSRLVRLPSMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296
Query: 290 ESIV 293
I+
Sbjct: 297 IYIL 300
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 97/402 (24%), Positives = 163/402 (40%), Gaps = 74/402 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE----LACLALYLMYEK 168
V A+ D+ + FS+P + V L + +A L + +L E L LM E
Sbjct: 42 VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER-AEGIKRE 227
Q + S W PY+ L + ++S + WS++ELA L S +I ++ AE + +
Sbjct: 96 -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147
Query: 228 YNELDTVWFMAGSLFQKVS---LARRF--------------------------------- 251
Y + + + KV+ +A F
Sbjct: 148 YIAPQGLKHSSTEMCHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDEKTI 207
Query: 252 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
+++PL L A + + A L ++ +++ +P GE I G P S LL YG+
Sbjct: 208 LSMIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSDLLRRYGY 267
Query: 311 V-DEDNPYD------RLVVEAALN----------TEDPQYQDKRMVAQRNGKLSVQVFHV 353
V D + YD L+V N T+D + + +A+R G V
Sbjct: 268 VTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTRLELAEREGVYEDSFDLV 326
Query: 354 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 413
H+ ++ +I D L + D S ++++++S I S LA +AR
Sbjct: 327 HSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKLTTELAGQVLATLLQARE 386
Query: 414 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 455
Y TL EDE +L + +L + +A Q+ EKK+L A +Q
Sbjct: 387 KEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 90/430 (20%), Positives = 169/430 (39%), Gaps = 65/430 (15%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW----FMAGSLF 242
+ L + ++ L GSPT L++ I R+Y + ++ ++
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQYAYFNKLFQKNNNAVSAIL 236
Query: 243 QKVSLARRF------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAV 278
+ V ++ AL+P+ +SK A +
Sbjct: 237 RDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIHALIPMWDMCNHENSKITTDFNATLNCC 296
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 338
+ R +K E I + G + NS ++ GFV DN D + ++ DP +++
Sbjct: 297 ECYALRDFKKAEQIFISYGARTNSDFFVHSGFVYMDNEQDGFKLRLGISKADPLQKERVE 356
Query: 339 VAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSP 395
+ + +V F + G E ISD +L +LR+ + E+ I S + + +
Sbjct: 357 LLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDC 413
Query: 396 CMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
+E V + + + RL A YP TL ED +L + L K++A QL EK++L
Sbjct: 414 ALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQ 472
Query: 452 ACLQVTADMI 461
L+ I
Sbjct: 473 GALEYVQQWI 482
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 91/433 (21%), Positives = 168/433 (38%), Gaps = 90/433 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A D+ A F++P + +E + I ++ +K L + L L +
Sbjct: 43 IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE +G S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155
Query: 224 I-------------------KREYNELDTVWFMAGSLFQKVSL----------------- 247
+ R +EL + GS +
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215
Query: 248 ----ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 303
+VP+ +L ++ A + ++++ + RP KAGE I+ + GP PNS+
Sbjct: 216 DRDGKSMMGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEILNYYGPHPNSE 274
Query: 304 LLINYGFVDEDN--------PYDRLVVEAALNT----------------EDPQYQDKRMV 339
LL YG+V E + P+D +VE+ L + +++D ++
Sbjct: 275 LLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILKQIRGEFEGEEEFEDTFVL 332
Query: 340 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 399
+ G+++ + + D+ L+ S ++ V S P + +
Sbjct: 333 ERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLKGIKKVQSDTIPDKRKRDEIHQ 388
Query: 400 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 459
AVL + + AR YP ++SED+ +L NL + R+AT + EKK+L + ++
Sbjct: 389 AVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARMATVVRLGEKKLLQEAIATFSE 445
Query: 460 MIMLLPDVTVSPC 472
+ + D P
Sbjct: 446 DVEMTMDDESGPA 458
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/441 (21%), Positives = 167/441 (37%), Gaps = 99/441 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A +D+ A F+VP S +V +E L +T E+ + + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
L+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I +
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKI---GK 640
Query: 223 GIKREYNELDTVWFMAGS--LFQKVSLARRFALVPL----GPPLLAYS------------ 264
E + + G+ +FQ L+ L G +++Y+
Sbjct: 641 TNAEEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFDFQNEDEEEED 700
Query: 265 --------SKCKAMLAAVDDAVQLVVDRPY-------------------KAGESIVVWCG 297
+ K+ + V A L D Y KAGE I+ + G
Sbjct: 701 DSEEWVEDREAKSTMGMVPMADILNADAEYNAHVNYGDDALTVATLRTIKAGEEILNYYG 760
Query: 298 PQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED----------PQYQD 335
P PNS+LL YG+V + P+ D L L++E ++++
Sbjct: 761 PHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLDSAREHLDLDEFEE 820
Query: 336 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 395
++ + + + + + + E D+ L+ M I + P C V
Sbjct: 821 TFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLKAIRKVDPSCIVDK 872
Query: 396 CMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
V L A + YP T+ EDE +L+ NL +++ A + EK++L
Sbjct: 873 RKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVTVRLGEKRLLQEAR 932
Query: 455 QVTADMIMLLPDVTVSPCPAP 475
+ ++ + D + PAP
Sbjct: 933 VLLSE---IASDAILDDAPAP 950
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 117/282 (41%), Gaps = 68/282 (24%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++D++ GD FS+P S++++ + ++EL ++LS + L L +MYE ++
Sbjct: 41 VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW PY L R+ +P+ W++ +L L G+ ++I + E + +NEL+
Sbjct: 97 DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148
Query: 233 TV-----------------WFMAGSLFQKVSLARRF------------------------ 251
+ + + GSL S
Sbjct: 149 PIIKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEEEEEEEEE 208
Query: 252 ---------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
++VP+ L + A L D++Q+ + K GE I G N+
Sbjct: 209 EEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIKEGEQIYNTYGDLCNA 268
Query: 303 KLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 338
LL YGFVDE N +D L+VE +D +++++
Sbjct: 269 DLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/257 (23%), Positives = 108/257 (42%), Gaps = 28/257 (10%)
Query: 78 LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+G+L SW+ + G P V+ L P+ + +++AGD +P++L++T
Sbjct: 21 IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74
Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL----DRQRGRGQ 191
+ V EL ++ LS L ++++ E+ G+KS W PYI + D + R
Sbjct: 75 DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESIPDVFDGLQCRKS 129
Query: 192 LAVESPLL-----WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVS 246
+ + L W+ A ++ + R GI + W + V
Sbjct: 130 VRLPRRLAQAIDRWN----AERRNVFSRLRMFFRGRGIDLNFETFSWAWSAVNTRCIYVE 185
Query: 247 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 306
L P LL + K + V++ + + Y+AG + + G N L +
Sbjct: 186 -GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNVGYEAGSEVFIGYGSHDNRTLFL 243
Query: 307 NYGFVDEDNPYDRLVVE 323
NYGFV ++NP D + VE
Sbjct: 244 NYGFVLDENPNDCITVE 260
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 78/163 (47%), Gaps = 22/163 (13%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
VA V S+ + D L +W+ G+ KV ++ P R VAA ED+
Sbjct: 44 VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
GDA ++P SL +T++ L + A + +AL L++E+ G+KS W Y+
Sbjct: 99 GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L Q +++PL WS E+A LTG+ ++L+ A G
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAG 187
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 105/442 (23%), Positives = 175/442 (39%), Gaps = 102/442 (23%)
Query: 110 IHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLA 161
++ A ED+ A F++P ++ +E + I + +K L + L
Sbjct: 44 VNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLDSWSSLI 103
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-R 220
L ++YE QG+ S W PY L + ++P+ WS+ EL L S + +I +
Sbjct: 104 LIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRHKIGKAD 156
Query: 221 AEG--------IKREYNELDTVWFMAGSLF--QKVSLARRFALVPLGPPLLAYS------ 264
AE I R E+ F AG+ + + +A R +G ++AY+
Sbjct: 157 AENMFQKTLLPIIRSNAEI----FNAGNKTDAELIEIAHR-----MGSTIMAYAFDLEND 207
Query: 265 -------------SKCKAMLAAVDDAVQLVVD-------------------RPYKAGESI 292
K+M+ V A L D RP KAGE I
Sbjct: 208 EEEEEEADGWVEDRDGKSMMGMVPMADILNADAEFNAHVNHEEESLTVTSLRPIKAGEEI 267
Query: 293 VVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL--NTEDPQYQDKRMVAQR 342
+ + GP PNS+LL YG+V E + P+D +VE+AL N P +++ Q
Sbjct: 268 LNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIP----GQVLEQI 321
Query: 343 NGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSLGPICP-- 392
G L V G + P D E +++ + + P
Sbjct: 322 RGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKGIKKAQPDA 381
Query: 393 VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+ +R + Q LA +A +A YP ++SEDE +L +L+ + R+A + EKK+L
Sbjct: 382 IPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLKQ-DLNQRTRMAIAVRLGEKKLL 440
Query: 451 NACLQVTADMIMLLPDVTVSPC 472
+ ++ + + D P
Sbjct: 441 QEAITASSGDVEMTMDDESGPA 462
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 107/230 (46%), Gaps = 34/230 (14%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
+VP+ +L ++ A + +D + +V RP KAGE I+ + GP PNS+LL YG+
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129
Query: 311 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 346
V + P+D +V++ L + DP+ ++D ++ + +G+
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187
Query: 347 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 406
+ + +E +++ L+ + ++++ G + P + +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240
Query: 407 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
+ LA YP T EDEA+L NL ++R+A ++ EK++L LQ+
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQM 290
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/369 (20%), Positives = 150/369 (40%), Gaps = 49/369 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +Q G+ +P +L+ L++ L E +E L+ L+E LA+ + E+ G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W Y L +++ + W +++ +L GS E IK ++ ++
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209
Query: 235 WFMAGSLFQ------------KVSL----------ARRFALVPLGP-----PLLAYSSKC 267
F L +SL ++ ALVP P
Sbjct: 210 IFSKNRLVYPEKIFNLQSWEWAISLLLSRAIFLQNMKKIALVPYADFINHNPFSTSYINS 269
Query: 268 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 327
K + + ++ + + D+ Y + I G + N +LL+ YGF+ E NP+D + + AL+
Sbjct: 270 KKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGFIIERNPFDSIELRVALS 329
Query: 328 TEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 384
T+D Y K ++ +++ VF+ +E + ++RL
Sbjct: 330 TKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------LYEFMRLCLSGPRDFFGEEF 383
Query: 385 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN---LHPKKRVATQ 441
S+L + +E+ + + K L Y T++E++ + N L ++ + +
Sbjct: 384 SNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEKILNNLSNIIVLTKNQKTSIK 443
Query: 442 LVRMEKKML 450
+ EKK+L
Sbjct: 444 QRKCEKKIL 452
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 90/422 (21%), Positives = 159/422 (37%), Gaps = 90/422 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ A F++P ++ +E GN+ E + L L L
Sbjct: 42 IIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 101
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
++YE QG S W PY L + + + W ++L L GS ++I E A
Sbjct: 102 VMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEA 154
Query: 222 EGIKREYNELDTVWFMAGSLF-----------QKVSLARRFALVPLGPPLLAYS------ 264
+ + R + + TV ++F + + LA R +G ++AY+
Sbjct: 155 DEMFR--SRILTVIAANPAIFYPEGSSPLGEAELLQLAHR-----MGSIIMAYAFDLDNE 207
Query: 265 --------------SKCKAMLAAV-------------------DDAVQLVVDRPYKAGES 291
K ML V DD + + RP AGE
Sbjct: 208 EEPEQEEDDEWIEDRDGKTMLGMVPMADILNADAEFNAHVNHGDDELTVTALRPIPAGEE 267
Query: 292 IVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALNTEDPQYQDKRMVAQ 341
I+ + GP PNS+LL YG+V + P+D + V L D ++ +
Sbjct: 268 ILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLKIGDDVWKQVQEYVD 327
Query: 342 RNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISSL----GPICPVSPC 396
V V +G + E + +R ++++V+ ++ G + P
Sbjct: 328 PEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKAIKKINGDLIPDKRK 387
Query: 397 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
+ + + L+ YP + EDEA+L +L ++R+A + EKK+L L+
Sbjct: 388 RDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLKEALEF 447
Query: 457 TA 458
Sbjct: 448 AG 449
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 82/183 (44%), Gaps = 29/183 (15%)
Query: 160 LALYLMYEKKQGK----KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
LAL ++ E+ +G+ S W PYI L +++ LW +TEL+YL SP
Sbjct: 67 LALIVLMERYKGQAKLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYG 121
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQKVSL-----------ARRFAL----VPLGPPL 260
+ ER E I E+ ++ + LF KVSL +R A+ + P+
Sbjct: 122 KTRERLEMITTEFGQVQNALDVWPQLFGKVSLEDFKHVYATVFSRSLAIGEDSTLVMIPM 181
Query: 261 LAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 315
L + +S K + + + DR Y + I + G N++L ++YGF +N
Sbjct: 182 LDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYGFAVPEN 241
Query: 316 PYD 318
PYD
Sbjct: 242 PYD 244
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/412 (21%), Positives = 168/412 (40%), Gaps = 57/412 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++ LQ G+ +P L ++ + ++ L N+L + +AL+LM E+ +
Sbjct: 39 VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK--AEILERA-----EGIK 225
SF+ P+I+ L Q + P+ W++++ A L G+ A+I+ + + I
Sbjct: 99 DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEGTNVALLAKIMRKQIEADFQAIH 151
Query: 226 ----REYNEL-----------DTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAM 270
R Y E D W ++ + + R + + P L + +
Sbjct: 152 IPLLRAYEERLNLRTSEISISDYEWALSIIWTRAFGITRYGEYLRVLCPALDMFNHSVLV 211
Query: 271 LAAVDDAVQL---------VVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDR 319
+D+ ++ V A + + G ++KLL +YGFV +E N ++
Sbjct: 212 QEPLDEFIKYDHMKDVLAHCVVMETSANDPFYISYGSYSDAKLLYSYGFVSLNEKNRFNG 271
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVS--D 376
+ + + DP ++ K+ + + N Q + + + + L R+ +S +
Sbjct: 272 IDLWMRVPVTDPNFKLKQAILEGNAATRDQTYDFRGTIHLDDVDERFLASFRIILLSQEE 331
Query: 377 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 436
E + S VS E AV + D + RLA +P +L +D L + ++
Sbjct: 332 FREYEKAFDS----TIVSVRNELAVYAAIHDVCEKRLARFPTSLEDDLKKLAELEMNSDL 387
Query: 437 RVATQL-VRME-KKMLNACLQVTADMIMLL--------PDVTVSPCPAPYAP 478
R + VRME KK+L + ++ + LL PDVT P P
Sbjct: 388 RKTYAISVRMEDKKILQSVCRLMKEWRNLLENDSNIYPPDVTRQQQPQLSMP 439
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 370
DN D + ++ D +++ + + SV F + G E ISD +L +LR
Sbjct: 78 CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 371 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 424
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 425 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
+L + L K++A QL EK++L+ L+ I
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALEYVEQWI 230
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 111/271 (40%), Gaps = 43/271 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264
Query: 232 -----------------DTVW----FMAGSLFQKVSLAR-RFALVPLGPPLLAYSSKCKA 269
D +W + + S+ +S + LVP+ L S
Sbjct: 265 LCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHSVSPHIL 324
Query: 270 MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAA 325
VD+A ++ + RP AGE + G P S LL YGF+ DNPYD + ++
Sbjct: 325 NYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLLTFYGFLPRGDNPYDVIPLDL- 383
Query: 326 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 356
D D+ + AQ + S Q H+ G
Sbjct: 384 ----DTSADDEDITAQSSATTS-QTTHMVRG 409
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/321 (21%), Positives = 132/321 (41%), Gaps = 49/321 (15%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+H++G V + E + + A++D G +VP ++++ E+
Sbjct: 88 SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140
Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
++E + + L + LAL+L+ EK SFW PYI L + + L
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTV------------------WFM 237
++ ELA L SP L+ I R+Y N++ T+ W +
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTIDLPVLKNLQDIFTFDNYRWAV 252
Query: 238 AGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGE 290
+ + ++ ++ + A L + C + L ++R Y+ E
Sbjct: 253 STVMTRQNNIVQGTAFT-LTNAFIPLWDMCNHKHGKITTDFNLELNRGECYALQDYRRDE 311
Query: 291 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 350
I ++ G +PNS L ++ GFV DN YD L + ++ D K + + G V
Sbjct: 312 QIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTN 371
Query: 351 FHVHAGREKEAISDMLPYLRL 371
F ++ G ++ ++L ++R+
Sbjct: 372 FSLYKGASPISV-ELLAFIRI 391
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/296 (22%), Positives = 112/296 (37%), Gaps = 65/296 (21%)
Query: 81 LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L W H +G P ++ EK + + +P +A+E L + A + P S+ ++
Sbjct: 12 LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67
Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
L + I LT N L YL+ + +GK S W PYI
Sbjct: 68 NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
L + A+ P W+E ++ L G+ I E + +K EY + + GS
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185
Query: 241 --------------LFQKVSL---------ARRFA--LVPLGPPLLAYS----------- 264
+F S AR + L+P G + +S
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVERLLPEGAKIDDFSILQPLYDIGNH 245
Query: 265 ---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
++ L + A +L+ Y+ G+ + G + NS+LL+ YGFV E+N Y
Sbjct: 246 SPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNYGKKTNSELLLGYGFVTENNDY 301
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 117/300 (39%), Gaps = 76/300 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
V A D+ AG +VP +L++T E+ +ET +L+T+ L +EL+ L
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
++L+ E+++G S+W PYI +L QR R PL ++E EL L SP E +
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281
Query: 222 EGIKREYNELDTVWFM------------AGSLFQKVSLARRFA-LVPLGPPLLAYSSKCK 268
+ R+Y M +G QK+ RR A +P+ L Y C
Sbjct: 282 RNVVRQYAAWKQYLMMLELARAAELPSGSGDAHQKILDQRRRAQAMPVRYNELTYDLFCW 341
Query: 269 AMLAAVDDAVQLVVDR-------------------------------------------- 284
A A Q+VV
Sbjct: 342 ASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGGASSFYDTQTRSLECVAVA 401
Query: 285 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
P GE +++ G + + N FV D+P D+ ++ A+ +DP ++ K + Q G
Sbjct: 402 PIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILLAVGKQDPLFKSKSTILQALG 461
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 172/396 (43%), Gaps = 57/396 (14%)
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
+ RP+ + AS D Q GD FSVP+S ++ E V + +L L +AL L
Sbjct: 65 QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 223
+ ++ KS Y + +++PLLWSE EL L G+ ++L+ G
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170
Query: 224 ---IKREYNELDTVWFMAG------SLFQ------KVSLARRFALVPLGPPLLAYS---- 264
+ + +L + F + S+F V+ R + PL P +A +
Sbjct: 171 LTFFRSTFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALAPLTE 230
Query: 265 ---------SKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGP-QPNSKLLINYGFV 311
SK A + Q++V R + GE + + GP + + +L++YG +
Sbjct: 231 LVSHRRAANSKLSVRSAGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVM 290
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 371
D +P + + D DK + + N V+++ +++ +ML +LRL
Sbjct: 291 DVTSPKPGYSLTLKMPDSDRFIDDKLDILESNDLPQSVVYNLTP--DEQPTIEMLAFLRL 348
Query: 372 GYV--SDTSEMQSVISS--LGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 426
+ SD ++S+ + G + PVS E AV + L++ +A L GY T+ +D A
Sbjct: 349 MQLKGSDAFLLESIFRNDVWGFMQEPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAE 408
Query: 427 LTDYNLHPK--KRVATQLVRM-EKKMLNACLQVTAD 459
L K +R A L+R+ EK+ L+A + D
Sbjct: 409 LRAQGSRAKGSRREAALLIRLGEKEALDAVARFFED 444
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 9/110 (8%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+D L W+ +NG ++L + P V ++ +L+ GD S+P +L++T
Sbjct: 4 DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57
Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIREL 183
V+ N + + + T K L+ L+LYL+ EK +GK SFW PYI+ L
Sbjct: 58 TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL 106
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 91/435 (20%), Positives = 182/435 (41%), Gaps = 83/435 (19%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
KSW+ ++G+ + ++E S E V A +D++ ++P ++T E
Sbjct: 83 FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136
Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +A L+ N++ + CLALYL E+ S + PYI + ++ +
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNEL-------------------------- 231
L + E+ YL G+ + + + + I R++ L
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVFNGSHQKEDVEKLPLQARNAFTF 248
Query: 232 DTVWFMA--------------GSLFQKVSLARRFALVPL--------GPPLLAYSSKCKA 269
DT + A G + + AL+P+ GP AY+
Sbjct: 249 DTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN----- 303
Query: 270 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
A+ ++ + + +K GE + + G + NS LLI+ GFV +++P+D++ + ++ +
Sbjct: 304 ---ALTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVSQK 360
Query: 330 DPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEMQS 382
DP Y K +++ + N ++S Q +L +LR+ ++++ + ++
Sbjct: 361 DPLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQKN 420
Query: 383 VISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 439
+SSL I V + V + L + K L G+ E M+ D +L + ++A
Sbjct: 421 ELSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSKLA 480
Query: 440 TQLVRMEKKMLNACL 454
Q E ++L+AC+
Sbjct: 481 LQFRIEEHRILSACV 495
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P+ + + A D + R +K GE + + GP+ NS ++ GFV
Sbjct: 18 ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 370
+N D + ++ D +++ + + G SV F + G E ISD +L +LR
Sbjct: 78 YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135
Query: 371 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 424
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194
Query: 425 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
+L + L K++A QL EK++L L+ I
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALEYVEQWI 230
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 97/429 (22%), Positives = 164/429 (38%), Gaps = 78/429 (18%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + ++ LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYI----------------- 180
L N + E + K +S L L +L+ EK G++S W PY+
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPKTYSCPVCLEQDV 146
Query: 181 -----RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
L +Q + AV + S+ + L + Y+ L+ W
Sbjct: 147 VQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQS------LFAENTATIFNYSALEWAW 200
Query: 236 FMAGS--LFQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRP 285
+ ++ K S F+L P L P LL +S + KA ++ +
Sbjct: 201 CTINTRTIYMKHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQ 260
Query: 286 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 345
K E + + GP N +LL+ YGFV DNP+ + V + + DK Q+N K
Sbjct: 261 CKKYEEVFICYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDK----QKNAK 316
Query: 346 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 405
LS+ H D+L L G+ + + + + L C RA+ L
Sbjct: 317 LSILKEH-----------DLLENLTFGWDGPSWRLLTALKVLSLGGDEFTCWRRAL---L 362
Query: 406 ADYFKARLAGYPATLSEDEAM-----LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADM 460
D AR +E +A+ + + + + V Q+ +M++ N Q+T
Sbjct: 363 GDVISAR--------NEQQALNITTKICHFLIEETQHVLLQISQMKRDKENLKEQLTLVE 414
Query: 461 IMLLPDVTV 469
+ L D+ +
Sbjct: 415 ALRLEDLKI 423
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 109/244 (44%), Gaps = 33/244 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EYN + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124
Query: 235 WFMAGSLF-QKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVD--------------- 275
+F + +SL A +A V A SS+ +M VD
Sbjct: 125 ISSCPHVFGEDISLFQFAHAYATVVSRAWNGALSSEI-SMRPFVDFCNHDPVSHATVSHD 183
Query: 276 ---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 332
DA ++ +R Y GE + + G + N+ L ++YGFV +N D+ + + DP
Sbjct: 184 TCKDAT-IIAERDYTKGEEVFISYGKRSNAVLAVDYGFVLPNNLSDQAELWMEIPWNDPL 242
Query: 333 YQDK 336
+ K
Sbjct: 243 REKK 246
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 158/397 (39%), Gaps = 80/397 (20%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A D+ + +P L + + V +GN + L +AL+L+ EK +
Sbjct: 84 AERDIARNEVVLEIPKKLWINPDAVAASDIGN-------VCSGLKPWISVALFLIREKLK 136
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------SETELAYLTGSPT 213
+ S W PY+ L S + W SE ELA L G+
Sbjct: 137 KEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRSEEELAELQGTQL 189
Query: 214 KAEILERAEGIKREYNEL-------------------DTVWFMAGSLFQKVSLAR----R 250
L E ++RE+ ++ D +W A + + + +R
Sbjct: 190 LRTTLGVKEYMQREFAKVEEEILLPHKELFPSPITLDDFLW--AFGILRSRAFSRLRGQN 247
Query: 251 FALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQ 299
L+PL + AY K + + + L K+GE +++ + +
Sbjct: 248 LVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPISVKSGEQVLIQYDLNK 306
Query: 300 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 359
N++L ++YGF+++ + + ++ DP + DK +A+ NG F + G
Sbjct: 307 SNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGSGETADFDIVLGNPL 366
Query: 360 EAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLA 414
MLPYLRL + T ++S+ + G + P+S E + + D K+ L+
Sbjct: 367 PPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEELICRVVRDACKSALS 424
Query: 415 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
GY T+ EDE L +L+P+ +A + EKK+L
Sbjct: 425 GYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQ 460
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 89/378 (23%), Positives = 153/378 (40%), Gaps = 58/378 (15%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL---LTT-----NKLSELA 158
HR + + A+ED++ G+ F+VP L++ + E L T N S
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKWLDTHGKSLNDSSGWL 184
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEI 217
L + LM+E Q K SFW Y+ + G PL W E E G P +I
Sbjct: 185 PLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDI 238
Query: 218 LERAEGIKREYNELDTVWF-----MAGSL-------FQKVS---LARRF-------ALVP 255
+ E I+ EY E ++ + GSL F+++ +A F ++VP
Sbjct: 239 IVDRENIETEYAEFVLLFLRRNKDLFGSLENYSLEFFKRMVAFVMAYSFTEDEESPSMVP 298
Query: 256 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-ED 314
+ +L + S A L +Q++ R K GE + G N++LL YG+V+
Sbjct: 299 MAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKLGNTELLQMYGYVEIPS 357
Query: 315 NPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 363
N YD L++ + +DP K + R G V F +
Sbjct: 358 NQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAEVDAFFMFDKNGLRCGP 417
Query: 364 DMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQLADYFKARLAGYPATL 420
D++ +L++ + SD E++ ++ + P S + + L + + K L +
Sbjct: 418 DLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSKKTE--KNSLGMTVIDI 474
Query: 421 SEDEAMLTDYNLHPKKRV 438
+ED+ + N + +K V
Sbjct: 475 TEDDTEMDIENFNKRKNV 492
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 120/300 (40%), Gaps = 61/300 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA--ELLTTNKLSELACLALYLMYEKKQ 170
AAS DL AG A ++P+S ++T L + T T L E + L+L+YEK
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254
Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
G +S W P + L + G L + +P W +E A L G+P + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313
Query: 220 RAEGIKREY----------------NELDTV---------WFMAGSLFQKVSLARRFALV 254
E R++ EL T+ W G Q ++
Sbjct: 314 ARETTARQHAALFPALGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVGGASGGG 373
Query: 255 PLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAGESIVVWCGPQPN 301
PP A+L AV DDA+ L + R +AGE I V G + N
Sbjct: 374 EHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVSYGAKSN 433
Query: 302 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN-----GKLSVQVFHVHAG 356
++LL+ YGF DNPYD + L+ E PQ + + + A R KLS+ V G
Sbjct: 434 AELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRERVLHRAKLSLSPHSVRCG 489
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 144/389 (37%), Gaps = 64/389 (16%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
A + VP SL++ L E + + K + A LAL++++E ++ SF
Sbjct: 4 ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-----------ERAEGI 224
W PY L + V P+ W++ ++ L GSP A +L E I
Sbjct: 63 WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTEHIVPI 115
Query: 225 KREYNELDTVWFMAGSLFQKVSLAR--------------RFAL--------VPLGPPLLA 262
R Y V SL SLAR F L VP+ +
Sbjct: 116 VRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPMADLINH 175
Query: 263 YSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN---- 315
+ DD Q V +PY GE + + N +LL Y + EDN
Sbjct: 176 SLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVEDNYNKY 235
Query: 316 ---PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 372
D + E + R + QR L+ Q + V + +E D++ LRL
Sbjct: 236 TEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV---KSEEFPLDLVQALRLY 290
Query: 373 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT-DYN 431
++ T S+ PVS E V D +A K L+ YP T ED AML D
Sbjct: 291 HLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQYPITAQEDAAMLAHDPR 348
Query: 432 LHPKKRVATQLVRMEKKMLNACLQVTADM 460
L R+A R +K L V A+M
Sbjct: 349 LSATARLAVAYRREDKLFLTEVGSVFAEM 377
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 106/276 (38%), Gaps = 55/276 (19%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
EE+ L W +NG+ V K +P+ + R + A E + + + SVP L+
Sbjct: 47 EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99
Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T LE +GN + K S L L+LMYEK K SFW PYIR L
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW----------FMAG 239
+P ++ EL +L + + E+ IK+ Y + F
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAYNDVLQDFDCNFWRT 211
Query: 240 SLFQKVSLA---------------RR-------FALVPLGPPLLAYSSKCK--AMLAAVD 275
F+ A RR AL PL LL + K + +
Sbjct: 212 VDFESFKWAWCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRFNSSS 270
Query: 276 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
++ V Y+ G + + GP N++L + YGFV
Sbjct: 271 KNYEINVITEYQKGTQVFINYGPHDNTRLFLEYGFV 306
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VP+ L A A L + +++V +P AGE I G PNS LL YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319
Query: 312 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 359
D E NP D + + A L R + G L V+V F + +
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADLVV-----SAVRKARKAAGDLQVRVDFWLEEADDDT 374
Query: 360 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 412
E ++L ++RL +S T + + + + G + +E +L + D K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430
Query: 413 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 455
L YP T+ EDE++L + NL KR A + EK++L LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 90/205 (43%), Gaps = 38/205 (18%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYN----ELDTVWF------------MAGSLFQKVSLARRF----------- 251
ER + +++Y+ +LD + F A ++ + + +
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPDCEQYTWELYLWASTIITSRAFSAKVLSGAVQPDDLP 219
Query: 252 -----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 306
AL+PL L + K A D+ + L+V + AG+ I GP+ N +LLI
Sbjct: 220 EDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNEQLLI 278
Query: 307 NYGFVDEDNPYDRLVVEAALNTEDP 331
NYGF NP D +V + + P
Sbjct: 279 NYGFCIAGNPTDYRIVLLGVKPDSP 303
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 117/308 (37%), Gaps = 76/308 (24%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
G L W ++G P ++ L ++ + P +A+E LQ G AA S P + +
Sbjct: 10 FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65
Query: 135 TLERVLGNETIAELLTTNKLSELACL------------------------ALYLMYEKKQ 170
+ L + + +++ ++ A L +L+ E +
Sbjct: 66 SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKRE-- 227
GK SFW PYI L Q+AV + P W + ++AYL G+ I E E +KRE
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182
Query: 228 ----------------YNELDTVW---------FMAGSLFQKVSLARRFALVPLG----- 257
Y +L W F + + R AL+P G
Sbjct: 183 QARKLLKEEDFPDLPAYTQLLYKWAFCIFTSRSFRPSLVLSDATKRRLSALLPQGVQLDD 242
Query: 258 ----PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 308
PLL ++ +V D +L+ PY+ G + G + NS+LL+ Y
Sbjct: 243 FSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQVYNNYGLKTNSELLLAY 302
Query: 309 GFVDEDNP 316
GF+ + P
Sbjct: 303 GFILPETP 310
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 64/143 (44%), Gaps = 14/143 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
++ + D L +W+ + G V ++ P + A+ D+ GDAA V
Sbjct: 73 ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P +L +T E ++ I L LAL L++EK G S W YIR L R
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185
Query: 189 RGQLAVESPLLWSETELAYLTGS 211
A+++PL WS ELA L G+
Sbjct: 186 ----ALDAPLFWSSEELAELAGT 204
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 95/196 (48%), Gaps = 13/196 (6%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 401 DDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 460
Query: 335 DKRMVAQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV-------- 383
K V R G + VF +H E + +L +LR+ +++ + +
Sbjct: 461 MKAEVLARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDR 519
Query: 384 ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 442
I +LG PVS E + L D L Y T ED+A+L + +L + ++A +L
Sbjct: 520 IFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKL 579
Query: 443 VRMEKKMLNACLQVTA 458
EK++L +Q A
Sbjct: 580 RLGEKEILEKAVQSAA 595
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/226 (21%), Positives = 96/226 (42%), Gaps = 44/226 (19%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA L+ EKK G+KS W+PYI L + + S + W E EL+ + S E ++
Sbjct: 2 LAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVK 56
Query: 220 RAEGIKREYNELDTVWFMAGSLFQKVSL------------------------ARRFALVP 255
+ I+++++ F+A + Q + ++R +L+P
Sbjct: 57 QKAQIEKDFS------FVAQAFKQHCPIVTERPDLEDFMYAYALVGSRAWENSKRISLIP 110
Query: 256 LGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
+ +L D+ +Q+ DR Y G+ + + G N+ L++++G
Sbjct: 111 FADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFG 170
Query: 310 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 352
F N +D + ++ + +DP K + Q + +V+ +FH
Sbjct: 171 FTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 135/335 (40%), Gaps = 78/335 (23%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASED 118
D +V+ + +VV+ K+W+ +NG P E+ ++ V AS+D
Sbjct: 8 DGIVSANGDVVA--------FKNWLAENGAEFHPHAAFRTERSGYS--------VIASQD 51
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSE--LAC--LALYLMYEKKQ 170
L++ S P SL +T E + + LL T SE L C + ++ + +
Sbjct: 52 LRSDTTVVSCPFSLAITPE--VSKNALTTLLGPTFTGQSWSERQLICSYICMHWILDPSA 109
Query: 171 GKKSFWLPYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ PYIR L D+ R +PL +S+TEL L GS L+R + E
Sbjct: 110 SSELAHWPYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSE 161
Query: 228 YNE-------LDTVW--------FMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLA 272
+ + +D W +++ S + L P P L++ +L
Sbjct: 162 WEQCQKTIATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSP-NPSLVSTEESYPVLLP 220
Query: 273 AVD----------------------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
+D + + LV+ + AG ++ GP+PN++L++ YGF
Sbjct: 221 GIDSLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELILGYGF 280
Query: 311 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 345
+NP D +V++ N+ Q K V RN +
Sbjct: 281 SLPENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 92/217 (42%), Gaps = 11/217 (5%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P+ + + A D + R ++ GE I + GP+ NS ++ GFV
Sbjct: 31 ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM-LPYLR 370
DN D + ++ D +++ + + SV F + G E ISDM L +LR
Sbjct: 91 YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148
Query: 371 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 424
+ + +E+ + S C + +E V L + +A YP TL ED
Sbjct: 149 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 207
Query: 425 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
+L + L K++ QL EKK+L L+ I
Sbjct: 208 ELL-ETTLPQVKKMTVQLRVTEKKILLGALEYVEQWI 243
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 146/366 (39%), Gaps = 79/366 (21%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQKVS-LARRFA-LVPLGPPLLAYSSK----------- 266
R + + +EL + SL +++ A +F LG + A SS+
Sbjct: 157 RCRFLLQFSSELAKL----ESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGVTNKL 212
Query: 267 CKAMLAAVDDA------------------------VQLVVDRPYKAGESIVVWCGPQPNS 302
C AM+ + D +++V R + G +I + GP N
Sbjct: 213 CSAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSND 272
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH-- 352
LL++YGFV DNP+DR+ L + ++ RM+A R G S QV
Sbjct: 273 LLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLK 328
Query: 353 ------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPC 396
V G +E +L LR+ + E + ++S + G VS
Sbjct: 329 QLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVSSD 388
Query: 397 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
E VL L + T+ EDEA L+D +L R+A Q +K+++ L+
Sbjct: 389 NEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLES 448
Query: 457 TADMIM 462
+M
Sbjct: 449 LKKRLM 454
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 20/225 (8%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
+++PL L A + + A + ++ +++ +P AGE I G P S LL YG+
Sbjct: 211 LSMIPLADMLNADAERNNARIYYENEDLEMRTIKPIMAGEEIFNDYGQLPRSDLLRRYGY 270
Query: 311 VDED-NPYDRLVVEAA----LNTEDPQ---------------YQDKRMVAQRNGKLSVQV 350
V E+ YD + + +A L TE PQ +++ +A R G L
Sbjct: 271 VTENYAQYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVALADREGILEDSY 330
Query: 351 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 410
A E+ AI D L L ++ D ++++++S + S V L +
Sbjct: 331 DVNIANAEERAIPDELLALLYLFLLDNENLEAIVTSQSALPSRSKLATELVGKVLVKVLR 390
Query: 411 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 455
R A Y TL EDE +L NL + +A Q+ EK++L ++
Sbjct: 391 HREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRLAVE 435
>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
Length = 479
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 122/313 (38%), Gaps = 45/313 (14%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +PN L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W PY R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSK----C 267
+I + DT+ L Q S R F P PPL + S+ C
Sbjct: 135 AVVGKIGKAVAD--------DTILQKVVPLIQANS--RYFPPRPNMPPLNSPDSQNALLC 184
Query: 268 KAM-LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
A + ++ A +++ +A E Y DED P +V A +
Sbjct: 185 LAHRMGSIIMAYAFDIEKTDEADEDTAE-----------DGYMTDDEDEPAKGMVPLADI 233
Query: 327 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQS 382
D Q + R+ + + + ++H+G E LP R GYV+D
Sbjct: 234 FNADAQRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDNYAQYD 293
Query: 383 VIS-SLGPICPVS 394
V+ SL IC V+
Sbjct: 294 VVEFSLDAICKVA 306
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 100/430 (23%), Positives = 167/430 (38%), Gaps = 76/430 (17%)
Query: 83 SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
+W+ NG K+ L + N + A EDL + FSVP S ++T E
Sbjct: 16 NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E + + + LS + +AL + G +S W PY L ++ ++ + WS
Sbjct: 72 GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120
Query: 202 ETELAYLTGSPTKAEI----------------LERAEGIKREYNE--------------- 230
+ EL +L GS +I +ER K NE
Sbjct: 121 DRELRHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMA 180
Query: 231 ----LDTVWFMA---GSLFQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
L+T A G +++ S A +VPL L A + + A L DD V +
Sbjct: 181 YGFDLETSSDQAKNDGEEWEEDSDAGETLPKGMVPLADMLNADADRNNAKLFYEDDKVVM 240
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN---------PYDRLVVEAALNTEDP 331
+P KAGE + G P + LL YG++ DN P D + A L T+D
Sbjct: 241 KTIKPVKAGEELYNDFGSLPRADLLRRYGYL-TDNYAQYDVVEIPADLIKERAGLRTQD- 298
Query: 332 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 391
++ A+ G L A E+ + L L +E + V +
Sbjct: 299 -VDERWQYAEEQGVLDDGYDVSRASSEEGQFPEELCVLLNLLALPRAEFEKVKNKDKIPK 357
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-VATQLVRMEKKML 450
P + +L + Y R A YP + + M +D +L+ ++R +A +++ EK++L
Sbjct: 358 PDLTTNAKKLLRTILVY---RYAAYPGNVDQ---MHSDVSLNDRRRKMAIVVIQGEKQVL 411
Query: 451 NACLQVTADM 460
+ +++
Sbjct: 412 QEAVDAISEI 421
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 106/245 (43%), Gaps = 44/245 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V D+ + S P++L + + + A+ +T ++ A L ++L E +GK
Sbjct: 43 VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA----------------- 215
+SFW PY+ L R+ ++PL +S+ +L +L G +A
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFEAA 152
Query: 216 -EILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFA-----------LVPLGPPLLAY 263
IL+R EG EY + ++ A ++F S + +P+ PL+
Sbjct: 153 VSILQR-EGYDTEYYTWE-LYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPILFPLIDS 210
Query: 264 SSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
+ A + D +++++ AG + GP+ N +LL+ YGF NP+D
Sbjct: 211 LNHYPATIITWQPSDTSLRIISGVGVSAGAEVYNNYGPKANEELLMGYGFTLLQNPFDSF 270
Query: 321 VVEAA 325
+++++
Sbjct: 271 LLKSS 275
>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
Length = 488
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 124/314 (39%), Gaps = 41/314 (13%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ LV++++ +
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSS 265
L GS +I + A DT+ L Q + R F P PPL + S
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQ--ANPRHFPPRPNMPPLNSSDS 184
Query: 266 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 325
+ A+L +++ + ++ V + Y DED P +V A
Sbjct: 185 Q-NALLCLAHRMGSIIMAYAFDIEKTDEVDEDTAEDG-----YMTDDEDEPAKGMVPLAD 238
Query: 326 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQ 381
+ D Q + R+ + + + ++H+G E LP R GYV+D
Sbjct: 239 IFNADAQRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDNYAQY 298
Query: 382 SVIS-SLGPICPVS 394
V+ SL IC V+
Sbjct: 299 DVVEFSLDSICKVA 312
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)
Query: 286 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 345
Y+ GE I ++ G + N+ L++ GFV DN + + +LN + Q++ ++ + ++ G
Sbjct: 74 YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133
Query: 346 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 396
S F+V G IS ++L + R LG+ +QS + L P C P
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188
Query: 397 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 446
+E ++ +++ Y RL TL +DEA+L + L K + Q +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248
Query: 447 KKMLNACLQ 455
KK+L+ L+
Sbjct: 249 KKILSEALE 257
>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 485
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/319 (22%), Positives = 125/319 (39%), Gaps = 51/319 (15%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ L+++++ +
Sbjct: 24 LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSS 265
+ L GS +I + A DT+ L Q + F P PPL + S
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQ--ANPHHFPPRPNMPPLNSPDS 184
Query: 266 K----CKAM-LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
+ C A + ++ A +++ +A E Y DED P +
Sbjct: 185 QNALLCLAHRMGSIIMAYAFDIEKADEADEDTAE-----------DGYMTDDEDEPAKGM 233
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSD 376
V A + D Q + R+ + + + ++H+G E LP R GYV+D
Sbjct: 234 VPLADIFNADAQRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTD 293
Query: 377 TSEMQSVIS-SLGPICPVS 394
V+ SL IC V+
Sbjct: 294 NYAQYDVVEFSLDGICKVA 312
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/295 (20%), Positives = 118/295 (40%), Gaps = 49/295 (16%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA---GSLFQK 244
+ +P+ + +L L GSP L+ I R+Y ++ ++ S+ K
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSNDPASVILK 245
Query: 245 VSLARRF--------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAV 278
+ + AL+PL S + +
Sbjct: 246 DTFTYEYYRWAVSTLMSRQNTVPSSDNPSENVSALIPLWDMFNHRSGRLSTDFVKSSNVC 305
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 333
D Y A E + ++ G + N+ L++ GFV DN +D + + ++ DP Y
Sbjct: 306 VCYADGDYAADEQVYIFYGVRTNADFLVHNGFVYPDNEHDAVKIRLGVSRSDPLY 360
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/367 (20%), Positives = 157/367 (42%), Gaps = 68/367 (18%)
Query: 139 VLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+L NE I E+ + +N + LA+ L+ E + KKSFW PYI L
Sbjct: 108 ILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFPYIGILP 167
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPT-------KAEILERAEGIKRE---------- 227
+ L W EL ++ GS K ++ + E + +E
Sbjct: 168 EEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNKEVFQRNRLLYP 222
Query: 228 ---YNELDTVWFMAGSLFQKVSL--ARRFALVPLGPPLLAYSSKCKAMLAA----VDDAV 278
+N + W M+ L + +SL ++ L+P LL ++ + ++ + D+
Sbjct: 223 SKIFNYQNWEWSMSILLSRTISLQETKKVVLIPY-IDLLNHNPFSSSFISYRKIPLSDSK 281
Query: 279 QLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 336
++VV D+ + + + G + N +LL YGF+ E NPYD +++ +++ +D +++K
Sbjct: 282 EIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIAERNPYDSVIIRISMSPKDIFFKEK 341
Query: 337 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----GYVSDTSEMQSVISSLGPICP 392
+ N K + + + + +M+ ++++ ++D + + I +
Sbjct: 342 KSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIKICLFNTNINDKNFNLNKIENYDYTKI 398
Query: 393 VSPC----MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 448
+ C +E+++ DY R L E+ ++D ++++ + +EKK
Sbjct: 399 IKSCIVTVIEKSLNSNYNDYENLR----NIMLKENLLHISD-----NQKISIKYNALEKK 449
Query: 449 MLNACLQ 455
+LN L+
Sbjct: 450 ILNRFLE 456
>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
Length = 762
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 113/280 (40%), Gaps = 41/280 (14%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
GS VV+ + +W+ G K+ +K H R + AA E G+
Sbjct: 11 GSSAVVTS------EFVAWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETL 57
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPY 179
SVP +L++T+++ +A L + + LAL+L ++ + W PY
Sbjct: 58 LSVPEALLLTVDKASRRADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPY 113
Query: 180 IRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF--- 236
+ R P W + A L GSP +++ R + I+R+ L
Sbjct: 114 RNVISRS------VSHLPCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDR 167
Query: 237 MAGSLFQKVSLARRFA------LVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 289
A + + L+R FA +VP + A + A A + R AG
Sbjct: 168 QAFAFAEAQVLSRAFAFNGTRAMVPFADLMNTARHHERHVDFAFERGAFVMRAVRRGAAG 227
Query: 290 ESIVVWCGPQPNSKLLINYGFVDEDN--PYDRLVVEAALN 327
E + GP+ N++ L+NYGF DN RL+ +AAL+
Sbjct: 228 EPVTDSYGPKSNARYLLNYGFAMADNRDEAGRLLDDAALD 267
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 82/210 (39%), Gaps = 54/210 (25%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L+++ L L L L+YE +G S W PY+ L E+P+ W+ EL L
Sbjct: 96 LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148
Query: 209 TGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPL----GPPLLAY 263
T ++ E AE R L V +FQ + ALV L G ++AY
Sbjct: 149 QAGATAGKVGRESAEDTFRGI--LLPVVRAHPDVFQGSAALSDEALVALAHRMGSTIMAY 206
Query: 264 S---------------------SKCKAMLAAV-------------------DDAVQLVVD 283
+ KAM+ V D+ + +
Sbjct: 207 AFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPMADILNADAEFNAHVNHGDNELTVTAL 266
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDE 313
RP KAGE I+ + GP PNS+LL YG+V E
Sbjct: 267 RPIKAGEEILNYYGPHPNSELLRRYGYVTE 296
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 145/371 (39%), Gaps = 70/371 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
VAA +D+ + FS+P S ++++E + + I T L L L ++YE G
Sbjct: 40 VAAKQDIAEHELLFSIPRSSILSVENSILSTEIPPT-TFALLGPWLSLILVMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E EL L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQKVSLARRFA----------- 252
ERA+ + +E + + M GSL + A
Sbjct: 152 VIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEPAISDKEVDEEGF 211
Query: 253 ------------LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 300
+VPL L A + +C A L D +++ +P +AGE I GP P
Sbjct: 212 AEEEEDAALPKGMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEIFNDYGPLP 271
Query: 301 NSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKLSVQVFHVHA 355
S LL YG++ E+ YD + + A L ++ D + +KR+ ++ + + A
Sbjct: 272 RSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEKRIEYLDEQEIVDTGYDIAA 331
Query: 356 G---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 412
+E++S L L + + E + + S G + P + M + L +AR
Sbjct: 332 SVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PKAEKMTGKAAEILYKIVQAR 389
Query: 413 LAGYPATLSED 423
+A YP TL +D
Sbjct: 390 IAQYPTTLEQD 400
>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 889
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/397 (22%), Positives = 156/397 (39%), Gaps = 61/397 (15%)
Query: 79 GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
G L+ W+ GLPP KV + + V E L +VP L++T +
Sbjct: 32 GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
L + LL + + + LA +L ++Q G K+ W Y+ L Q G
Sbjct: 86 ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPL 256
L W+ E+ L G+ R +E+ + + S+ + + R A +P
Sbjct: 139 VLEWASEEVDLLRGTAA-----------MRAADEI--IAACSASVAELAPILRESASMPG 185
Query: 257 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINYGFVDE- 313
GP +ML + ++ P K V+ G + +++LL++YGF+
Sbjct: 186 GPLTEQDLRWGFSML------LSRLIRLPGKQDLEACVYVSYGQKSDTQLLLSYGFMPAP 239
Query: 314 -DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 372
NP+ + +L +DP + KR + + G + F + D LP +
Sbjct: 240 LSNPHSACNLRLSLQRDDPCFDAKRALLEEAGHSACMEFPLRL--------DSLPQKLIN 291
Query: 373 YVS---DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 429
Y + + + V+S + V + ++ + K L PATL+ D+ L +
Sbjct: 292 YAAFLCTEAPDRRVVSQIDQTGVVDGSARKGAINLITKECKVALTAIPATLAADQEALNN 351
Query: 430 ----------YNLHPKKRVATQL-VRM-EKKMLNACL 454
N+ +R A L VR+ E+K+LN L
Sbjct: 352 IMAQLAGKDILNVTSLQRRADILKVRINERKILNKNL 388
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 75/175 (42%), Gaps = 25/175 (14%)
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 227 EYNELDTVWFMAGSLFQKVS---LARRFALVPLGP------------PLLAYSSKCKAML 271
E+ ++ + LF KVS +A V P P+L + + A
Sbjct: 558 EFGQVQNALDVWPQLFGKVSVEDFMHVYATVFSRPLAIGEDSTLVMIPMLDFFNHNAASF 617
Query: 272 AAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
A + + + DR + I + CG N++L ++YGF +N YD ++
Sbjct: 618 AKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFTVPENRYDEVM 672
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 154/376 (40%), Gaps = 51/376 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E I + + +L + L+L+ E+ + +
Sbjct: 84 LVALKDISRNDVILQVPKRLWINPDAVAASE-IGRVCS--ELKPWLSVILFLIRERSR-E 139
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 140 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 192
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R +VP+ +
Sbjct: 193 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 252
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 253 AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRHAY 312
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 379
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 313 TLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTDAF 370
Query: 380 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL +
Sbjct: 371 LLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLDSR 429
Query: 436 KRVATQLVRMEKKMLN 451
+A + EK +L
Sbjct: 430 LAIAVGIREGEKMVLQ 445
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 108/280 (38%), Gaps = 55/280 (19%)
Query: 81 LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L +WM NG K+ L++ P V A E L G+ +P SL+++
Sbjct: 31 LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84
Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L +++ T KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 85 L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------------------D 232
P+ T L + E+ R I+R + +L +
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVLLGGHVEEEPEVQSLSTGFTWNN 194
Query: 233 TVW----------FMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 282
VW F GS + AL P L + K A V + +++
Sbjct: 195 FVWAWTAVNTRCIFAQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEILS 253
Query: 283 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
+ + A E + + GP N +L ++YGFV DNP D +VV
Sbjct: 254 HKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 293
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 78/367 (21%), Positives = 151/367 (41%), Gaps = 78/367 (21%)
Query: 81 LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L+ W+ + +G+ KV L+ P V A+ L+ G+ F +P S + E V
Sbjct: 29 LRIWLEEEHGVDMSKVDLQRSPLEGLG------VFANRRLEPGETLFMIPKSCCIYPELV 82
Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + + + KL+ E+ LA +L EK +G +S + P+I L
Sbjct: 83 FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133
Query: 193 AVESPLLWSETELAYLTGSPTKAEIL---ERAE-------------GIKREYNELDT--- 233
PLLW++ E+ L G+ EIL E+ E G K+ + ++T
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKGWKQFFQTIETEKM 193
Query: 234 ----VWFMAGSLFQKVSLARRF--------------ALVPLGPPLLAYSSKCKAMLAAVD 275
FM F V L+R F ++PL L ++ D
Sbjct: 194 TPEEFGFMMRGAFASV-LSRAFDSKIGRGDKGLEERVVIPL---LDIFNHGSYGPSITFD 249
Query: 276 DAVQLVVDRPY-----------KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 324
A++ ++ + + GE + + G +PN +L YGFV + +
Sbjct: 250 TALERDNEKGFPVRVADKGKSIEEGEELFGFYGDKPNWNMLTTYGFVSPNPKCQETTLSV 309
Query: 325 ALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 383
+++ +DP + K + + G ++V Q+F + + + + ++ Y R+ +S+ +++ V
Sbjct: 310 SIDEKDPYFAQKEEILKARGMVAVEQLFDIR--HDTDPMGPLINYFRIREISNEADLTKV 367
Query: 384 ISSLGPI 390
++ G +
Sbjct: 368 QTNYGEM 374
>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
Length = 689
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
+ A +D+ A +PNS ++++ RV + + ++L+ ++ ++ CLA++LM
Sbjct: 74 IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
E QG++SFW PYI ++ ES LL W + E+ L + E +
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183
Query: 224 IKREYNELDTV 234
I+ E+N+L +
Sbjct: 184 IEDEWNKLSKI 194
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 89/394 (22%), Positives = 156/394 (39%), Gaps = 59/394 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+K+ E L + K W+ N P + L +K + + A + ++ D S+
Sbjct: 34 TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85
Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
P ++++ + G + I E + + LS A+++MY K +KSFW PY+ L
Sbjct: 86 PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE---------------- 227
+ + L +S+ EL L S K + R +GI+R
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHYESTFSRLSKLVPEFS 195
Query: 228 ----YNELDTVWFMAGSLFQKVSLARR-FALVPLGPPLLAYS-SKCKAMLAAVDDAVQLV 281
YN+ W ++ + SLA +VPL A SK K + D +
Sbjct: 196 NLALYNQELFTWALSCVWSRAFSLAENDGGMVPLADMFNAEDRSKSKVLPKVTDTTLDYY 255
Query: 282 VDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA-ALNTEDPQYQDK 336
GE I G P +S++L++YGF+ DE D + + + ++P K
Sbjct: 256 ASDDIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGTVSDNVAITVPVFHNDEPNLSTK 315
Query: 337 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP---ICPV 393
+ + + N ++ +VF + A D+L Y R+ + Q+ L P P+
Sbjct: 316 QEILEENDIIN-EVFLLQKTDPLPA--DLLLYARVKNLIAKECDQAKKHFLSPNTRNTPL 372
Query: 394 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
+ E+ L L + L Y L D+ +L
Sbjct: 373 NTRNEKVSLRFLENLIHRYLDSYGTNLESDKNLL 406
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 143/366 (39%), Gaps = 79/366 (21%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQKVS--LARRFALVPLGPPLLAYSSK----------- 266
R + + EL + SL ++V + LG + A SS+
Sbjct: 157 RCRFLLQFSAELAKL----ESLPEEVHPFAGQSVDSGALGWAMAAVSSRAFRIHGVTNKL 212
Query: 267 CKAMLAAVDDA------------------------VQLVVDRPYKAGESIVVWCGPQPNS 302
C AM+ + D +++V R + G +I + GP N
Sbjct: 213 CSAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSND 272
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH-- 352
LL++YGFV DNP+DR+ L + ++ RM+A R G S QV
Sbjct: 273 LLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLK 328
Query: 353 ------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPC 396
V G +E +L LR+ + E + ++S + G VS
Sbjct: 329 QLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVSSD 388
Query: 397 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
E VL L + T+ EDEA L+D +L R+A Q +K+++ L+
Sbjct: 389 NEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLES 448
Query: 457 TADMIM 462
+M
Sbjct: 449 LKKRLM 454
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 86/443 (19%), Positives = 175/443 (39%), Gaps = 50/443 (11%)
Query: 59 SSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASED 118
S L S+ + + + +L W+ KV ++ K +E +R + AS+
Sbjct: 21 SESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQF 74
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSFW 176
++ G+ VP + ++LE V + I +++ N + + + + + ++ + + SFW
Sbjct: 75 IRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSFW 134
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN------- 229
PYI L + P + + A L GSPT ++ + + + EY+
Sbjct: 135 KPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNLKEAVK 188
Query: 230 ELDTVWFMAGSL--FQKVSLARRF-----------ALVPLGPPLLAYSSKCKAMLAAVDD 276
E + F+ ++++R F LVPL + + + D
Sbjct: 189 EFQRYGYTYNDFVKFRTLTISRSFPVYIGENEQQQLLVPLAD-FINHDNNGFLQYGYSPD 247
Query: 277 AVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
A + R + GE + G N +NYGF NP ++ + L+ D +
Sbjct: 248 ADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFASLTNPMNQFDFDICLDRNDRMF- 306
Query: 335 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPIC 391
+M + G + + + L +R +S D +++ + +
Sbjct: 307 --KMKVELTGGNICWGNRLVNETDHDTFRQSLATVRFAQISKLDDFLQLEEDVQNYNQFW 364
Query: 392 P---VSPC---MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 445
P +P +E+A L D + LA + +T+ +D+ L D + +R L
Sbjct: 365 PGWHTTPKTIELEKATFKALRDLLVSELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMR 424
Query: 446 EKKMLNACLQVTADMIMLLPDVT 468
EK+++ ++V DM++ + D T
Sbjct: 425 EKQIIKKNIEV-CDMMLSVIDKT 446
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 108/276 (39%), Gaps = 43/276 (15%)
Query: 79 GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
G L +WM NG K+ L++ P V A E L G+ +P +L+++
Sbjct: 30 GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83
Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYI---------------- 180
L + + ++ + KL+ + L L+++ +K G+ S W P++
Sbjct: 84 TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPRTFTTPVFLRRK 143
Query: 181 ------RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
++L + G ++ L +L L G + E + +N
Sbjct: 144 VFESLPKDLREEVQTGITFIQRTFL----KLKVLLGGHVEEEPEVQCLSTGFTWNNFVWA 199
Query: 235 W--------FMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 286
W F GS + AL P L + K A V + +++ + +
Sbjct: 200 WTAVNTRCIFAQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEILSHKSH 258
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
A E + + GP N +L ++YGFV DNP D +VV
Sbjct: 259 DANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 294
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 111/294 (37%), Gaps = 66/294 (22%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G I E + + R V AS+D+ + F +P + ++++E +L
Sbjct: 13 AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S ++ ERA+ +E L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180
Query: 235 WFM--AGSLFQKVSLARRFA-----------------------LVPLGPPLLAYSSKCKA 269
M GSL + A +VPL L A + +C A
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEGFAEEEEDAALPKGMVPLADMLNADADRCNA 240
Query: 270 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
L D +++ +P +AGE I GP P S LL YG+V DN VVE
Sbjct: 241 RLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDVVE 293
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 55.8 bits (133), Expect = 4e-05, Method: Composition-based stats.
Identities = 70/305 (22%), Positives = 134/305 (43%), Gaps = 40/305 (13%)
Query: 97 LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
L++ SH EK + +Y A A+ D++ G+ VP ++TLE + G +
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213
Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
+ L +S + + L+ Y+M EK++ +S W YI L + P+ ++E E
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRK-PESQWQIYIDILPKNFS------NFPIFFTEEER 266
Query: 206 AYLTGSPTKAEILERAEGIKREYNEL-----DTVWF--MAGSLFQKVSLARRFALVPLG- 257
+L GSP +ILE+ E IK +Y+ + + V F S + + +R F + G
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYDLICKEVPEYVQFPIREYSEIRMMVSSRIFGIQIEGV 326
Query: 258 --------PPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLIN 307
+L + + D+ +++ + GE + G + NS+ +N
Sbjct: 327 KTDGFVAYADMLNHKRPRQTSWTYTDEKQGFIIEAMEDIQRGEQVYDSYGKKCNSRFFLN 386
Query: 308 YGFVDEDNPYDRLVVEAALNTEDPQYQDKR-MVAQRNGKLSVQVFHVHAGREKEAISDML 366
YGF++ +N + + ++ +T+D Q K+ M+ + + F V E + +
Sbjct: 387 YGFINLNNDANEVPIKVYYHTDDQLKQVKQDMIVDHS---EFKKFRVVENLEDRVMQEFF 443
Query: 367 PYLRL 371
+LR
Sbjct: 444 SWLRF 448
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 152/376 (40%), Gaps = 51/376 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E ++L + L+L+ E+ + +
Sbjct: 40 LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 95
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 96 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 148
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R +VP+ +
Sbjct: 149 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 208
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 209 AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRHAY 268
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 379
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 269 TLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTDAF 326
Query: 380 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL +
Sbjct: 327 LLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLDSR 385
Query: 436 KRVATQLVRMEKKMLN 451
+A + EK +L
Sbjct: 386 LAIAVGIREGEKMVLQ 401
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 93/433 (21%), Positives = 172/433 (39%), Gaps = 73/433 (16%)
Query: 88 NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
+ LPP L+ +H+ K P+ V A++ + + FS+P L+++ E
Sbjct: 75 DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131
Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ TI + + LS++ +AL + + + KSFW PY+ L + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL----------------------DTV 234
+ ++ E+ L GSP + L I R+Y+ D
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYR 243
Query: 235 WFMAGSLF--------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL------ 280
W ++ + +++S + L P+ L+ C QL
Sbjct: 244 WAVSTVMTRQNLIPSQEEISGNDKDQLPPVNA-LIPLWDFCNHQDGQFSTEFQLESRRTV 302
Query: 281 -VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 339
R + GE + ++ G + ++ I+ GFVD +N +D L ++ L+ DP + +
Sbjct: 303 CQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNAHDALTLKVGLSKSDPLAGQRATL 362
Query: 340 AQRNGKLSVQ------VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 393
+ LS + F + AG + +L +LRL ++ S + + S
Sbjct: 363 LCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLRLFCMTKDSLDRWLQSDNASNLMH 421
Query: 394 SPC-MERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 448
C +E V D+ + KAR L YP T D ML + +L +R+ L EK+
Sbjct: 422 EECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLKMLEE-DLSSHRRMCVLLRLAEKR 480
Query: 449 MLNACLQVTADMI 461
+L + ++ A I
Sbjct: 481 ILLSAIECAAQRI 493
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 108/273 (39%), Gaps = 70/273 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ASED++ + F +P S +++E + I ++ KL+ L LY+M K G
Sbjct: 42 VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
+ W PY L Q ++S ++W++ EL L GS +I G + +Y E
Sbjct: 97 MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQEKL 147
Query: 231 -----------------LDTVWFMAGSLFQKVSLARRF---------------------- 251
L++ M G + +A F
Sbjct: 148 KPIIDAHPEYFKDCDTSLESFHRMGGLI-----MAYSFDAPDSFSEDEEDDEDIEHDDLY 202
Query: 252 ------ALVPLGPPLLAYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWCGPQPNSKL 304
A+VPL L A++ C A L A DD + +P K GE + G PN
Sbjct: 203 NEGLVKAMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYNTYGELPNCDF 262
Query: 305 LINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 336
L YG+V +E +D +VE +++ Y +K
Sbjct: 263 LRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 109/235 (46%), Gaps = 30/235 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP S + + +L +++LL +++ +A LA+ L+ ++K G++S
Sbjct: 51 ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q G + S + WS++EL + S E +++ I++++ + V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162
Query: 235 W--------------FMAGSLFQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDD- 276
FM K + +L+P L + +A++ +D
Sbjct: 163 LEHFPQISRSITFQDFMHAYALVKSRAWGSTKGVSLIPFAD-FLNHDGFSEAVVLNDEDK 221
Query: 277 -AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
++ DR Y E +++ G N+ LL+++GF N +++ VE +N D
Sbjct: 222 QVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQ--VEIQINIPD 274
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 150/395 (37%), Gaps = 68/395 (17%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +SW+ GL ++L+ R + AS L G+ +P+ LV+T ER
Sbjct: 24 EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR---------G 190
G ++ LL LA ++ + W PY+ L ++ G
Sbjct: 78 AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPGTLLDWPAKEVQ 131
Query: 191 QLAVESPL------LWSETELAYLTGSPTKAEILERAEGIKREYNEL---DTVWFMAGSL 241
QL SPL + S ++ P A+ RA+G+ E+ L D W L
Sbjct: 132 QLLRGSPLQRLADSITSAASASWRELEPLIAQ--GRADGLVPEHVPLSKGDLEWAFGVLL 189
Query: 242 FQKVSLARRFALVPLGP-------------------------PLLAYSSKCKAMLAAVDD 276
+ + L R L L P P + + A
Sbjct: 190 SRCIRLPSRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGDRAGG---ATKG 246
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTE-DPQY 333
A+ L DRPY AG+ + V GP+ + +LL++YGF NP+ + A++ + DP
Sbjct: 247 ALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDRQGDPLA 306
Query: 334 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSVISSLGPIC 391
K R+G S F + E + L +L R +T E+ SV+ G
Sbjct: 307 DLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASVLFESGGF- 365
Query: 392 PVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 424
P+ + V L L++ A L YP ++ D+
Sbjct: 366 PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 114/288 (39%), Gaps = 53/288 (18%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+E L WM ++G CK + ++ R + ++L+ GD+ +P L+V
Sbjct: 40 DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95
Query: 135 TLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + +L E + ++ K + + +L+ E+ +GK SFW PYI L +
Sbjct: 96 TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148
Query: 193 AVESPLLWSETELAYLTGSPTKA----------------------EILERAEGIKREYNE 230
+P S T+ A PT A E +ER R +
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIFFS 206
Query: 231 LDTV---WFMAGSLFQKVS----------LARRFALVPLGPPLLAYS--SKCKAMLAAVD 275
LD+ WF+ S + + FAL P LL +S ++ A V
Sbjct: 207 LDSFVWAWFVINSRSVYIEPSGCEAFDPKASDDFALAPFL-DLLNHSPGAEVTAGFDPVS 265
Query: 276 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ ++ Y A + + + GP N LL+ YGFV NP+D + E
Sbjct: 266 NCYRIKTLDSYHAYDQVFIHYGPHDNVNLLLEYGFVIPSNPHDAVSFE 313
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 86/205 (41%), Gaps = 38/205 (18%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYN----ELDTVWF------------MAGSLFQKVSLARRF----------- 251
ER + + +Y+ +L F A ++ + + +
Sbjct: 160 ERIKIWEEKYDSGYLQLGATGFPDCETYTWELYLWASTIITSRAFSAKVLSGAVQPGDLP 219
Query: 252 -----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 306
AL+PL L + K A D + L+V + AG+ I GP+ N +LLI
Sbjct: 220 EDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGPRNNEQLLI 278
Query: 307 NYGFVDEDNPYDRLVVEAALNTEDP 331
NYGF NP D +V + + P
Sbjct: 279 NYGFCIAGNPTDYRIVHLGVKPDSP 303
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 95/247 (38%), Gaps = 54/247 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ G+ SVP SLVV + R+L E N L
Sbjct: 59 VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162
Query: 221 AEGIKREYNELDTVWFM--------------AGSLFQKVSLARRF-------ALVPLGP- 258
I+ +Y+ + T F+ A V +R F LVP
Sbjct: 163 RRAIESQYDTIFTPLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGLVPWADM 222
Query: 259 -PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKLLINYGFVDED 314
+ + + K + AVD + P K GE I V G N++LL++YGFV E+
Sbjct: 223 FNMAPETEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLEN 282
Query: 315 NPYDRLV 321
NP+D +V
Sbjct: 283 NPHDAVV 289
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 152/376 (40%), Gaps = 51/376 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ D VP L + + V +E ++L + L+L+ E+ + +
Sbjct: 36 LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL L GS + E +K E +L+
Sbjct: 92 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 144
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R +VP+ +
Sbjct: 145 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 204
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
AY K A L + D L KAGE + + + + N++L ++YGF++ +
Sbjct: 205 AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRHAY 264
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE- 379
+ ++ DP + DK VA+ NG F + R +LPYLRL + T
Sbjct: 265 TLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTDAF 322
Query: 380 -MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 435
++S+ + G + VS E + + + K+ LAGY T+ +D L + NL +
Sbjct: 323 LLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLDSR 381
Query: 436 KRVATQLVRMEKKMLN 451
+A + EK +L
Sbjct: 382 LAIAVGIREGEKMVLQ 397
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 68/271 (25%), Positives = 110/271 (40%), Gaps = 43/271 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P L+++ E + +E L N ++ L L+ M E+
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295
Query: 232 -----------------DTVW----FMAGSLFQKVSLAR-RFALVPLGPPLLAYSSKCKA 269
D +W + + S+ +S + LVP+ L S
Sbjct: 296 LCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHSVSPHIL 355
Query: 270 MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAA 325
VD+A ++ + RP AGE + G P S L+ YGF+ DNPYD + ++
Sbjct: 356 NYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYDVIPLDL- 414
Query: 326 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 356
D D+ + AQ + S Q H+ G
Sbjct: 415 ----DTSVDDEDIAAQSSATTS-QTTHMVRG 440
>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
Length = 658
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 10/77 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
V+A ++ A ++PN L+++ +VL +E ++++ T+K +E CLALY
Sbjct: 48 VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106
Query: 164 LMYEKKQGKKSFWLPYI 180
+ Y K QG KSFW PY+
Sbjct: 107 IFYHKLQGDKSFWYPYL 123
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 92/411 (22%), Positives = 163/411 (39%), Gaps = 71/411 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQG 171
V A DL AG+ VP SL++ +E + + +L +LS+ +A +L+YE +
Sbjct: 79 VFALRDLAAGETVLRVPLSLLLNVEHASAS-PLGGILDDFRLSDAEAMAFWLIYELTRPE 137
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S WLPY+ L QL + + E+ L SP R ++ ++ +
Sbjct: 138 RASPWLPYLESL--PASIKQLT----MFYDPFEMKRLQASPVAEFTSRRTVKMRNKFGKY 191
Query: 232 -------------------------DTVWFMAGSLFQKVSLA----------RRFALVPL 256
D +W MA + +++ R LVPL
Sbjct: 192 REQISKHRPAHLAEIEFPVELITVDDFLWAMAVQFTRLITVQVKHPADGEWERTKCLVPL 251
Query: 257 G-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQ--PNSKLLINY 308
P + +C L + + RP G+ ++ + G + N +L+++Y
Sbjct: 252 ADLLNTAPADQINVECATNLDSTH--FECATIRPVAEGQELLTPYGGAEQLSNGQLIMDY 309
Query: 309 GFVDEDNPYDRLVV------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 362
G +NP D + + E A+ + M R +L + V KE
Sbjct: 310 GVTFRNNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQLPVLDHFESIPKE-- 367
Query: 363 SDMLPYLRLGYVS---DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 419
+L + R+ YVS D S+++ V+ + ++P ER L+ L + Y T
Sbjct: 368 --LLAFARV-YVSTPSDLSDLEHVLELMKEHRAINPSNERRALELLLQLTNEMILKYITT 424
Query: 420 LSEDEAMLTDYNLH--PKKRVATQLV-RM-EKKMLNACLQVTADMIMLLPD 466
+ EDE ML + + P +V R+ EK++L++ Q+ I LP+
Sbjct: 425 IEEDETMLRELDAESVPNANAVNAVVLRLGEKRILSSLWQLLDSAIEALPE 475
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 115/282 (40%), Gaps = 46/282 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNE---KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
K+W+ NG + ++ +E + P V A D++ G++ +P+S T E
Sbjct: 4 FKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACFTCE 63
Query: 138 RVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + ++ L+ + LA L L+ E+ G S W Y+ L +
Sbjct: 64 NCAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSE------PDV 117
Query: 197 PLLWSET--ELAYLTGSPTKAEILERAEGIKREYNE-----LDTVWFMAGSL-------F 242
++WSE YL G+ + + + + E+ LDT+ A +
Sbjct: 118 VMMWSEDGERRRYLCGTDIEQSLRDERAAARTEWTRHVKPVLDTLRGAAKDVGFDDFLAA 177
Query: 243 QKVSLARRFALVP-LGPPLL--------------AYSSKCKAMLA-------AVDDAVQL 280
+ V+ +R F + P +G L+ Y S + A + DDA+ +
Sbjct: 178 RSVASSRAFTVNPRVGAGLVPIADLFNHRTGGHHVYLSDARGTAAVSERDEGSDDDALFV 237
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
V + KAGE + G N+KLL +YGF DNP D++ +
Sbjct: 238 RVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNPADKVTI 279
>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
Length = 1709
Score = 54.7 bits (130), Expect = 1e-04, Method: Composition-based stats.
Identities = 43/162 (26%), Positives = 76/162 (46%), Gaps = 37/162 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
+AA +D+ ++PN L+++ ++V G + +++ N+ EL C
Sbjct: 954 IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013
Query: 160 -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
LALYLMYEK +G++SFW PY EL+++ + L WS ELA S
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPL 256
I +E N+ + ++F+ S+ + + +L+ L
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVLNVLQEKQNKSLISL 1096
>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
Length = 485
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 117/294 (39%), Gaps = 44/294 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ AS D+ + F +P L+++++ + L +L L + ++YE QG+
Sbjct: 51 ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W PY L + ++ + W++ +L+ L GS +I + A D
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKIGKAAAD--------D 153
Query: 233 TVWFMAGSLFQKVSLAR--RFALVPLGPP-----LLAYSSKCKAMLAAVDDAVQLVVDRP 285
T+ L Q SL R + PL P LL+ + + +++ A +++
Sbjct: 154 TILQKVVPLIQANSLHFPPRSDMPPLNSPDSQSALLSLAHRMASLIM----AYAFDIEKA 209
Query: 286 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 345
+A E Y DED P +V A + D Q + R+ +
Sbjct: 210 EEADEDTAE-----------DGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSF 258
Query: 346 LSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 394
+ V ++H+G E LP R GYV+D V+ SL IC V+
Sbjct: 259 VMKAVRNIHSGEEIFNDYGELPRADLLRRYGYVTDNYTQYDVVEFSLDSICKVA 312
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 157/390 (40%), Gaps = 70/390 (17%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
FS+P S V L + L + +L+E + LAL ++ + Q S W PY+
Sbjct: 55 FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER-AEGIKREYNELDTVWFMAGSL 241
L Q + S + WS++ELA L S +I ++ AE + + Y + + +
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161
Query: 242 FQKVS---LARRF----------------------------------ALVPLGPPLLAYS 264
KV+ +A F +++PL L A +
Sbjct: 162 CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEKTILSMIPLADMLNADA 221
Query: 265 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE 323
+ A L ++ +++ +P GE I G P S LL YG+V D + YD +
Sbjct: 222 DRNNARLICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDLLRRYGYVTDGYSAYDVAEIS 281
Query: 324 AAL----------NTEDPQY-QDKRMV----AQRNGKLSVQVFHVHAGREKEAISDMLPY 368
A L + P+ QDK V A+R G VH+ ++ +I D L
Sbjct: 282 AELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDESFDLVHSSPDEPSIPDELLA 341
Query: 369 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 428
+ D S +++++ S + S LA +AR Y T+ EDE +L
Sbjct: 342 FLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAILLQARENEYSTTVEEDEDLLK 401
Query: 429 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
+ +L + +A Q+ EKK+L A ++ A
Sbjct: 402 NADLPIRTAMAIQVRSGEKKVLRAAIREAA 431
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
DD + V + + AGE I ++ G + N++ +I+ GF ++N +DR+ ++ ++ D Y
Sbjct: 42 DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101
Query: 335 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 386
K V R G + VF +H E + +L +LR+ +++ + +I +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160
Query: 387 LG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 441
LG PVS E +L + +AR L Y T+ ED++ L +L +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216
Query: 442 LVRMEKKMLNACLQVTA 458
L EK++L ++ A
Sbjct: 217 LRLGEKEILEKAVKSAA 233
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 142/356 (39%), Gaps = 64/356 (17%)
Query: 97 LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
LK++ + RP + + ++ LQ D S+P ++T + VL + + E +
Sbjct: 39 LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97
Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELD---------------------RQ 186
K +S L L +L+ EK G+KS W PY+ L R+
Sbjct: 98 MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPKTYSCPVCLEHDVVSLLPEPLRK 157
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS--LFQK 244
+ + Q + L+ ++ + + P AE E Y+ L+ W + ++ K
Sbjct: 158 KAQEQ-RTKVHELYISSKAFFSSLQPLFAENTETIFN----YSALEWAWCTINTRTIYMK 212
Query: 245 VSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 296
S + F+L P L P LL +S + KA + ++ + K E + +
Sbjct: 213 HSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEVFICY 272
Query: 297 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 356
GP N +LL+ YGFV DNP+ + V +A + DK QRN K+S+ H
Sbjct: 273 GPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSILKDH---- 324
Query: 357 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 412
D+L L G+ + + + + L C R + L D AR
Sbjct: 325 -------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDVISAR 370
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)
Query: 286 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 340
++ GE +++ G + N +L ++YGFV+ + + D + ++ DP + DK +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 393
+ NG + F + G + ML +LRL + T SV L PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251
Query: 394 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
S E A+ + D ++ L+GY T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/386 (19%), Positives = 153/386 (39%), Gaps = 67/386 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
+ A+ +L+ G+ VP ++T E ++ + +++ + N LS L++ L+YE +
Sbjct: 51 LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110
Query: 171 GKKSFWLPYIRELDRQRG---------RGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
KKSFW PY+ + R + L VE + +E A +A L +
Sbjct: 111 EKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKE 170
Query: 222 EGIKREYNELDTVWFMAGSLFQKVSLA---------------------RRFALVPLGPP- 259
+K ++ W A + +L ++ P GP
Sbjct: 171 LELKPKFRSFQ-AWLWASATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDYSNTPQGPES 229
Query: 260 --------LLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINY 308
L+ + + ++ V L R Y+ GE +++ G N +LL +Y
Sbjct: 230 ANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHY 289
Query: 309 GFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 366
GF+ E+N D++ + E +L + + + ++GKLS ++
Sbjct: 290 GFMLEENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFA---------------LI 334
Query: 367 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 426
LRL + + +SV+ + +S E V+ +++ + L P +++ED +
Sbjct: 335 STLRLWLIPQSQRDKSVMRLVYAGSQISVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL 394
Query: 427 LTDYNLHPKKRVATQLVRMEKKMLNA 452
LH ++ +R+E+K A
Sbjct: 395 -----LHNIDKLQDPELRLEQKETEA 415
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 97/247 (39%), Gaps = 43/247 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
V A++D+ G+ VP+ V+ E +E + + TN E L L LM EK
Sbjct: 32 VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----------------- 211
K GK S W Y+ L + PL W +L L G+
Sbjct: 92 KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGTSLIEKMNGCKAMPDRPL 145
Query: 212 --PTKAE-----ILERAEGIKREYNELDT----VWFMAGSLFQKVSLAR-RF-ALVPLGP 258
P K L+ +K +N T VW A ++ RF A+VP+
Sbjct: 146 EPPCKFNSVVLPFLQSNAHLKLPHNAASTRRLYVWATAMVSAYSFTIGEDRFQAMVPMWD 205
Query: 259 PLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-N 315
L + L A A++++ GE ++ G PNS+LL YGFV+ D N
Sbjct: 206 ALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSYGDLPNSELLRRYGFVETDPN 265
Query: 316 PYDRLVV 322
P+D L V
Sbjct: 266 PHDCLEV 272
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)
Query: 260 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
LL +SS+ +++ +++V ++ + GE++V+ GP N LL++YGFV NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177
Query: 320 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 362
LV + +++ +DP ++ + N G S Q+ V G +
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235
Query: 363 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 418
+L +R+ + D E+ V + + P+ ER + L LA +P
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295
Query: 419 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 450
+ ED++ L ++ R+A Q ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 10/100 (10%)
Query: 235 WFMAG--SLFQKVSLA------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 286
WF+ G S + ++ R AL+P+ L ++ C + +A + DR Y
Sbjct: 570 WFLVGTRSFYYEIEETLSYPSHDRLALLPVADVLNHANAGCSVAFST--EAYDITADRAY 627
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
+AGE + G N LL YGFV DNP+D+L ++ L
Sbjct: 628 QAGEEVYTSYGAHSNDFLLAEYGFVLPDNPWDQLCLDKVL 667
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 86/393 (21%), Positives = 155/393 (39%), Gaps = 65/393 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + + V +E + +AL+L+ EK + +
Sbjct: 80 LVAQRDISRNEVVLEIPKKLWINPDVVAASEIGN---VCGGVKPWVSVALFLIREKLK-E 135
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
S W PY+ L + S + WSE ELA L G+ + L ++RE+ ++
Sbjct: 136 DSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYLRREFLKVE 188
Query: 232 ------------------DTVWFMAGSLFQKVSLAR----RFALVPLG------------ 257
D W A + + S +R L+PL
Sbjct: 189 EEILVPHKQLFPSPVTLDDFSW--AFGILRSRSFSRLRGQNLVLIPLADLCNFLHTWLLD 246
Query: 258 -----PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ-PNSKLLIN 307
P + Y K + + D L KAGE +++ N++L ++
Sbjct: 247 QVNHSPDITIEDGVYEIKGAGLFSR-DLIFSLRSPISLKAGEQVLIQYNLNLSNAELAVD 305
Query: 308 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH-VHAGREKEAISDML 366
YGF++ + + + ++ DP + DK +A+ NG + F V + L
Sbjct: 306 YGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIADFDIVLGNPLPPTLLPYL 365
Query: 367 PYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 423
+ LG +D+ ++S+ + G + PVS E + + D K+ L+GY T+ ED
Sbjct: 366 RLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVVRDACKSALSGYHTTIEED 424
Query: 424 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 456
E L L+P+ +A + EKK+L ++
Sbjct: 425 EK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 104/273 (38%), Gaps = 67/273 (24%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
V A+++++ + F +P S V+++E +L E T + L L L ++YE G
Sbjct: 40 VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------------- 217
S W PY L + + + WSE ELA L S A+I
Sbjct: 98 DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANEAFLGQLV 150
Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQKVSLARRFA-------- 252
++AE ++ E N T+ GSL + A
Sbjct: 151 PVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPATPRKDVDE 208
Query: 253 ---------------LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 297
++PL L A + +C A L +++ +P KAGE I G
Sbjct: 209 EGFAEEEEDEALPKGMIPLADMLNADADRCNARLFYEQKYLEMKALKPIKAGEEIFNDYG 268
Query: 298 PQPNSKLLINYGFVDED-NPYDRLVVEAALNTE 329
P P S LL YG+V E+ YD + V L +E
Sbjct: 269 PLPRSDLLRRYGYVTENYAQYDVVEVPMELVSE 301
>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
Length = 518
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 21/151 (13%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
R+ +S T++ + K+ ED G W G+ C L P
Sbjct: 9 RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68
Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
S H P +AA DL+ G+ VP S+++T + +L +E + + N S L
Sbjct: 69 HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128
Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDR 185
+ LA+ L+YE +GK S+W PY+ L R
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR 159
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 91/414 (21%), Positives = 161/414 (38%), Gaps = 92/414 (22%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
+ A +D+ A F+VP S +++ E + ET E+ + + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYE +G +S W PYI L + E+P+ WS+ EL L S T++++ +A
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKV-GKASAE 648
Query: 225 KREYNELDTVWFMAGSLFQK---------VSLARRFALVPLGPPLLAYS----------- 264
+ +++ V LF + LA R +G +++YS
Sbjct: 649 EMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHR-----MGSTIMSYSFDFQNEDEEDE 703
Query: 265 ---------SKCKAMLAAVDDAVQLVVDRPY-------------------KAGESIVVWC 296
+ K+ + V A L D Y KAGE I +
Sbjct: 704 DETEEWVEEREAKSTMGMVPMADILNADAEYNAHVNYGDDALTVTALRTIKAGEEIFNYY 763
Query: 297 GPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMVAQRNGKL 346
GP PNS+LL YG+V + + YD + V A+L Q DK + +L
Sbjct: 764 GPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARECLDSDEL 822
Query: 347 SVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCMERA---- 400
E + R + + +++S++ ++ P S +R
Sbjct: 823 EDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVDKRKRDEI 882
Query: 401 ---VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
+L + D +R YP ++SEDE +L ++ ++R A + EK+++
Sbjct: 883 QHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLIQ 933
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 87/211 (41%), Gaps = 36/211 (17%)
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T ER L + +L N + +L+ E+ +GK+SFW PYI L +L++
Sbjct: 90 TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143
Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGS---------L 241
PLLW +ET L GSP I R + + ++ L T F+ G
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGGETLSEGDVRW 203
Query: 242 FQKVSLARRFA-------------LVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRP 285
V L+R F+ LVP L SS + D + L +
Sbjct: 204 ASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEAHKS 263
Query: 286 YKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 315
Y G+ + GP S+L ++YGFVD++N
Sbjct: 264 YSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 116/284 (40%), Gaps = 56/284 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT-------LERVLGNETIAELLTTNKLSE--LACLALY 163
V SE+L S P SL +T L+R+LG A+L N LSE L C L
Sbjct: 3 VHTSEELPPDAPVISAPFSLAITPTVAADALQRILGPG--ADL---NSLSERELVCTYLA 57
Query: 164 LMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
+ + K+ G + L + +D R QL +PL ++ ELA L G+ A +R
Sbjct: 58 MHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDR 115
Query: 221 AEGIKREYNELDTV---------W--------FMAGSLFQKVSLARRFALVPLGP----- 258
+ E V W ++ F L+ AL+P
Sbjct: 116 EADWRSECERCRAVLGHWGEHLTWEHYLTASTHLSSRAFPSTLLSPEPALIPTPSSHPVL 175
Query: 259 -PLLAYSSKCKAM-----LAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
PL+ + +A ++ D+ +V P AG ++ GP+PN++L++ YG
Sbjct: 176 VPLIDSLNHARAHPVSWSVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELVLGYG 235
Query: 310 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 353
F DNP D LV++ + D+R + + G L +F V
Sbjct: 236 FALPDNPDDTLVLKVS------GAADRREIWRAGGGLQRILFDV 273
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 112/299 (37%), Gaps = 65/299 (21%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE--- 230
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176
Query: 231 -------------LDTVWFMAGSLFQKVSL---------ARRFA--LVPLGP-------- 258
++ A +F S AR + L+P G
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGTKIDDFSVL 236
Query: 259 -PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
PL ++ + L + A +L+ + Y G+ + G + NS+LL+ YGF+
Sbjct: 237 QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELLLGYGFI 295
>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
Length = 479
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 71/315 (22%), Positives = 125/315 (39%), Gaps = 49/315 (15%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +P+ L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W Y R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSK----C 267
+I + A DT+ L Q S R F P PPL + S+ C
Sbjct: 135 SVVGKIGKAAAD--------DTILQKVVPLIQANS--RHFPPRPNMPPLNSPDSQNALLC 184
Query: 268 KAM-LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEA 324
A + ++ A +++ +A E + G++ DED P +V A
Sbjct: 185 LAHRMGSIIMAYAFDIEKTDEADEHTA-------------DDGYMTDDEDEPAKGMVPLA 231
Query: 325 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEM 380
+ D Q + R+ + + + ++++G E LP R GYV+D
Sbjct: 232 DIFNADAQRNNARLFQEEGSFVMKAIKNIYSGEEIFNDYGELPRADLLRRYGYVTDNYAQ 291
Query: 381 QSVIS-SLGPICPVS 394
V+ SL IC V+
Sbjct: 292 YDVVEFSLDAICKVA 306
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 96/232 (41%), Gaps = 32/232 (13%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L E +G +S W YI L + +PL ++E + A+L + + E
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134
Query: 220 RAEGIKREYNELDTV------------WFMAGSLFQKVSLARRFAL-----VPLGPPLL- 261
R K EY E ++ + + ++F + P+ PL+
Sbjct: 135 RLHIWKHEYQEALSLHPSPTERFTFDLYIWSATVFSSRCFSSNLIYKDSESTPILLPLID 194
Query: 262 AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 316
+ + K K + D +VQL+ G + GP+ N +LL+ YGF DNP
Sbjct: 195 SLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGNEELLMGYGFCLPDNP 254
Query: 317 YDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREKEAISDML 366
+D + ++ A++ + P K + + + +LS VF + +KE +L
Sbjct: 255 FDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDKEIFQKIL 306
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 80/190 (42%), Gaps = 33/190 (17%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA+ L+ E+ +G +SFW PY+R L + P+ ++ +E + +
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQS 313
Query: 220 R-------AEGIKREYNE--------LDTV------WFMAGSLFQKVSLARRFALVPLGP 258
R ++G R+ + LD V W + + + + PL
Sbjct: 314 RCRMLVNISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNIPGLGSTPLMV 373
Query: 259 PLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 312
P++ + + + D ++QLV R + G+++ + G N +LL++YGF
Sbjct: 374 PVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNPQLLLDYGFTL 433
Query: 313 EDNPYDRLVV 322
DNP+DR V
Sbjct: 434 SDNPHDRFEV 443
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 101/264 (38%), Gaps = 74/264 (28%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
+ A D+ A F++P +S++ T L NE + E T++
Sbjct: 49 IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L L L+YE QG S W PY+ L A ++P+ WS TELA L S
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161
Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQKVSLARRFALVP-LGPPLLAYS---- 264
++ E A+ + R D V+F G Q++ + F L +G ++AY+
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQLFELAHRMGSAIMAYAFDLE 219
Query: 265 ------------------SKCKAMLAAVDDAVQLVVD-------------------RPYK 287
+ + ML V A L D R K
Sbjct: 220 KDDDANEEASEQDEWVDDREGRTMLGMVPMADMLNADAEFNAYINHGADSLTATALRTIK 279
Query: 288 AGESIVVWCGPQPNSKLLINYGFV 311
AGE I+ + GP PN +LL YG+V
Sbjct: 280 AGEEILNYYGPLPNGELLRRYGYV 303
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 122/281 (43%), Gaps = 53/281 (18%)
Query: 113 VAASEDLQAGDA------AFSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
+ A DL++ +A ++P+ LV++ E V + +LL + S +
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202
Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
LYL+ + Q + + W YIR L R + P +W+E E L G+ +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255
Query: 216 EILERAEGIKREYNELDTV---------------------WFMAGSLFQK--VSLARR-F 251
+ + + +E++ L V W + + ++ + L R
Sbjct: 256 ALEAKLLSLGKEFDTLREVSEDFPFWNEFLWSGEEVSLEDWVLVDAWYRSRCLELPRSGT 315
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQPNSKLLIN 307
A+VP G ++ +SSK A D D V L++ P ++GE + + G +P S++L +
Sbjct: 316 AMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAKPASEMLFS 374
Query: 308 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 348
YGF+D +N D+L + +DP + K ++ L++
Sbjct: 375 YGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 90/450 (20%), Positives = 164/450 (36%), Gaps = 69/450 (15%)
Query: 65 AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
A S ++V E L + W+ G CKV + S + A++D++
Sbjct: 56 ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
G+ +P++L +T +L ++L + L+++L+ E S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELS-NPTSEWGP 168
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT----- 233
YI+ L +Q + W E GSP + G R+Y L +
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQYCYLYSMIDRT 221
Query: 234 ---------------VWFMA------GSLFQKVSLARRFALVPLGPPLLAYS--SKCKAM 270
VW ++ ++ AL+P S SK +
Sbjct: 222 QSNIMPISSFTWDAFVWAISTVQSRQNPVYAGNGNGSIMALIPFWDFCNHSSTGSKITSF 281
Query: 271 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NT 328
+ + + +K GE + ++ GP+ N++LL++ GF + N +D E L
Sbjct: 282 YHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTNLHDSYPFELHLLEGN 341
Query: 329 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--- 385
+ ++ ++ +R + V V E +++P+ R+ Y E +++
Sbjct: 342 HEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-YALSEQETRAIAPPQV 400
Query: 386 ------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 431
L P+ ++ E L K +LA YP TL EDE L N
Sbjct: 401 PGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASYPTTLEEDEQELKK-N 459
Query: 432 LHPKKRVATQLVRMEKKMLNACLQVTADMI 461
+R EKK+L+ ++ +I
Sbjct: 460 PPANQRFILYTKINEKKILDRNIKYLESLI 489
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 60/124 (48%), Gaps = 9/124 (7%)
Query: 239 GSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESI 292
G LF ++ S+ + ALVP +L +S + L + DRPY+ GE +
Sbjct: 9 GILFSRMVRLPSMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTDRPYQPGEQV 67
Query: 293 VVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 350
+ G + N +LL++YGFV + NP D + + +L D Y++K + ++ G Q
Sbjct: 68 FISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLKKYGLSGSQC 127
Query: 351 FHVH 354
F +
Sbjct: 128 FPIR 131
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 81/201 (40%), Gaps = 33/201 (16%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
+ +L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 97 VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153
Query: 204 ELAYLTGSPTKAEILER------------AEGIKR------EYNELDTVWFMAGSLFQKV 245
L GSP I R GI++ +E D W A L +
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGGETLSEGDVRWASAVLLSRAF 213
Query: 246 SLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVW 295
SL LVP L SS + D + L + Y G+ +
Sbjct: 214 SLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEAHKSYSKGDEVFDS 273
Query: 296 CGPQ-PNSKLLINYGFVDEDN 315
GP S+L ++YGFVD++N
Sbjct: 274 YGPALTGSQLFLDYGFVDDEN 294
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 80/374 (21%), Positives = 153/374 (40%), Gaps = 49/374 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCIG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ EY L+
Sbjct: 138 NSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVENEYRRLE 190
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R ++P +
Sbjct: 191 EEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDH 250
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
AY K A L + D L KAG+ + + + + N++L ++YGF++ + +
Sbjct: 251 AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNADRNAY 310
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDTSE 379
+ ++ DP + DK +A+ NG F + ++ + L + LG +D
Sbjct: 311 TLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLVALG-GTDAFL 369
Query: 380 MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 436
++S+ S G + PVS E + + + K LAGY T+ ED+ L + L +
Sbjct: 370 LESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDSRH 428
Query: 437 RVATQLVRMEKKML 450
+A + EK++L
Sbjct: 429 AIAVGIREGEKQLL 442
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/273 (22%), Positives = 110/273 (40%), Gaps = 45/273 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE++ GD A +P S +++ E + + L + ++ L L+ + E+
Sbjct: 182 MVASENIDVGDIALEIPESSIISEELLCQSGMFLALKDLDSITTETMLLLWSIRERYNPS 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F + Y L G L + LA L G+ E+++ + I
Sbjct: 242 SKFKI-YFEALPANFNTG-------LSFGIDALAALEGTLLFDELMQARQAIY------- 286
Query: 233 TVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 292
+ V L + + + P +L Y K V +++ + RP KAGE
Sbjct: 287 --------CYGIVILILQITIQLVSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQC 333
Query: 293 VVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED----------PQYQDKRMVA 340
+ G P S L+ YGF+ DNPYD L ++ +++ ED + +RM+
Sbjct: 334 FLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEEDSSSPSVTTSQTSHMGERMLG 393
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 373
R + +Q + ++ + YLRLG+
Sbjct: 394 -RQSRTGLQ----RSTKKDSFVHCYFVYLRLGH 421
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VPL L A S A L + + + + GE I PN+ LL YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319
Query: 312 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 353
D++NP+D RL E +L+ DPQ Q+ + K +++V +
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378
Query: 354 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 412
+ RE KE + D L + +S E Q+ S G + P M + R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435
Query: 413 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 448
L Y T+ D+ +L D +L ++++ + VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 97/235 (41%), Gaps = 37/235 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE + G+ A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293
Query: 232 -------------------DTVW----FMAGSLFQKVSLAR-RFALVPLGPPLLAYSSKC 267
+ +W + + S+ +S + LVP+ L S
Sbjct: 294 PLLCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLSTCLVPVAGLLNHSVSPH 353
Query: 268 KAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 318
VD+A ++ + RP AGE + G P S L+ YGF+ DNPYD
Sbjct: 354 ILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYD 408
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 112/299 (37%), Gaps = 65/299 (21%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE--- 230
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176
Query: 231 -------------LDTVWFMAGSLFQKVSL---------ARRFA--LVPLGP-------- 258
++ A +F S AR + L+P G
Sbjct: 177 ILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDDFSIL 236
Query: 259 -PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
PL ++ + L + A +L+ + Y G+ + G + NS+LL+ YGF+
Sbjct: 237 QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELLLGYGFI 295
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 42/167 (25%), Positives = 69/167 (41%), Gaps = 25/167 (14%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PY+R L R + + + W EL + S E +ER E +E++ +
Sbjct: 38 KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92
Query: 232 DTVWFMAGSLFQKVSL-----------------ARRFALVPLGPPLLAYSSKCKAMLA-- 272
LF ++ L +R +L+P L + ++L
Sbjct: 93 KPSLECFPHLFGEIKLEDFMHASALVSSRAWQTSRGVSLIPFAD-FLNHDGVSDSILLYD 151
Query: 273 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
D +++ DR Y GE ++V G N+ L +N+GF N YD+
Sbjct: 152 GQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYDQ 198
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 106/282 (37%), Gaps = 62/282 (21%)
Query: 70 VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
VV + D D SWM NG S +EK H +A L G
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562
Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
VV L+ + ET+ L + ++ A L+ EK +G S W
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
YI L + + P+L + EL + P E+++ + I+ ++ L +V +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670
Query: 238 AGSLFQKVSLA------RRFALVPLGPPLLA--------------YSSKCKAMLAAV-DD 276
AG+ F++ A R F L A Y + M +
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMSQPIWGG 730
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 318
V++V R K GE + GP+ N L + YGFV +DNP+D
Sbjct: 731 KVEIVARRDIKKGEELFASFGPRANDNLFLYYGFVLKDNPFD 772
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/373 (21%), Positives = 149/373 (39%), Gaps = 47/373 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP L + + V +E I ++ + L +AL+L+ E+ +
Sbjct: 82 LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCSG--LKPWLAVALFLIRERSR-S 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W Y L ++ +S + WSE EL+ L G+ + ++ E+ L+
Sbjct: 138 DSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLE 190
Query: 233 TV-----------------WFMAGSLFQKVSLAR----RFALVPLGPPL----------L 261
+F A + + + +R ++PL +
Sbjct: 191 EEIIIPNKKLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTDDH 250
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRL 320
AY K A L + D L KAG+ + + + + N++L ++YGF++ + +
Sbjct: 251 AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTDRNAY 310
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 380
+ ++ DP + DK +A+ NG F + R L +D +
Sbjct: 311 TLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPYLRLVALGGTDAFLL 370
Query: 381 QSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 437
+S+ S G + PVS E + + + K LAGY T+ ED+ L + L +
Sbjct: 371 ESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDSRHA 429
Query: 438 VATQLVRMEKKML 450
+A + EK +L
Sbjct: 430 IAVGIREGEKNLL 442
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 92/237 (38%), Gaps = 46/237 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L + ++ + L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
F PY L G + + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDELIPL 284
Query: 232 -----------DTVW---FMAGSLFQKVSLARRF-------ALVPLG--------PPLLA 262
W A L+ S+ +F L+P+ P ++
Sbjct: 285 LSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVK 344
Query: 263 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 318
Y C +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 YGKVCVET-----SSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 396
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 89/229 (38%), Gaps = 38/229 (16%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDT----------------VWFMAGSLFQKVSLARRFALVP 255
A +R E +K +Y + T ++ A S+F + + R L
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSAR-VLSG 190
Query: 256 LGPPLLAYSSKCKAMLAAVD----------------DAVQLVVDRPYKAGESIVVWCGPQ 299
+ P K +L +D D V VV AG+ I GP+
Sbjct: 191 VFPETDLSEEKLSVLLPIIDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPR 250
Query: 300 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 348
N +L++NYGF NP D +V P Y K Q L+V
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 72/170 (42%), Gaps = 28/170 (16%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S W PYI L G +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 74 QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128
Query: 232 DTVWFMAGSLFQKVS---LARRFALVPLGP------------PLLAYSSKCKAMLAAVD- 275
+ LF KVS +A V P P+L + + A A +
Sbjct: 129 QNALDVWPQLFGKVSVEDFMHVYATVFSRPLAIGEDSTLVMIPMLDFFNHNAASFAKLSF 188
Query: 276 ----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+ + DR + I + CG N++L ++YGF P +RL+
Sbjct: 189 NGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---VPENRLI 235
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 80/193 (41%), Gaps = 38/193 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185
Query: 222 EGIKREYNELDT----------------VWFMAGSLFQKVSLARRF-------------- 251
+K +Y + T ++ A ++F + + +
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245
Query: 252 --ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 304
Query: 310 FVDEDNPYDRLVV 322
F +NP D +V
Sbjct: 305 FCLPNNPCDYRIV 317
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 95/214 (44%), Gaps = 33/214 (15%)
Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
S+P S ++T+E +G T I + T+ L A L +Y+++++K G+ SF+ PY
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
+ L + P+ W+ EL L GS +I +RAE IK +Y + ++ G
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICSIAPEFGD 252
Query: 241 L-----FQ---KVSLARRFALVPLG---PPLLAYSS--------KCKAMLAAVDDAVQLV 281
+ FQ + +R F L+ G L+ ++ + K + + +
Sbjct: 253 IATLEEFQWARMIVCSRNFGLLINGHRTSALVPHADMLNHLRPRETKWTFSEESQSFTIT 312
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 315
+ GE + G + N + L+NYGF E N
Sbjct: 313 TLQEIGMGEQVFDSYGQKCNHRFLLNYGFCVERN 346
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 86/193 (44%), Gaps = 38/193 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP------TKA 215
+L+ + QG+ FW PYIR L + L++ +PL + +L +L G+ +
Sbjct: 97 FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151
Query: 216 EILERA--EGI----KREYNELDT----VWFMAGSL-----FQKVSLARRFALVPLGPPL 260
E+L+ A G+ K + ++D ++ A S+ F LA FA + L
Sbjct: 152 ELLKEAYENGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADIDLPEDG 211
Query: 261 LAYSSKCKAML-----------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
++ C ++ A D LV++ AG+ I GP+ N +L++NYG
Sbjct: 212 VSVLLPCIDLMNHRPLAKVEWRAGKQDVAYLVLED-VAAGQEIANNYGPRNNEQLMMNYG 270
Query: 310 FVDEDNPYDRLVV 322
F DNP D +V
Sbjct: 271 FCLPDNPCDYRIV 283
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 80/193 (41%), Gaps = 38/193 (19%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175
Query: 222 EGIKREYNELDT----------------VWFMAGSLFQKVSLARRF-------------- 251
+K +Y + T ++ A ++F + + +
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235
Query: 252 --ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 294
Query: 310 FVDEDNPYDRLVV 322
F +NP D +V
Sbjct: 295 FCLPNNPCDYRIV 307
>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 474
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/226 (25%), Positives = 96/226 (42%), Gaps = 23/226 (10%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
+++PL L A + A L ++ +++ +P GE I+ G P S LL YG+
Sbjct: 205 LSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDYGQLPRSDLLRRYGY 264
Query: 311 V-DEDNPYDRLVVE-------AALNTEDP-------------QYQDKRMVAQRNGKLSVQ 349
+ D+ YD V E A+L+TE P + + + +AQR G
Sbjct: 265 ISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLEQRVELAQREGVYEDS 322
Query: 350 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 409
H G + +I D L L + D + ++ +S + S V L
Sbjct: 323 YDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRSKLATSLVGQILTKIL 382
Query: 410 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 455
++R Y T+ D+A+L NL +KR+A ++ EK +L +Q
Sbjct: 383 ESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAIQ 428
>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
Length = 607
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 92/214 (42%), Gaps = 35/214 (16%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+A + +L+ A LAL++++E +S Y+ L G+ +V PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193
Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL------------DTVWFMAGSLFQKVSLARR-F 251
+A L G+PT +L RA+ + + L W ++ L + S R +
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAALFGSGGGGGVPLEKFAWALSSVLSRAASGDRMPY 253
Query: 252 ALVPLGPPLLAYSS-KCKAMLAAV--------------DDAVQLVVDRPYKAGESIVVWC 296
A +P G LL + L+AV D V V D P AGE + +
Sbjct: 254 AFLP-GVDLLNHGGVDANCELSAVKLAPGGNEENVTWGDVEVTCVKDTP--AGEQLTISY 310
Query: 297 GPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTE 329
G + N +LL YGF N +DR +E L +
Sbjct: 311 GDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 83/204 (40%), Gaps = 65/204 (31%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE QG+ S W PY+ L ++P+ WS TEL+ L S A++
Sbjct: 109 LILVLIYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-G 160
Query: 220 RAEG----------IKREYNELDTVWFMAGSLFQKVSLARRFALVP-LGPPLLAYS---- 264
RAE + R + E V+F G K+ A+ F L +G ++AY+
Sbjct: 161 RAEADRMIEAKVLPVIRAHEE---VFFPPGR--AKLDDAQLFELAHRMGSTIMAYAFDLE 215
Query: 265 ------------------SKCKAMLAAV-------------------DDAVQLVVDRPYK 287
+ + ML V DDA+ RP +
Sbjct: 216 NDDSDNDEADEDDEWVEDREGRTMLGMVPMADMLNADAEFNAHINHGDDALTATALRPIR 275
Query: 288 AGESIVVWCGPQPNSKLLINYGFV 311
AG+ I+ + GP PN +LL YG+V
Sbjct: 276 AGDEILNYYGPLPNGELLRRYGYV 299
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 60/115 (52%), Gaps = 8/115 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS 181
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/252 (24%), Positives = 100/252 (39%), Gaps = 65/252 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
+ AS+D+ + F +P S ++ N T ++L T KL EL+ L + +
Sbjct: 90 IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142
Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 215
MYE K + +S W Y L L + W++ EL++LT G
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197
Query: 216 EILERAEGIKREYNELDTVWFMAGSL-------FQKVSLARRF----------------- 251
+ R E+NE D + GS+ + +A F
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256
Query: 252 -----------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 300
+++PL L A + KC A L D+++++ +P K GE + G P
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTYGELP 316
Query: 301 NSKLLINYGFVD 312
NS+LL YG+V+
Sbjct: 317 NSELLRKYGYVE 328
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/377 (21%), Positives = 137/377 (36%), Gaps = 62/377 (16%)
Query: 49 RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
R +R S+ V G + E L W+ + G +V+ + N+
Sbjct: 12 RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
R V A L+ G A +P + + R + + L + + +AL
Sbjct: 72 RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180
Query: 224 ----IKREYNELDTVWFM-------AGSLFQKVSLARRF----------------ALVPL 256
++ Y EL F+ F + S F ALVP
Sbjct: 181 YDAYVRAVYEELKNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGANVALVP- 239
Query: 257 GPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGESIVVWCGPQP- 300
G L+ +SS A A + DR Y G I V P+
Sbjct: 240 GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEIFVNYDPEGI 299
Query: 301 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 360
+SK ++YGF+D NP + ++ +D DK V + G F + + +
Sbjct: 300 DSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTFTLRPYSDPD 359
Query: 361 AISDMLPYLRLGYVSDT 377
++ +LRL + DT
Sbjct: 360 --RELRTFLRLLHCKDT 374
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VP+ L A A L D +Q++ +P GE I G PNS LL YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119
Query: 312 DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 361
D +P D + + A E + YQD+ G V V +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174
Query: 362 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 421
+ D + L + + + + S P P ++ + L + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231
Query: 422 EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 455
E E ML N L+ K+R A + E+++L+ L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 63/121 (52%), Gaps = 9/121 (7%)
Query: 239 GSLFQKV----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESI 292
G LF ++ S+ + ALVP ++ +S + + L + +V DRPY+ GE +
Sbjct: 100 GILFSRMVRLPSMDGKNALVPWAD-MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQV 158
Query: 293 VVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 350
+ G + N +LL++YGFV ++ NP D + + +L D Y++K + ++ G Q
Sbjct: 159 FISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQC 218
Query: 351 F 351
F
Sbjct: 219 F 219
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/401 (19%), Positives = 152/401 (37%), Gaps = 58/401 (14%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELA 158
+E HR + A++ ++ G+ +P + ++LE V L N + ++ K + +
Sbjct: 63 QSEGHRTLR---ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQI--NYKPNNIQ 117
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
+ + ++ + K SFW PYI L + P + + A L GSPT ++
Sbjct: 118 TYFVNHLLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVI 171
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQK-------------VSLARRFA-----------LV 254
+ + K EY L A FQK ++++R F LV
Sbjct: 172 NQRKVFKEEYENLKE----AVKEFQKYGYTYDDFIKFRILTISRSFTVQIGEKEQQQLLV 227
Query: 255 PLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFVD 312
PL + + + DA + R + GE + G N +NYGF
Sbjct: 228 PLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFAS 286
Query: 313 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAISDMLPYL 369
NP ++ ++ LN D + K + + N G V R+ A + +
Sbjct: 287 LTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLA---TVRFT 343
Query: 370 RLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGYPATLSED 423
++ + D +++ + + P + +E+A L L + +T+ +D
Sbjct: 344 QISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNFASTIEDD 403
Query: 424 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 464
E L D ++ L EK+++ + + M+ ++
Sbjct: 404 ERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 132/363 (36%), Gaps = 65/363 (17%)
Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
DA + LV + ER + L E LAL L+YE+++G KS W P+I
Sbjct: 64 DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHIAL 123
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPT--------------KAEILERAEGIKREY 228
L + L WSE ELA L GS +EI++++ E
Sbjct: 124 LP------ATPPHALLRWSEAELAELAGSDALELANRWRSQVSSDFSEIVDKSRAAVEES 177
Query: 229 NELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYS---SKC-----------KAMLAAV 274
+ L V + RF + L A S S+C KA L V
Sbjct: 178 DP-------GKQLSAAVKASLRFPWLDLEGFSWAVSMIWSRCVSVSRKGAPPIKAFLPVV 230
Query: 275 D------------------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 316
D D L K G+ + + PN+ LL+ YGF +
Sbjct: 231 DMHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYDGLPNAWLLLLYGFALDHAA 290
Query: 317 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 376
+ + A L+ E P Y+ KR + KL + A + A D LP RL ++
Sbjct: 291 HAGRDLYAPLSPEAPHYEAKRAALE---KLGLGATADGAAPFRLAADDALPE-RL--LTA 344
Query: 377 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 436
++ + L + S RA L A LA Y + ED A L D P+
Sbjct: 345 LMAQRATLDELPGLPATSEATARAAAGDLVAACDALLAAYRGSEDEDAAALADPATPPRL 404
Query: 437 RVA 439
R+A
Sbjct: 405 RLA 407
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 95/236 (40%), Gaps = 39/236 (16%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D G+ +P TL L ++ + + + + +AL++ E+ +G+K+
Sbjct: 29 DCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKATRA 87
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------ 231
++ L R + ++ WSE EL LTG+ E + E K +Y L
Sbjct: 88 AHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKKMEA 140
Query: 232 --DTVWFMAGSL-FQKVSLARRFA-------LVPLG-------PPLLAYSSKCKA----- 269
+ W + +++ + AR L+P G P ++ KA
Sbjct: 141 IGEGGWMREHEVDYERYAWARSNLWSRQCDLLMPDGKRTRAMVPTFDIFNHSAKAPLGKT 200
Query: 270 -MLAAVDDAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
L A + V + YKAGE I G NSKLL YGF +DNPY+ L V
Sbjct: 201 HKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAANSKLLTWYGFCIDDNPYEELDV 256
>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
Length = 543
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 92/232 (39%), Gaps = 36/232 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL +
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIPL 291
Query: 235 -----------------WFMAGSLFQKVSLARRF-------ALVPLGPPLLAYSSKCKAM 270
+ A L+ S+ +F L+P+ L
Sbjct: 292 LSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVK 351
Query: 271 LAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 318
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 352 YGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 543
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 92/232 (39%), Gaps = 36/232 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL +
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIPL 291
Query: 235 -----------------WFMAGSLFQKVSLARRF-------ALVPLGPPLLAYSSKCKAM 270
+ A L+ S+ +F L+P+ L
Sbjct: 292 LSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVK 351
Query: 271 LAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 318
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 352 YGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 120/304 (39%), Gaps = 59/304 (19%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
L + W+ N + P ++ + EK + + A + ++ + SVP +++
Sbjct: 35 LNEFNKWLINNKVYKNPKIEIKVLEKYGRS--------IVAKQSIKKNEKLISVPKLIIM 86
Query: 135 T----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+ L NE I E + +S A++LMY K KSFW PY+ L ++
Sbjct: 87 SNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKE---- 140
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG----------S 240
+ + +SE EL L S K + R +GI+R YN T G +
Sbjct: 141 ---FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHYNSTFTRLSNRGIAEFSPTSTQT 197
Query: 241 LFQK------------VSLARRFAL-------VPLGPPLLAYS-SKCKAMLAAVDDAVQL 280
L QK +R F+L VPL A SK K D +
Sbjct: 198 LQQKGYTLELFTWALSCVWSRAFSLSDSDGGMVPLADMFNAEEISKSKVQPKVTDSTLDY 257
Query: 281 VVDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA-ALNTEDPQYQD 335
+ GE I G P +S++L++YGFV D P D + + + ++P Q
Sbjct: 258 YASDDIEIGEQIFTPYGVYKPLSSSQMLMDYGFVFDHGTPSDNVAISVPIFHPDEPNIQV 317
Query: 336 KRMV 339
K+ +
Sbjct: 318 KQSI 321
>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 536
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 92/232 (39%), Gaps = 36/232 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL +
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELIPL 291
Query: 235 -----------------WFMAGSLFQKVSLARRF-------ALVPLGPPLLAYSSKCKAM 270
+ A L+ S+ +F L+P+ L
Sbjct: 292 LSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVK 351
Query: 271 LAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 318
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 352 YGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 90/218 (41%), Gaps = 60/218 (27%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A + L++ K+QG +S P+I +L G PL WS+ +LA L A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190
Query: 217 ILERAEGIKREYNEL-DTV-----------------WFMAG--------------SLFQK 244
+ E+ +RE+ L DT+ W+ G +L +
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMGVVRSRTFSGPYIGSTLSDR 246
Query: 245 VSLA--------------RRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKA 288
+ LA +++A+ PL L ++S ++ ++ D+ +V R +K
Sbjct: 247 LRLAGLVAALVVILSRSLKQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRDFKK 305
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
GE + + G Q N L+ YGF + DNP D V+ L
Sbjct: 306 GEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 100/223 (44%), Gaps = 50/223 (22%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
T++ + L+L+L+ EK +GK SFW PYIR L + +P+ ++E+EL L+
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282
Query: 211 SPTKAEILERAEGIKRE----YN-----------ELDTVWFM-----AGSLFQKVSLARR 250
S + E+A +K+E +N ELD+ + A S+ + +L +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEPFVTSCLPELDSTFTFDAFRWAWSVLKTRTLYQE 337
Query: 251 --------------FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVV 294
LVP+ L+ +S KA ++ V PY+ + + +
Sbjct: 338 DCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPYRKYDQVFI 396
Query: 295 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 336
G + N++L++ +GF +NP D + + + E PQ D+
Sbjct: 397 SYGFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439
>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VP+ L A A L + +++V +P +AGE I G PNS LL YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316
Query: 312 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 357
D NP D + V A L ++ +Y + V + VF +
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374
Query: 358 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
+ E +++ + RL ++S+ +++ S P V P VL D ARL YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430
Query: 418 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 455
++ EDE +L+ +L+ K V +L EK++L L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 101/256 (39%), Gaps = 38/256 (14%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +NG + E S++++ R +H A+ DL+ + VP ++T+E +G
Sbjct: 37 WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89
Query: 144 TIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
T EL L L+++ +++ +F+ PY L P+
Sbjct: 90 TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDIL------PSTLSNMPI 142
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW--FMAGSLFQK------------ 244
W EL +L GS +I ER IK +Y + +W F+ ++
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIWPSFIDVCTLEEFKWARMCVCSRN 202
Query: 245 ----VSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 299
V+ AR A+VP L + + K A + + G I G +
Sbjct: 203 FGVVVNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKISVGSQIYDSYGQK 262
Query: 300 PNSKLLINYGFVDEDN 315
N + L+NYGF EDN
Sbjct: 263 CNHRFLLNYGFAIEDN 278
>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 489
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 67/144 (46%), Gaps = 16/144 (11%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILER-AEG 223
S +EL L GS +I ++ AEG
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEG 152
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 70/328 (21%), Positives = 125/328 (38%), Gaps = 88/328 (26%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+L L SW ++G K+ L++ + + L G+ S+P SL +T+
Sbjct: 27 ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ VL AL+LM E+++G SFW Y+R L V++
Sbjct: 82 DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114
Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNEL------------------D 232
PL W +E E L G T +L R +++++ E D
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHLPLVEAHPEILGGVTFED 172
Query: 233 TVWFMAGSLFQKV----------SLARRFALVPLG-------------PPLLAYSSKCKA 269
+W M+ S++ + S R A+VP+ ++ + ++
Sbjct: 173 YLWAMS-SIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGG 231
Query: 270 MLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ + + +++ R Y A E + G N+KLL +YGFV NPY L
Sbjct: 232 LSMGIGEPGRARGTLRVSAGRDYAAREQFFILYGRYSNAKLLYSYGFVLASNPYGGLDYW 291
Query: 324 AALNTEDPQYQDKRMVAQRNGKLSVQVF 351
+ DP + K+ + + + Q +
Sbjct: 292 VRVPQTDPGFAWKQALLDEHPLTAAQAY 319
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/234 (23%), Positives = 96/234 (41%), Gaps = 33/234 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ D+QAG+ VP L TL+ V + I+ L N +S A LA YL +K
Sbjct: 33 IIATRDIQAGETILFVPFKLFRTLKHV--PKAISRRLPRN-MSLHALLATYLSLDKTD-- 87
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F +P D + P LW EL P ++++ KR+++ +
Sbjct: 88 -TFAIPNKTLPDLSSFEAGM----PFLWP-AELHPFLPKPALDLLMKQQRSFKRDWDIVS 141
Query: 233 TV------------WFMAGS--------LFQKVSLARRFALVPLGPPLLAYSSKCKAMLA 272
W + + + +++ R A++P+ C+A A
Sbjct: 142 KAYSNISQDQYLHAWLLVNTRSFYCTTPIMERLPHDDRLAILPVADLFNHADVGCEARFA 201
Query: 273 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
+ + + DR Y+ GE + + G LL YGFV +N +D + ++ A+
Sbjct: 202 S--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYGFVPTENCWDVVCLDEAI 253
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 94/221 (42%), Gaps = 25/221 (11%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VP+ L A A L D +++V +P K GE I G PN++LL YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304
Query: 312 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 363
D NP D + ++A L V K + + + G E I
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLIISAVS-STPEAVKDDEAKERID-WWLEEGGEDVFIL 362
Query: 364 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 415
D M+ +++L ++ ++ S P +E + D L + RLA
Sbjct: 363 DYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEKRLAE 418
Query: 416 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 455
YP T+ D+A+LT D L+ K + +L EK++L+ LQ
Sbjct: 419 YPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 58/118 (49%), Gaps = 11/118 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +DL+ + +P V++ + +IA +L +L E ++ LMYE +G
Sbjct: 44 IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
+S W YI+ + ++ P+LW + + YL G+ + ++E E ++ +Y E
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE 150
>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
Length = 489
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 60/131 (45%), Gaps = 15/131 (11%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGS 211
S +EL L GS
Sbjct: 129 SASELEELQGS 139
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VP+ L A A L + +++V +P KAGE I G PNS LL YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323
Query: 312 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 357
D NP D + V A L + K + +R + VF +
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381
Query: 358 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
+ E +++ ++RL + ++ S P P +++ VL D + RL YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437
Query: 418 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 455
TL EDEA+ L KR A + EK++L L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 81/198 (40%), Gaps = 53/198 (26%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+ L + +M+E +G +S W PY+ L + E+P+ WS EL L S T+ +
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTK- 594
Query: 218 LERAEGIKREYNELDTVWFMAGSLF---------QKVSLARRFA---------------- 252
+ +A+ + + ++ V +F + V LA R
Sbjct: 595 VGKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654
Query: 253 -------------------LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 293
+VP+ +L ++ A + DDA+ + R KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713
Query: 294 VWCGPQPNSKLLINYGFV 311
+ GP PNS+LL YG+V
Sbjct: 714 NYYGPHPNSELLRRYGYV 731
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 88/383 (22%), Positives = 157/383 (40%), Gaps = 70/383 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
A+ +L++ FS+P SLV+++ +++ +E+ T + + CL MYE+
Sbjct: 39 ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEG------- 223
S W Y+ + +S + WS+ EL L GS +I E AEG
Sbjct: 94 PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEGSYYSILV 146
Query: 224 --IKREYNELDTVWFMAGSLFQKVS---LARRF--------------------------A 252
+ + + + + +L+ + L+R F
Sbjct: 147 PYLSKHADIFKPLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAAYHEVETVG 206
Query: 253 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 312
+VP+ L A S A L DA+ + + AGE I PN+ LL YG VD
Sbjct: 207 MVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQIFNTYNDPPNADLLRRYGHVD 266
Query: 313 EDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 370
E N D + + A L +D +R L ++ V+ + E + D + ++
Sbjct: 267 EVNLNDNVEISADLIGCKD---------LERVDWLLDRLDDVYTLTQAEDLPEDFITAVK 317
Query: 371 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 430
+ + SE + + + P ++ A ++ + + RLA Y +T+ EDE++L
Sbjct: 318 I-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQMRLAQYSSTIEEDESLLASS 373
Query: 431 NLHPKKRVATQLVRM-EKKMLNA 452
+ + A LVR+ EK++L A
Sbjct: 374 TMLTSRSRAALLVRLGEKRILAA 396
>gi|145537195|ref|XP_001454314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422069|emb|CAK86917.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 81/426 (19%), Positives = 168/426 (39%), Gaps = 58/426 (13%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ KV ++ K +E +R + AS+ ++ G+ +P + ++LE V
Sbjct: 42 NLIQWLKDGKAEVSKVQIEVK---SEGYRTLR---ASQFIRQGEWVLFIPRTHYLSLEEV 95
Query: 140 LGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ I +++ N + + + + + ++ + + SFW PYI L + P
Sbjct: 96 KKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSFWKPYIDVLPKD------VSGFP 149
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYN-------ELDTVWFMAGSL--FQKVSLA 248
+ + A L GSPT ++ + + + EY+ E + F+ ++++
Sbjct: 150 TYFDAEQDALLKGSPTLFTVMNQRKIFREEYDNLKEAVKEFQRYGYTYNDFIKFRILTIS 209
Query: 249 RRF-----------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVW 295
R F LVPL + + + DA + R + GE +
Sbjct: 210 RSFPVYIGENEQQQLLVPLAD-FVNHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYN 268
Query: 296 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN----GKLSVQVF 351
G N +NYGF NP ++ + L+ D + K + N +L +
Sbjct: 269 YGQWSNKYFFMNYGFASLTNPMNQFDFDVCLDRNDRLFNLKVDLTGGNICWGNRLVNETD 328
Query: 352 HVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPICP---VSP---CMERAVL 402
H + L +R +S D +++ + + P +P +E+A
Sbjct: 329 H-------DTFRQALATVRFAQISKLDDFLQLEEDVENYNQFWPGWHTTPKTIALEKATF 381
Query: 403 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIM 462
+ + LA + +++ +D+ L D + +R L EK+++ ++V DM++
Sbjct: 382 KAFKELLVSELANFASSIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMML 440
Query: 463 LLPDVT 468
+ D T
Sbjct: 441 SVIDKT 446
>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
Length = 653
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
AA +++ D VP +++T+ER L + + A + + + L ++L+YE
Sbjct: 56 AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++G +SFW PY +D G L P WS+ + L S K +I + + + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167
Query: 228 YNEL 231
++ L
Sbjct: 168 WDML 171
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 97/255 (38%), Gaps = 63/255 (24%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-------------LGNETIAELLTTNKLSELAC 159
A +D+ G+ ++P+ +++ ERV + +++I+ LSE
Sbjct: 75 TTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNK 134
Query: 160 LALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
+ +Y MY K K+SFW PY + + SP+ WSE EL L GS
Sbjct: 135 VLMYFLLQMYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNM 185
Query: 217 ILERAEGIKREYNELD------------------TVWFMAGSLFQK--VSLARR--FALV 254
+ ++ Y+EL WF A L+ + L R+ V
Sbjct: 186 ARIEKKRLRAMYDELRERIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQTGHGNV 245
Query: 255 PLGPPLL-----------AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 303
P PL+ + K + AAV + DR G + G + N +
Sbjct: 246 PTFIPLIDMVNCIESQDKTFIQYDKKLRAAV-----MYADRAVSRGVQVFESYGNKSNYE 300
Query: 304 LLINYGFVDEDNPYD 318
L+ GFV EDNP D
Sbjct: 301 YLLYNGFVMEDNPND 315
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 106/276 (38%), Gaps = 74/276 (26%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FSVP ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA---EGIKR 226
QG+KS W PY R L + ++ + WS EL L S T E + R+ E I+
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157
Query: 227 ----------------------EYNELDTVWFMAGSLFQKVSLARRF------------- 251
E + D G + + +A F
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217
Query: 252 ------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 293
+VPL L A + + A L + A+ + +P + GE I
Sbjct: 218 ANDESYMTDDEEEEQLPKGMVPLADLLNADADRNNARLYQEEGALVMKAIKPIQQGEEIF 277
Query: 294 VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNT 328
G P + LL YG+V ++ YD V+E +L T
Sbjct: 278 NDYGEIPRADLLRRYGYVTDNYAVYD--VLELSLET 311
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 76/181 (41%), Gaps = 33/181 (18%)
Query: 158 ACLALYLMYEKKQGKKSFWLP-----YIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 212
A LA+YL+Y ++ P Y L +Q V S WSE + L
Sbjct: 97 AKLAIYLLYSIDHAEQD---PRPLRDYFDVLPKQ-------VLSTFSWSEEAIQELQDPY 146
Query: 213 TKAEILERAEGIKREYNELDT-----------VWFMAGSLFQKVSLARRFALVPLGPPLL 261
+I R I+R ++E+ +W + ++ L+R FA G L
Sbjct: 147 MIEQIQTRRRKIQRLFHEIQKGLSPRITYDRLLWAI------EIVLSRAFAFSRTGGDDL 200
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
+S Q+V ++ +K G+S+ + G + N +LL++YGF+ DNP D V
Sbjct: 201 VFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHELLLSYGFILPDNPEDFFV 259
Query: 322 V 322
+
Sbjct: 260 I 260
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 13/104 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL G +P + V++++ N IA++L +++ L + +MYE GK
Sbjct: 4 VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
+SFW Y+ EL ++ PL W+E E + L G T+AE
Sbjct: 60 ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQG--TEAE 94
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 13/137 (9%)
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD----------TVWFMAGSLFQK 244
E PL WS EL L G+ T E+ + + + EL V M F+
Sbjct: 108 EHPLEWSADELVCLKGT-TVWEMHQLSLEVVDSVAELCPNSPRAMIRWAVEVMMSRAFE- 165
Query: 245 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 304
S ++PL S+K + V++ Q++ ++P K GE I G N L
Sbjct: 166 -SEVCGLCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEML 224
Query: 305 LINYGFVDEDNPYDRLV 321
L+ +GF++ DNP+D +
Sbjct: 225 LLTHGFIEFDNPHDHFI 241
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 98/248 (39%), Gaps = 51/248 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYE-KK 169
V A ED++ + F VP + ++ +E ++ E+ + + L + L++E K
Sbjct: 41 VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
G+KS W PY++ L ++ QL + W++ EL L G+ E+ E
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENV 155
Query: 222 EGI--KREYNELDT-----VW---FMAGSLFQKVSL------------------------ 247
I K E D+ W + S+ S
Sbjct: 156 VDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDVQDYVEEKEGGTDEEEDDNESEN 215
Query: 248 ARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 306
R ++PL L + + KC A L + +++ + K GE I G PNS++L
Sbjct: 216 VRSLKCMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQIYNIYGDHPNSEILR 275
Query: 307 NYGFVDED 314
YG+++ D
Sbjct: 276 RYGYIEPD 283
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 74/191 (38%), Gaps = 43/191 (22%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + A+ P W E ++AYL G+ I E
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164
Query: 222 EGIKREYNE----LDTVWFMAGSLFQK--------VSLARRF---------------ALV 254
+K E+ + L F A + + + +R F L+
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224
Query: 255 PLG---------PPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQP 300
P G PL ++ A D A QLV Y+ GE + G +
Sbjct: 225 PPGCQLDDFSILQPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYGLKT 284
Query: 301 NSKLLINYGFV 311
NS+LL+ YGF+
Sbjct: 285 NSELLLAYGFI 295
>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 502
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 102/231 (44%), Gaps = 34/231 (14%)
Query: 252 ALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
A+VP+ L A Y S+ + +D ++++ +P GE I G PNS LL YG
Sbjct: 254 AMVPMADMLNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRRYGH 312
Query: 311 VDE--------DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSVQVF 351
VD NP D +VE A + + Q A+R G V +
Sbjct: 313 VDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDVFIL 370
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 411
+ E +++P+ RL S + ++ S P V + ++L +A+ +
Sbjct: 371 ETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANALER 422
Query: 412 RLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
RLA YP ++ ED+ +LT+ +L+ K V +L EK++L+ L + +
Sbjct: 423 RLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 105/260 (40%), Gaps = 38/260 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 66 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-----------SLFQ--- 243
L WS E L S T+ + + +Y + + G S F+
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYGWIQSALARGGISGLQADSFDLSDFEWAV 237
Query: 244 KVSLARRFALVPLG---PPLLAYSSKCKA---------MLAAVDDAVQLVVDRPYKAGES 291
V L+R F G PLL +++ + + ++++ DR G+
Sbjct: 238 AVVLSRSF-FAENGLRLAPLLDMANRGEGCTNEPQIGGLGIFGGKGLKVIADRDTDKGQE 296
Query: 292 IVVWCGPQPNSKLLINYGFV 311
IV+ GP+ + L ++GFV
Sbjct: 297 IVISYGPKSGIEFLEDHGFV 316
>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 15/108 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G++S W PY + L + + ++ + WS +EL L GS ++I
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKI 145
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 139/385 (36%), Gaps = 84/385 (21%)
Query: 81 LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
LKS+M HK K LKE+ + RP + + ++ LQAGD S+
Sbjct: 17 LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76
Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
P ++T L LG + L L L C L + EK
Sbjct: 77 PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136
Query: 170 QGKKSFWLPYI----------------------RELDRQRGRGQLAVESPLLWSETELAY 207
G+KS W PY+ + L ++ ++ ++ L+ + +
Sbjct: 137 AGQKSPWKPYLDVLPKAYTCPACLEPDIINLLPKPLQKKAQEQKMLIQE--LFQSSRAFF 194
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGS--LFQKVSLARRFALVP----LGP--P 259
+ P AE G ++ L W + ++ K F+L P L P
Sbjct: 195 SSLQPLFAE----DTGNIFNFSALQWAWCTVNTRTIYMKHPHRECFSLEPDVYALAPYLD 250
Query: 260 LLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
LL +S + KA + ++ D K + +++ GP N +LL+ YGFV DNP+
Sbjct: 251 LLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICYGPHDNQRLLLEYGFVATDNPH 310
Query: 318 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 377
+ V A + DK QR K+S+ H D L L G+ +
Sbjct: 311 SSVYVSADTLLKYFSSLDK----QREAKVSILKDH-----------DFLENLTFGWEGPS 355
Query: 378 SEMQSVISSLGPICPVSPCMERAVL 402
+ + + L C R +L
Sbjct: 356 WRLLTALKVLSLAADEFACWRRILL 380
>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
Length = 490
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 62/280 (22%), Positives = 110/280 (39%), Gaps = 57/280 (20%)
Query: 80 DLKSWMHKNG---LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
DL W+ + G P +V +H E + A D+ GD ++P L + L
Sbjct: 60 DLVRWVQREGGFVHPALRVA-----NHPEHGLGVSAAAPDGDIPPGDVLIALPGRLPLRL 114
Query: 137 ERVLGN-ETIAELLTTNKLSELACLALYL-MYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
R G + + L EL + L L + +++ SFW PYI L
Sbjct: 115 RRPTGAADDVLVQLAQQVPEELWAMKLGLRLLQERAKSDSFWWPYIANLPE-------TF 167
Query: 195 ESPLLWSETELAYLTGSPTKAEI-------LERAEGIKREYNELDTV------------- 234
P+ + ++ L +P ++ LE + I+++ + + +V
Sbjct: 168 TVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEKEIQQKLHTVPSVDHPFYGQDVNSSS 227
Query: 235 --WFMAGSLFQKVSLARRFALVPLGPPLLAYSSKC------------KAMLAAVDDAVQL 280
W M+ + +R F L P LL C + + ++D +V++
Sbjct: 228 LGWAMSAAS------SRAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGSVNSLDMSVKV 281
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
V ++ + SI + G PN L++YGFV NPYD++
Sbjct: 282 VAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYDQV 321
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 105/260 (40%), Gaps = 38/260 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 60 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-----------SLFQ--- 243
L WS E L S T+ + + +Y + + G S F+
Sbjct: 172 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYGWIQSALARGGISGLQADSFDLSDFEWAV 231
Query: 244 KVSLARRFALVPLG---PPLLAYSSKCKA---------MLAAVDDAVQLVVDRPYKAGES 291
V L+R F G PLL +++ + + ++++ DR G+
Sbjct: 232 AVVLSRSF-FAENGLRLAPLLDMANRGEGCTNEPQIGGLGIFGGKGLKVIADRDTDKGQE 290
Query: 292 IVVWCGPQPNSKLLINYGFV 311
IV+ GP+ + L ++GFV
Sbjct: 291 IVISYGPKSGIEFLEDHGFV 310
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 107/282 (37%), Gaps = 70/282 (24%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + I L L + LMYE+ +GK
Sbjct: 34 VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE- 230
S W Y++ L RQ P LWSE E+ L G+ + E +K ++ E
Sbjct: 90 GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEEN 142
Query: 231 ------LDTVWFMAGSLFQKVSLARR---------------FALVPLGPPLLAYSSKCKA 269
D + F A + LA + + +VPL ++ K A
Sbjct: 143 IAPLTKEDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHGYGMVPLAD---LFNHKTDA 199
Query: 270 -----MLAA-----------VDDA---------------VQLVVDRPYKAGESIVVWCGP 298
ML A +DD +++V+ + AG I G
Sbjct: 200 EDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAAGSEIFNTYGQ 259
Query: 299 QPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 338
N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 260 LGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
A+VP+ L + A L D+ V +++ KAGE I G PNS LL YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330
Query: 311 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 354
+D NP D LVVEAA + QD+ V + VF V
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387
Query: 355 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 414
G + E +M+ RL + +E + + G + P M+ + D ++RL
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442
Query: 415 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 455
YP ++ EDE +L D + L +++A + EK++L L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 56/118 (47%), Gaps = 11/118 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ +P S ++T +I+ L K+ + ++ L+YE G
Sbjct: 59 VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
+S W YI+ L L+V+ P+LW++ +L L G+ + + E E + YN+
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK 165
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 101/232 (43%), Gaps = 20/232 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----NE 230
W PY+ L S + W + EL +L + ER E I EY N
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEYISVKNV 124
Query: 231 LDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD----DAVQLVVDRPY 286
+ + + G A +A V A SS+ +M VD D V
Sbjct: 125 ISSCPHVFGEDISLFQFAHAYATVVSRAWNGALSSEI-SMRPFVDFCNHDPVSHATVSHD 183
Query: 287 KAGESIVVWC--GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 336
++ VV+ G + N+ L ++YGFV +N D+ + + DP + K
Sbjct: 184 SCKDATVVFISYGKRSNAVLAVDYGFVLPNNLSDQAELWMEIPWNDPLREKK 235
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 84/396 (21%), Positives = 153/396 (38%), Gaps = 92/396 (23%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL---VVTLE 137
+ +W+ ++G V +K P + + +S++ GD S+P L + L
Sbjct: 4 ITTWVKEHGGHIDGVYVKNFPVYGNG------LCSSKEFHEGDTLLSIPYHLQLNTIELH 57
Query: 138 RVL-----GNET--IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
V G E + E N+ E + + LYL K +K F PYI L
Sbjct: 58 NVFESMVPGFEVPRLGEG-AKNRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT----- 110
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQK------ 244
PL +SE EL L G+ + + +K+ + +T+ + FQ+
Sbjct: 111 --TFSCPLSYSENELKMLKGTKLLVTVEKTKTFLKKLSDYYETLTHQYPTRFQQFDDFYQ 168
Query: 245 -------VSLARRFALV---PLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKA- 288
V +R F ++ P+G P +S+ + V V +R +
Sbjct: 169 RLVWAHQVFWSRAFLVIYPDPIGDVASLIPFADFSNH------NTETKVTYVSNRQTQTF 222
Query: 289 ---GESIVVWCGPQ--------PNSKLLINYGFVDEDNPYDRLVV-----EAALNTEDPQ 332
V+ CG Q PN K+L+ YGFV +NPYD +++ E + +
Sbjct: 223 SLQTNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDEVLLRINFKERHFEKQVEE 282
Query: 333 YQDKRM--------------------VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 372
++ +M + Q + V F + REKE +D+L LR+
Sbjct: 283 SEESKMEVENKENERMEVEEEDNEDEITQILKREGVDRFDYYLTREKELPTDLLRVLRIV 342
Query: 373 YVS--DTSEMQSVISSLGPICPVSPC-MERAVLDQL 405
+S + ++ + L + P++ R++++Q+
Sbjct: 343 NLSLVEANQYSQALLDLSYVSPINEIKATRSLMEQI 378
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 69/191 (36%), Gaps = 43/191 (22%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + + P W E ++ +L G+ I E
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168
Query: 222 EGIKREYNELDTV----------------WFMAGSLFQKVSL------------------ 247
IKREY + V + A S+F S
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228
Query: 248 --ARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP 300
AR + PL ++ D + QL+ Y+ G+ + G +
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKT 288
Query: 301 NSKLLINYGFV 311
NS+LL+ YGF+
Sbjct: 289 NSELLLAYGFI 299
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 87/388 (22%), Positives = 157/388 (40%), Gaps = 62/388 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR---------QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
K S+W PY+ +L R Q + L V+ + +E + KA L
Sbjct: 111 KSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIPLMEEL 170
Query: 223 GIKREYNELDTVWFMAGSLFQKVSLARRF----ALVPLGPPLLAYSSKCKAML------- 271
+K + W A S ++ + L P+G Y++ +
Sbjct: 171 KLKPQLQNF-RAWLWASSTVSSRTMHIPWDDAGCLCPVG-DFYNYAAPGEEPCGWEDLKD 228
Query: 272 AAVDDAV-QLVVDRPYK---------------AGESIVVWCGPQPNSKLLINYGFVDEDN 315
A DD + Q + D YK GE +++ G N +LL +YGF+ ++N
Sbjct: 229 AEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDEN 288
Query: 316 PYDRLVV--EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 373
P D+ + E + +D + Q NGK S +L LRL +
Sbjct: 289 PNDKAFIPLEPEVYASSSWPKDSLYIHQ-NGKPSFA---------------LLSALRL-W 331
Query: 374 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
+ S+ +SV + +S E V++ +A L P ++ ED +L+
Sbjct: 332 ATPASQRRSVGHLVYSGTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLS----M 387
Query: 434 PKKRVATQLVRMEKKMLNACLQVTADMI 461
+ ++A Q K++L C+ ++I
Sbjct: 388 ERWKLAVQWRLRHKRILVDCISRCTEII 415
>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
Length = 583
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 93/243 (38%), Gaps = 47/243 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPA 322
Query: 232 ----------------------DTVWFMAGSLFQKVSLARRFALVPLG--------PPLL 261
+W+ G R L+P+ P ++
Sbjct: 323 LCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIM 382
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRL 320
Y + + ++++ V +P GE + G +S L+ YGF+ + DN YD +
Sbjct: 383 HYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGDNLYDTI 437
Query: 321 VVE 323
+E
Sbjct: 438 PLE 440
>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
Length = 482
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 86/220 (39%), Gaps = 68/220 (30%)
Query: 158 ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ L L L+YE Q G W PY+ L A +P+ WS EL L S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154
Query: 212 PTKAEI-LERAEGIKREY---NELDTVWFMAGSLF----QKVSLARRFALVPLGPPLLAY 263
P A++ E A+ + R AGS Q V+LA R +G ++AY
Sbjct: 155 PAVAKVGRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHR-----MGSTIMAY 209
Query: 264 S--------------------SKCKAMLAAV-------------------DDAVQLVVDR 284
+ KA++ V DD + + R
Sbjct: 210 AFDLDKEEDEDEDGEDGWVEDRDGKALMGMVPMADILNADAEFNVHVNHGDDDLTVTALR 269
Query: 285 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVE 323
P +AGE I+ + GP PNS+LL YG+V E + YD VVE
Sbjct: 270 PIRAGEEILNYYGPHPNSELLRRYGYVTERHARYD--VVE 307
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G S W PY + L ++ + W++ EL L GS I
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRI 145
>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 699
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 87/220 (39%), Gaps = 44/220 (20%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157
Query: 212 PTKAEILERAEGIKREYNELDT----------------VWFMAGSLFQKVSLARRFA--- 252
A +R E +K +Y + T ++ A S+F ++R F+
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMF----ISRAFSAKV 213
Query: 253 LVPLGPPLLAYSSKCKAMLAAVD----------------DAVQLVVDRPYKAGESIVVWC 296
L + P K +L +D D + VV AG+ I
Sbjct: 214 LSGVFPETDLSEEKLSVLLPIIDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY 273
Query: 297 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 336
GP+ N +L++NYGF NP D +V P Y K
Sbjct: 274 GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%), Gaps = 10/101 (9%)
Query: 234 VWFMAG--SLFQKVS------LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 285
WF+ G S + +V R AL+P+ S C + + + DR
Sbjct: 154 AWFLVGTRSFYYQVDETLPYPWHDRLALLPVADMFNHASVGCAVAFST--EVYDVTADRD 211
Query: 286 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
Y+A E + G N LL YGF+ +DNP+D+L ++A L
Sbjct: 212 YEADEELYTSYGAHSNDFLLAEYGFMLQDNPHDQLCLDAVL 252
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 97/221 (43%), Gaps = 19/221 (8%)
Query: 249 RRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 306
+++A+ PL L +SS ++ +A D+ +V R +K GE + + G Q N L+
Sbjct: 291 KQYAICPL-IDLFNHSSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQ 349
Query: 307 NYGFVDEDNPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 364
YGF + +NP D V+ L T R+ A + L+ + V R S+
Sbjct: 350 YYGFAEANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SE 408
Query: 365 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 424
L +R +D SE + +SS SP E + + +A+ + L ++L ED
Sbjct: 409 TLQAVRFLLAAD-SEAGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDL 465
Query: 425 AMLTDYNLHPKKR---------VATQLVRMEKK-MLNACLQ 455
A+L+ R VA R+EKK +L A LQ
Sbjct: 466 ALLSSTGASAGGRKGGTAAAAAVAAVAFRVEKKRLLTAVLQ 506
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G S W PY + L ++ + W++ EL L GS I
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRI 145
>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
Length = 566
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 93/243 (38%), Gaps = 47/243 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPA 305
Query: 232 ----------------------DTVWFMAGSLFQKVSLARRFALVPLG--------PPLL 261
+W+ G R L+P+ P ++
Sbjct: 306 LCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIM 365
Query: 262 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRL 320
Y + + ++++ V +P GE + G +S L+ YGF+ + DN YD +
Sbjct: 366 HYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGDNLYDTI 420
Query: 321 VVE 323
+E
Sbjct: 421 PLE 423
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 85/376 (22%), Positives = 144/376 (38%), Gaps = 66/376 (17%)
Query: 48 RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
RR+ R+ +S + D L W+ NG V + + +E
Sbjct: 24 RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83
Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
A+ L G A VP SL +T E + ++ + + L L+ LA
Sbjct: 84 GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E+++G++S + Y++ L + SPL W+ EL+ + G+ ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192
Query: 222 EG----IKREYNELDTVWF-------------------MAGSLFQKVSLA----RRFALV 254
G ++ Y L T F A + + ++A ALV
Sbjct: 193 AGYDGYVRGVYETLRTGMFAKHADVFDVEGAFSEDNFRWAFGILRSRTMAPCDGANIALV 252
Query: 255 PLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPYKAGESIVVWC 296
P G L+ +SS +A + AV DD V V DR E + V
Sbjct: 253 P-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRALNVNEPLYVNY 311
Query: 297 GPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 355
P+ ++ +++GFVD P + ++ +DP DK V G F + A
Sbjct: 312 NPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKLDVLDVCGLGETPTFTLRA 371
Query: 356 GREKEAISDMLPYLRL 371
+ + D+ +LRL
Sbjct: 372 YSDPD--PDLRTFLRL 385
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 104/237 (43%), Gaps = 37/237 (15%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYL-MYEKKQG 171
AA D+ GD ++P+ L + L R + + L EL + L L + +++
Sbjct: 93 AAYGDIPIGDVLIALPSQLPLRLRRPTSAADDVLVQLAQQVPDELWAMKLGLRLLQERAK 152
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA----EGIKRE 227
SFW PYI L P+ + ++ L +P ++ +R E K
Sbjct: 153 SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNKRCRFLLEFEKEV 205
Query: 228 YNELDTVWFM----------AGSLFQKVSLA--RRFAL---VPLGPPLL-----AYSSKC 267
+L TV + + SL +S A R F L VP+ PL+ +++
Sbjct: 206 QQKLHTVPLVDHPFYGQDVNSSSLGWAMSAASSRAFRLHGEVPMLLPLIDMCNHSFNPNA 265
Query: 268 KAM----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
+ + + ++D +V+++ ++ K E+I + G PN L++YGFV NPYD++
Sbjct: 266 RIVQERSVNSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 322
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 42/91 (46%)
Query: 248 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 307
R+ + P+ K A +L + + GE I + + N++ L+
Sbjct: 244 GRQRIMAPMADLFNHGDVKTSYTFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQ 303
Query: 308 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 338
YGFV E NP+D + + A++ + P Y+DK +
Sbjct: 304 YGFVIESNPHDYVGIAASIGNDQPFYRDKSL 334
>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
1015]
Length = 668
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 87/220 (39%), Gaps = 44/220 (20%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDT----------------VWFMAGSLFQKVSLARRFA--- 252
A +R E +K +Y + T ++ A S+F ++R F+
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMF----ISRAFSAKV 187
Query: 253 LVPLGPPLLAYSSKCKAMLAAVD----------------DAVQLVVDRPYKAGESIVVWC 296
L + P K +L +D D + VV AG+ I
Sbjct: 188 LSGVFPETDLSEEKLSVLLPIIDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY 247
Query: 297 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 336
GP+ N +L++NYGF NP D +V P Y K
Sbjct: 248 GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 118/301 (39%), Gaps = 68/301 (22%)
Query: 81 LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
+ WM NG+ C++ +PS N ++ A ++ Q P L +T
Sbjct: 18 FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71
Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
VL + + +L+ ++ + + L+L+ E+ +G+ SFW PY+ L + G
Sbjct: 72 VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125
Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELD----TVWFMAGSLFQKVSLAR 249
+PL +SE EL+ L G+ +A + G+ LD W A ++ S
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDRANSVFWTRALNIPCPHSFNN 184
Query: 250 RFA--------------------------------LVPLGPPLLAYSSKCKAM------- 270
RFA LVP G + K A+
Sbjct: 185 RFAVDLDSTTHKKPEESSAADTDDVKIPSSVWVEGLVP-GIDFCNHDLKAVALWEVDGPE 243
Query: 271 --LAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
+ V +++ LV +D G I + G + N +LL YGFV +NP D L+V + +
Sbjct: 244 GSVTGVPNSMYLVTGLDVVISNGSEIFISYGNKSNEELLYLYGFVLVENPDDYLMVRSTI 303
Query: 327 N 327
Sbjct: 304 G 304
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 95/457 (20%), Positives = 177/457 (38%), Gaps = 114/457 (24%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
D + G ++ +++D ++++W + P + EK + +S D++
Sbjct: 12 DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
G+ S+P +++ +V + + L NK+ +L A LY Y +
Sbjct: 61 EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W + E+ + S L +S+ E L GSP +++ + K +
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166
Query: 231 LDTVWFMAGSLFQKVSL-------------ARR--------------FALVPLGPPLLAY 263
L +++ F K ++ +R+ F V L P L Y
Sbjct: 167 LKSLF--KEQQFSKCAMTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAP-FLDY 223
Query: 264 SSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDEDNPY 317
+ + AA D A+++V +P K GE I + G Q NS LLI+YGF+D+ +
Sbjct: 224 FNHAEDAQAAYDFDYDESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSSTA 283
Query: 318 DRLV---VEAALNT---EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYL 369
V VE LNT DPQ +K + + + + +++F E IS L Y
Sbjct: 284 KHCVNVLVEELLNTIPASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKISKYLSYK 343
Query: 370 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML-- 427
++L L ++ YP T+ ED A++
Sbjct: 344 NF----------------------------SLLPYLKSLIDMKMKAYPTTMEEDRAIIEA 375
Query: 428 -TDYNLHPKKRVATQLVRMEK----KMLNACLQVTAD 459
T++ ++ + ++R+++ K + A +QV D
Sbjct: 376 TTEFEKLSQRSKMSIIMRLQEKETLKEIGALIQVKID 412
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 65/157 (41%), Gaps = 21/157 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
W NG + L+ P + AA +D+Q G VP +++T +G
Sbjct: 11 WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61
Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+ LL + K E L+++L+ E G SFW PYI L R + + L
Sbjct: 62 KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
+S EL LTGS E L+ I R+Y ++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKIF 151
>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
Length = 625
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 53/220 (24%), Positives = 87/220 (39%), Gaps = 44/220 (20%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 29 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83
Query: 212 PTKAEILERAEGIKREYNELDT----------------VWFMAGSLFQKVSLARRFA--- 252
A +R E +K +Y + T ++ A S+F ++R F+
Sbjct: 84 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMF----ISRAFSAKV 139
Query: 253 LVPLGPPLLAYSSKCKAMLAAVD----------------DAVQLVVDRPYKAGESIVVWC 296
L + P K +L +D D + VV AG+ I
Sbjct: 140 LSGVFPETDLSEEKLSVLLPIIDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY 199
Query: 297 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 336
GP+ N +L++NYGF NP D +V P Y K
Sbjct: 200 GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 15/108 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G++S W PY + L + + ++ + WS +EL L GS ++I
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI 145
>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 492
Score = 47.4 bits (111), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 15/108 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G++S W PY + L + + ++ + WS +EL L GS ++I
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI 145
>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 488
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 13/107 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
G S W PY + L ++ + W++ EL L GS I
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRI 145
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 97/246 (39%), Gaps = 41/246 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
+ AS +++ G+ VP+ V+ E VL E + + ++ E+ L + +M+E
Sbjct: 68 LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127
Query: 168 KKQGKKSFWLPYI-------------------REL------DRQRGRGQ----LAVESPL 198
+ +G +S W PY+ REL D+ GR Q + PL
Sbjct: 128 RWRGPESRWAPYLALLPDDMTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDAPTQVPL 187
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGP 258
LWSE ++ P G +R Y S + + A+VP+
Sbjct: 188 LWSEVVGPFIAEHPELG-----LPGGERGYELYRWATAAVASYSFILGDDKYQAMVPVWD 242
Query: 259 PLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 316
L + L + +Q++ R AG +V G N++LL YGFV+ N
Sbjct: 243 LLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYGFVERANR 302
Query: 317 YDRLVV 322
Y+ + V
Sbjct: 303 YNHIPV 308
>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
Length = 761
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 23/138 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G+KSF+ PY++ + + + + WS+ EL Y+ E I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152
Query: 225 KREYNELDTVWFMAGSLF 242
RE +L +W A +F
Sbjct: 153 VRE--DLKDLWNQAKEIF 168
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 58/124 (46%), Gaps = 22/124 (17%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838
Query: 225 KREY 228
KRE+
Sbjct: 839 KREF 842
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 102/434 (23%), Positives = 179/434 (41%), Gaps = 69/434 (15%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L L +W+ + GLP K+ ++ + + + S+ + G +VP+S +T
Sbjct: 50 ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
++V+ + +I L+ L +AL+L++E+ + + W ++ + A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--------------------DTVW 235
PL WSE EL+ L G+ + + + + +Y EL D +W
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEEQLFAPHREAFPPKSHQLDDFLW 213
Query: 236 FMAG--SLFQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAVQLVVD--RP 285
+A S ALVPL L+ + ++ + LA A LVV+ R
Sbjct: 214 AVATVRSRVHSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQALVVEAQRD 272
Query: 286 YKAGESIVVWCGP--------QPNSKLLINYGFVDEDNPY-------DRLVVEAALNTED 330
Y GE + + G + +S++L++YG +D D P ++ AL +D
Sbjct: 273 YAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFILSLALPEDD 332
Query: 331 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG-- 388
Y DK + + NG F + A E L D ++ + +
Sbjct: 333 KYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEPLFRNEAWG 392
Query: 389 -PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ-LVRM- 445
+ PVS ERAV + + + +A L GY ++ +D L D P R+ LVR+
Sbjct: 393 HMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRLEKAILVRLG 450
Query: 446 EKKMLNACLQVTAD 459
EK+ L+A L D
Sbjct: 451 EKETLDATLAFFED 464
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 116/284 (40%), Gaps = 33/284 (11%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW--FMAGSLFQKVSLAR----- 249
P+ W+E E+ L GS ++ ER + I+ +Y + ++ F + ++ AR
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDYGVICDLYPPFRDVATLEEFKWARMCVCS 271
Query: 250 -----------RFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 297
ALVP L Y + K + G + G
Sbjct: 272 RNFGLDINGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTLHRILGGAQVYDSYG 331
Query: 298 PQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
+ N + L+NYGF E+N + + + L+ DP Q K + +G +V
Sbjct: 332 QKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKARFWRMDGPEQRRV- 390
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVSPCMERAVLDQLADY 408
+ G + +A+ L LR+ V+D +EM + + ++ + P+S E A +++L
Sbjct: 391 RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLSVRNEVAAMERLLLL 448
Query: 409 FKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 450
L YP TL ED A L + L P +R A V EK +L
Sbjct: 449 TTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 116/299 (38%), Gaps = 43/299 (14%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK+ K LKEK + K RP + + ++ LQ G+ S+P ++T + V
Sbjct: 29 HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELD------------- 184
L N + E + T +S L L +L+ EK +KS W PY+ L
Sbjct: 89 L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPEIYSCPVCLEQKI 147
Query: 185 --------RQRGRGQLAVESPLLWSETELAYLTG--SPTKAEILERAEGIKREYNELDT- 233
R++ Q + L S + + P + + K + ++T
Sbjct: 148 VNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLFPKDVASVFNYQAFKWAWCTINTR 207
Query: 234 VWFMAGSLFQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAG 289
+M S Q+ +R L P LL + + + KA ++
Sbjct: 208 TVYMKHS--QRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYEITTVTQCHHY 265
Query: 290 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 348
+ + GP N +LL+ YGFV DNP+ + V ++ +DK QR KLS+
Sbjct: 266 NEVFICYGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDK----QRPKKLSI 320
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 36/74 (48%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R AL+PL + YS M D A+ L V + G+ GP N LL YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274
Query: 310 FVDEDNPYDRLVVE 323
FV++DNP D V+E
Sbjct: 275 FVEQDNPSDTYVLE 288
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 47.0 bits (110), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 36/72 (50%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
++PL S+K + V+ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 311 VDEDNPYDRLVV 322
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 100/231 (43%), Gaps = 27/231 (11%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+VPL L A + + A L + A+ + +P KAG+ I G P S LL YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIFNDYGELPRSDLLRRYGYV 289
Query: 312 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 361
DN VVE L N ED +Y +++ Q ++ + + ++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQLKLLDQL--EILEDGYCILRPSPEDT 346
Query: 362 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
+ D+LP L D+ E+Q ++S P+ E + L D ++++ Y
Sbjct: 347 LLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGAREARI---LLDAAQSKMGQYG 403
Query: 418 ATLSEDEAMLTDY-------NLHPKKRVATQLVRMEKKMLNACLQVTADMI 461
T+ ED+ +L + ++ +A Q+ EK++L A L + D +
Sbjct: 404 TTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQALLMMLQDFL 454
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A ++Q G+ F +P+ +V+ ++ N+ +A+ L L L + ++YE G+
Sbjct: 50 VVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKN--LGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
+S W Y + L + ++ + WSE E + L S ++ +R
Sbjct: 108 QSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKR 148
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 99/245 (40%), Gaps = 43/245 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+ A+ DLQ G+ S+P+S ++T E VL + + T + +S L L +L+ E+
Sbjct: 62 LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR----- 226
++S W PY+ L + P+ W E+E+ L +P + + LE+ +K
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTES 173
Query: 227 -------------------EYNELDTVWFMAGSLFQKVSLARRFAL------VPLGPPL- 260
Y L W + + RR L + P L
Sbjct: 174 WSFFVSLQPLFGGNITDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMAPYLD 233
Query: 261 LAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
L S + AA ++ ++ + + + + GP N +LL+ YGF+ +NP+
Sbjct: 234 LLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFIAANNPH 293
Query: 318 DRLVV 322
+ V
Sbjct: 294 RSVYV 298
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D W+ K G + E + E R V A +++ G +VP L+++
Sbjct: 5 DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58
Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ AE LL TNK + LA +L++E +G++SFW PY+ L RQ
Sbjct: 59 KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108
>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 56/279 (20%), Positives = 110/279 (39%), Gaps = 47/279 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
+ A +L+ G+ VP + ++T E ++ + ++ LS L++ L+YE +
Sbjct: 54 LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113
Query: 171 GKKSFWLPYIRELDR---------QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
GK+SFW PY+ L R + + L VE + +E +A + +L
Sbjct: 114 GKRSFWYPYLVHLPRDYDLLATFGEFEKQALQVEDAVWATEKAIAKCQFEWKEVGLLMEE 173
Query: 222 EGIKREYNELDTVWFMAGSLFQKVSLARRF----ALVPLG-------------------- 257
+K ++ W A + +L + L P+G
Sbjct: 174 LELKSKFRSFQ-AWLWASATISSRTLHVPWDSAGCLCPVGDLFNYDAPGDDLHTLEGPES 232
Query: 258 ------PPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINY 308
L+ + + ++ V L R Y+ GE +++ G N +LL +Y
Sbjct: 233 ANDVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHY 292
Query: 309 GFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGK 345
GF+ E+N D++ + E +L + + + ++GK
Sbjct: 293 GFMLEENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGK 331
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 84/389 (21%), Positives = 156/389 (40%), Gaps = 57/389 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+E+ + + +L + + LS LA+YL++ +
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS A + + ++
Sbjct: 87 ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136
Query: 226 REYNEL--------------------DTVWFMAGSLFQK-----VSLARRFALVPLGPPL 260
+Y +L D W + S++ + VS LV +
Sbjct: 137 DDYRQLLVPLLSQHRDLFPLDQFTIEDYKWALC-SIWSRAMDFAVSGTTSVRLVAPLADM 195
Query: 261 LAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
L +S +C A D + ++ + Y+ G+ + ++ G PN++LL YGFV DNP
Sbjct: 196 LNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPN 254
Query: 318 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSD 376
D + + P Y+ K + G S + K+ + +++L YLR+ + D
Sbjct: 255 DSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-D 310
Query: 377 TSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNL 432
S + + L G VS E VL L D + L G+ L + EA L DY
Sbjct: 311 ESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGSLLEGFGIPLEKLEAQLAAGDYPA 370
Query: 433 HPKKRVATQLVRMEKKMLNACLQVTADMI 461
A + E+++L + D++
Sbjct: 371 GGNAWAAAHVSAGEQRVLTRAKRTAEDLL 399
>gi|302754818|ref|XP_002960833.1| hypothetical protein SELMODRAFT_402226 [Selaginella moellendorffii]
gi|300171772|gb|EFJ38372.1| hypothetical protein SELMODRAFT_402226 [Selaginella moellendorffii]
Length = 322
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 2/117 (1%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSL-ARRFALV 254
S W +TEL+YL SP + ER E I E+ ++ F L Q + + + F V
Sbjct: 154 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQALDVWPQLFGKV 212
Query: 255 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
L Y++ LA +D+ + DR Y + I + G N++L ++YGF
Sbjct: 213 SLEDLKHVYATVFSRSLAIGEDSTLVTEDRDYAENDQIWINYGDLSNAELALDYGFT 269
>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 493
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 47/190 (24%), Positives = 78/190 (41%), Gaps = 30/190 (15%)
Query: 158 ACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
ACL L L++E+ G+ S + Y+ L + PL W+E E+ L G T AE
Sbjct: 133 ACLTVLRLLHERGLGESSPFHSYLSVLPQDH-------RLPLEWTEAEVGLLQG--TSAE 183
Query: 217 ILERAEGIKREYNELD-------TVW----FMAGSLFQKVSLARRFALVPLGPP------ 259
L A + ++ TVW + + V+ R +G P
Sbjct: 184 PLVGAGSLDSQFEAFQSVVAQHPTVWEPSVCTKAAFAKGVNWVRSRGFTVMGDPHMIPGA 243
Query: 260 -LLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 316
+ + +++ D V+ +P KAGE + G N++LL +YGFV N
Sbjct: 244 DMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQLLNSYGFVLPGNS 303
Query: 317 YDRLVVEAAL 326
+D +++ L
Sbjct: 304 FDTVLIPTQL 313
>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
Length = 484
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ G +P S + + N +IA LL +++ + L + +YE K
Sbjct: 40 VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W PY++ + R L V P WSETE L GS
Sbjct: 96 EKSHWFPYLKSI-RIYNDDGLLVLPPSHWSETEKLLLKGS 134
>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 704
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 79/194 (40%), Gaps = 40/194 (20%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 124 FFLIGQYLKGSEGFWFPYIRTLPQ-----PLSLTTPLYYEGGDLRWLDGTSLAPAREQRM 178
Query: 222 EGIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---- 277
K +Y T AG FQ V + L++ + K + AV D
Sbjct: 179 GVWKEKYKNGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVELPE 236
Query: 278 ---------VQLVVDRP-----YKAGESIVVWC---------------GPQPNSKLLINY 308
+ L+ RP ++AG+ V + GP+ N +L++NY
Sbjct: 237 DGVSVLLPCIDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPRNNEQLMMNY 296
Query: 309 GFVDEDNPYDRLVV 322
GF DNP D +V
Sbjct: 297 GFCLPDNPCDYRIV 310
>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
Length = 491
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 111/296 (37%), Gaps = 57/296 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + F +P L++++E E + L +L L + ++YE QG+
Sbjct: 61 LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYNE 230
+S W PY L + ++ + W+E +L L G +I + A E I ++
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVP 171
Query: 231 L---DTVWFMAGSLFQKVS-------LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 280
L + F A S + LA R +G ++AY+ + A DDA +
Sbjct: 172 LIQANPHHFPARSGMPPLDSNDALLCLAHR-----MGSLIMAYAFDIEKTEGADDDAAE- 225
Query: 281 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 340
Y DED P +V A + D Q + R+
Sbjct: 226 -------------------------DGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQ 260
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPIC 391
+ + + ++ AG E LP R GYV+D V+ SL IC
Sbjct: 261 EEGSFVMKAIRNIQAGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDSIC 316
>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
Length = 319
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 52/220 (23%), Positives = 88/220 (40%), Gaps = 48/220 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE++ GD A +P S +++ E + ++ L + ++ L L+ + E+
Sbjct: 64 MVASENIDVGDIALEIPESSIISEELLCQSDMFLALKDLDSITTETMLLLWSIRERYNPS 123
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F + Y L G L + LA L G+
Sbjct: 124 SKFKI-YFEALPANFNTG-------LSFGIDALAALEGTL-------------------- 155
Query: 233 TVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 292
LF ++ AR+ + P +L Y K V +++ + RP KAGE
Sbjct: 156 --------LFDELMQARQVS-----PHILNYGRVDK-----VTKSLKFPLSRPCKAGEQC 197
Query: 293 VVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 330
+ G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 198 FLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 237
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
S CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251
Query: 324 AAL 326
+
Sbjct: 252 EVI 254
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 86/388 (22%), Positives = 154/388 (39%), Gaps = 55/388 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+++ + + +L + LS LALYL++ K
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R L + R +A + + +++ EL GS A + + +
Sbjct: 87 ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136
Query: 226 REYNEL--------------------DTVWFMAGSLFQKVSLA-RRFALVPLGPPL---L 261
+Y +L D W + + + A A V L PL L
Sbjct: 137 DDYRQLLVALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSETASVRLVAPLADML 196
Query: 262 AYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 318
+S +C A D + ++ + Y+ G+ I ++ G PN++LL YGFV DNP D
Sbjct: 197 NHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLPDNPND 255
Query: 319 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDT 377
+ + P Y+ K + G S + K+ + +++L YLR+ + D
Sbjct: 256 SYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DE 311
Query: 378 SEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLH 433
S + + L G V+ E VL L D + L G+ L + EA L DY
Sbjct: 312 SNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFGIPLEKLEAQLVAGDYPAG 371
Query: 434 PKKRVATQLVRMEKKMLNACLQVTADMI 461
A + E+++L + D++
Sbjct: 372 GNAWAAAHVSAGEQRVLTRAKKTAEDLL 399
>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 477
Score = 46.2 bits (108), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 98/264 (37%), Gaps = 72/264 (27%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A++DL + FS+P + ++T NET L N EL L L +++E
Sbjct: 45 VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----------------- 211
+G KS W PY L +S + WS+ EL L GS
Sbjct: 97 LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKESADSTFTE 149
Query: 212 ---PTKAEILERAEGIKREYNELDTVWFMAGSLF-------------------------- 242
P A+ + R +EL ++ GS
Sbjct: 150 QLIPLIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEE 209
Query: 243 --QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 300
+SL + ++PL L A + A L DD V + +AGE + GP P
Sbjct: 210 EESAISLPK--GMIPLADMLNANADHNNAKLFYQDDKVVMKTLHAVRAGEELFNDFGPLP 267
Query: 301 NSKLLINYGFV-DEDNPYDRLVVE 323
S LL YG+V D+ YD VVE
Sbjct: 268 RSDLLRRYGYVTDQYAKYD--VVE 289
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 114/303 (37%), Gaps = 70/303 (23%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G+ I E + + R V A++D+ + F +P + ++++E +L
Sbjct: 13 AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A T L L L ++YE G S W PY L + + + WS
Sbjct: 70 TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120
Query: 202 ETELAYLTGSPTKAEI-------------------------------LERAEGIKREYNE 230
E ELA L S +I +RAE ++ E N
Sbjct: 121 EDELAELQASAVLNKIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNV 180
Query: 231 LDTVWFMAGSLFQKVSL------ARR-----------------FALVPLGPPLLAYSSKC 267
L + GSL + +R+ ++PL L A +
Sbjct: 181 L--LMHKMGSLIMAYAFDVEPATSRKDVDEEGFAEEEEDEALPKGMIPLADMLNADADCN 238
Query: 268 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 326
A L + +++ +P +AGE + GP P S LL YG+V D YD + + L
Sbjct: 239 NARLFYEEKYLEMKALKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYDVVEINMDL 298
Query: 327 NTE 329
TE
Sbjct: 299 VTE 301
>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
SB210]
Length = 726
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 76/173 (43%), Gaps = 19/173 (10%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLM 165
AA++D+ A S+PN ++++ +R +E +E L + K ++ A L ++ M
Sbjct: 67 AATKDIAPLTAFISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFM 126
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
YE+ +GKKS W Y L+ E+ L W+ E+ + + + E E +
Sbjct: 127 YERLKGKKSLWHEYFEILENN--------ETILTWTAEEINRIPDPYIQKQAREYKEQVD 178
Query: 226 REYNELDTVWFMAGSLFQKVSLARRFALVPLGPPL---LAYSSKCKAMLAAVD 275
++EL + + FQK + + L + Y+ K +++ D
Sbjct: 179 ELWDELKELLHSQPNFFQKATATKELFLWAYNIVMSRCFGYTQKGTSIVPFAD 231
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 118/291 (40%), Gaps = 25/291 (8%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +G + E PS +E + A D+ GD +P++L+++ +
Sbjct: 26 WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81
Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ L T L LALY+M E + ++SF+ PY+R L ES LL
Sbjct: 82 KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141
Query: 202 ETELAYLTGSPTK---AEILERAEGIKREYNELDTVWFMAGSLF-------QKVSLARRF 251
+ +L T + ++ A E E + Y EL T LF Q + +R
Sbjct: 142 DHKLVRRTAARSRQLLALYRETIEFLSSSYPELYTADRYTFELFDFAWRTIQARAFGKRL 201
Query: 252 ---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLL 305
ALVP L + + K + + + Y ++ G + N LL
Sbjct: 202 KSSALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNNRYPRNSEVLNSYGRRANDNLL 261
Query: 306 INYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHV 353
++YGF DN +D V +L + + P + ++ + +G+ +V++ V
Sbjct: 262 LDYGFAMLDNEWDAAEVICSLPPSHDQSPLDRRRKACLRASGQHTVRILRV 312
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 141/377 (37%), Gaps = 108/377 (28%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLF-----------QKVSLARRFALVPLGPPLLA 262
AE + + E +++ + ++F + + LA R +G ++A
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHR-----MGSTIMA 198
Query: 263 YS-------------------SKCKAMLAAVDDAVQLVVDRPYKA--------------- 288
Y+ + + ML V A L + + A
Sbjct: 199 YAFDLENDDENENEEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGESLEATAIRA 258
Query: 289 ----GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
G+ I+ + GP P S+LL YG+V E + YD + V L E V +
Sbjct: 259 DIKAGDQILNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSS 309
Query: 344 GKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 402
LS + + V + + E I D Y + S ++ + VSP + +
Sbjct: 310 LSLSAEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSPEL----V 362
Query: 403 DQLADYFKA-----------------------------RLAGYPATLSEDEAMLTDYNLH 433
+QL ++ KA RLA YP ++ DE +L + +L
Sbjct: 363 EQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLP 422
Query: 434 PKKRVATQLVRMEKKML 450
++R+A + EKK+L
Sbjct: 423 ARRRMAVVVRLGEKKLL 439
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 59/269 (21%), Positives = 105/269 (39%), Gaps = 43/269 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW + G+ + + PS + A+ D+QAG+ VP + +L+ V
Sbjct: 6 ELLSWATERGVKLSGIKPQNIPSRGTG------IIATRDIQAGETILFVPFKVFRSLKHV 59
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVESP 197
+ IA L N +S A LA YL +K + LP + + P
Sbjct: 60 --PKAIARRLPRN-MSLHALLAAYLTLDKTDTFAIANQTLPDLSSFE---------AGMP 107
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV------------WFMAGSL---- 241
LW EL P + ++ +R++ + V W + +
Sbjct: 108 FLWP-AELHPFLPKPALDLLKKQQRNFQRDWATVSKVYSNVSHEQYLHSWLLVNTRSFYC 166
Query: 242 ----FQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 297
+++ R A++P+ C+A A+ + + DR Y+AGE + + G
Sbjct: 167 TTPSMERLPHDDRLAILPVADLFNHADVGCEAQFAS--ENYSFIADRTYRAGEELYISYG 224
Query: 298 PQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
LL YGFV +N +D + ++ A+
Sbjct: 225 THSTDFLLAEYGFVPAENRWDVVCLDEAI 253
>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
Length = 437
Score = 46.2 bits (108), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 88/220 (40%), Gaps = 48/220 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE++ GD A +P S +++ E + ++ L + ++ L L+ + E+
Sbjct: 182 MVASENIDVGDIALEIPESSIISEELLCQSDMFLALKDLDSITTETMLLLWSIRERYSPS 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F + Y L G L + LA L G+
Sbjct: 242 SKFKI-YFEALPANFNTG-------LSFGIDALAALEGTL-------------------- 273
Query: 233 TVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 292
LF ++ AR+ + P ++ Y KA +++ + RP KAGE
Sbjct: 274 --------LFDELMQARQ-----VSPHIINYGRVDKAT-----KSLKFPLSRPCKAGEQC 315
Query: 293 VVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 330
+ G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 316 FLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 119/318 (37%), Gaps = 55/318 (17%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + LQ G
Sbjct: 22 SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
S+P S +++ + V+ + +T K S L L +L+ EK G +S W PY+ L
Sbjct: 76 SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER----------------------AE 222
+ P+ W E E+ L P +A+ E+ +E
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186
Query: 223 GIKREYNELDTVWFMAGSLFQKVSLARR----FALVP----LGP--PLLAYS--SKCKAM 270
++ + +W + V L R F+ P L P LL +S + KA
Sbjct: 187 AVENIFTYSALLWAWCTVNTRAVYLRHRQLRCFSAEPDTCALAPYLDLLNHSPDVQVKAA 246
Query: 271 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
++V + E + + GP N +LL+ YGFV NP+ + V + +
Sbjct: 247 FNEKTRCYEIVAVSSCRKHEEVFICYGPHDNHRLLLEYGFVSTRNPHACVYVSRDILVKY 306
Query: 331 PQYQDKRMVAQRNGKLSV 348
DK+M N K+S+
Sbjct: 307 LPSTDKQM----NKKISI 320
>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
Length = 766
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 66/155 (42%), Gaps = 21/155 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
V A ED++ +A VP L++T+E + I + NK E L +++
Sbjct: 27 VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE ++G +SFW PY+ L VE WS++++ + E I
Sbjct: 87 LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138
Query: 225 KREYNELDTVWF-----MAGSLFQKVSLARRFALV 254
+ E+ E++ + + +L K R FA V
Sbjct: 139 EIEWKEIELLLLHYPQHFSSALIDKHLFMRIFAQV 173
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 45.8 bits (107), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L +++++ +P GE + G PNS+LL YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272
Query: 312 D-EDNPYD 318
+ E + YD
Sbjct: 273 EWEGSKYD 280
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 14/102 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FS+P ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
QG+KS W PY R L + ++ + WS EL L S
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS 140
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 45.8 bits (107), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 77/361 (21%), Positives = 139/361 (38%), Gaps = 67/361 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++D+ A VP + + E++ E + T + LA L+ EK +G S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L + + L ++ ++ L G+ + L + I R+Y +L
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226
Query: 235 WFMAGS-------LFQKVSLAR---RFALVP------LGPPLLAYSSKCKAMLAAVDDAV 278
+ S LF + L R+A+ L P +A + ++A+
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMTRQNLVPREIATKDDGNSPISALIPCW 286
Query: 279 QLVVDRP---------------------YKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
+ RP KAG ++ G +PN+ LL++ GFVD +N
Sbjct: 287 DMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPNADLLVHNGFVDPNNNK 346
Query: 318 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----EAIS-DMLPYLRL 371
D + + L+ D +A++ +L ++ H G + E IS +L ++R+
Sbjct: 347 DFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPEYISGQLLAFVRV 399
Query: 372 GYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 425
+S S+++ + L C + +E FK L ATL E +
Sbjct: 400 FNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLLGVLEATLREADE 459
Query: 426 M 426
+
Sbjct: 460 L 460
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 45.8 bits (107), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
A+VPL L A + A L D +++ R K GE I G PNS LL YG
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404
Query: 311 VDEDNPYDRLVVEAALNT 328
VD+ N +D VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 45.8 bits (107), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 260 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
L + CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D
Sbjct: 187 LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDE 244
Query: 320 LVVEAAL 326
+ ++ +
Sbjct: 245 VYLDEVI 251
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 77/183 (42%), Gaps = 40/183 (21%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ + W PYI L + +++ W +TEL+YL SP + E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220
Query: 220 RAEGIKREYNELDT------------VWFMAGSLFQKVSL-----------ARRFAL--- 253
R E I E+ ++ VW LF KVSL +R A+
Sbjct: 221 RLEMITTEFGQVQNDFCTCVLEQALDVW---PQLFGKVSLEDLKHVYATVFSRSLAIGED 277
Query: 254 -VPLGPPLLAY----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 308
L P L + +S K + + + DR Y + I + G N++L ++Y
Sbjct: 278 STTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDY 337
Query: 309 GFV 311
GF
Sbjct: 338 GFT 340
>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 701
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + G+ V + P R A+ D+ GD SVP ++T E + +
Sbjct: 37 WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90
Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ + L +++S E L L ++ E +G +S + PY+ L R A
Sbjct: 91 ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145
Query: 196 SPLLWSETELAYLTGS 211
SPL WS ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161
>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
Length = 254
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 54/257 (21%)
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
EK Q + S W Y+ + + + L + E+ L GSP ++++ +++
Sbjct: 2 EKNQVETSKWKEYLSSCPK-------SFHNALDLTTEEIEELQGSPALDYLVQQKNDLRQ 54
Query: 227 EYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 286
Y +L F K+S A P + Y S+ L V+ VQL+ P
Sbjct: 55 LYEDL----------FPKLSQA--------FPEVQRYKSE----LGRVE--VQLLA--PV 88
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
KAGE I ++ G + L +GF D DNP D + E L +E + Q K M
Sbjct: 89 KAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELDL-SEMTELQRKAM-------- 139
Query: 347 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 406
+V+ ++ D LP RL + + + +S E+ V +
Sbjct: 140 --EVWEFRPDVQQLLKRDGLPSWRL----------LAMLRILHLNQLSVANEKLVWGTME 187
Query: 407 DYFKARLAGYPATLSED 423
+ A AGYP L ED
Sbjct: 188 ELLNAVTAGYPTRLEED 204
>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
dendrobatidis JAM81]
Length = 607
Score = 45.4 bits (106), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L LK W +N + + ++ + N R V A + L+ GD ++P +++++
Sbjct: 4 LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N +A++L L L + LM+E+ G+KS W YI+ L L P
Sbjct: 60 ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108
Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
L W + + A L G+ A +LE + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141
>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
Length = 660
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 57/120 (47%), Gaps = 17/120 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
+ A+ D+ A F++P + ++ +E R+L G AE L A L L
Sbjct: 41 IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++ E G++S+W PYI L + ++P+ W++ EL L G+ AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 45.4 bits (106), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 74/169 (43%), Gaps = 15/169 (8%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE W+H+NG C+ + + R + A++ L+ + +P L++
Sbjct: 18 EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71
Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
T++ ++ + + + +L+ LA++LM EK + +KSFW PYI L +
Sbjct: 72 TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
P ++E + L S + + + +EY EL + M LF
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLF 172
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 45.4 bits (106), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 84/391 (21%), Positives = 149/391 (38%), Gaps = 61/391 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D++AG+ SVP L+ + E + + L N + L + L+ EK +G S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEELLPEKQ---RQLFRNFPTHLK-VTYTLIMEKLRGADS 261
Query: 175 FWLPYIREL-------------DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
W P+I L QR RG A + + +A L S K ++
Sbjct: 262 PWQPFIDTLPSRYNTVLYFTVEQMQRLRGTSACSAAVRHCRV-IARLYASMYKCAFMQLD 320
Query: 222 EGIKREYNELDTVWFMAGSLFQKV--SLARRFALVPLG-----------PPLLAY----- 263
+ + L T + + L++ ++ R LVP L+ Y
Sbjct: 321 DSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDAANLPISALIPYWDMAN 380
Query: 264 --SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
S K + ++ YK+GE ++ G + N+ L++ GFVD NP D +
Sbjct: 381 HRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSNADRLVHNGFVDMQNPKDYVQ 440
Query: 322 VEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 376
+ L+ D + + ++ +R +L V H E +L ++R+ +S
Sbjct: 441 IRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHISGE------LLAFVRVFNMSK 494
Query: 377 ------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED------E 424
S+++ + L C + +E L K L ATL E E
Sbjct: 495 EQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKLLLGVLEATLKETDELKQLE 554
Query: 425 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 455
A+ + + Q R+E+++L+ LQ
Sbjct: 555 ALQQQADASEIDIMVLQYRRLERRILSDALQ 585
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 45.4 bits (106), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%)
Query: 297 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 341
GP+ N LL NYGFV E+NP D V L EDP Y K+ V +
Sbjct: 298 GPKGNENLLSNYGFVIENNPEDYFKVFLGLQQEDPLYTAKKAVLE 342
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 45.4 bits (106), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 64/168 (38%), Gaps = 43/168 (25%)
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTV-------------------------- 234
+E E++ L G+P +E + I+ +Y V
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPVLQALTAAYPDDITPDLVTEDKFIWAC 124
Query: 235 --WFMAGSLFQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLVVDR 284
W+ + V A R LVP+ P ++ Y L A D+++L R
Sbjct: 125 ELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRYGR-----LDAATDSLRLRAFR 179
Query: 285 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 330
AGE + GP PN KLL+ YGF DNP+D + EA N D
Sbjct: 180 HCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227
>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 90/225 (40%), Gaps = 33/225 (14%)
Query: 253 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 312
+VP+ L A A L + +++V +P KAGE I G PNS LL YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317
Query: 313 ----------EDNPYD------RLVVEAALNTEDPQYQDK--RMVAQRNGKLSVQVFHVH 354
NP D L V AA + QDK + N V
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDDTFV------ 371
Query: 355 AGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 413
G + + +++ + RL ++ D E S L P ++ VL D R+
Sbjct: 372 IGTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSRRI 426
Query: 414 AGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 455
Y T+ +DEA+L N P K+ A + EK++L+ LQ
Sbjct: 427 NEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471
>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
Length = 748
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 8/77 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP +L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIR 181
EKK+G++SF+ PYI+
Sbjct: 106 FNEKKKGEQSFFYPYIQ 122
>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
Length = 805
Score = 45.1 bits (105), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)
Query: 69 EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
E +K+ EDL + W+ +NG+ KV L+ H + + A + ++ + F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMGNS----LYARQMIKEDELLFRI 595
Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YE-------- 167
P + + V + T+ ++ ++ E L+L LM YE
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655
Query: 168 ---KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
K + SFWLPYI+ L + +P+ W+E E L GS + +
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708
Query: 225 KREYNELDTV 234
+RE+ + V
Sbjct: 709 RREWEMMKIV 718
>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 503
Score = 44.7 bits (104), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
A+VP+ L A A L +++V +P +GE I G PNS LL YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314
Query: 312 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 355
D NP D + V A LN + + Q + + G V VF
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVFT--- 371
Query: 356 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 415
R+ E S+++ +RL + T ++ P V +L + RL
Sbjct: 372 -RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427
Query: 416 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 455
YP ++ +DEA+L L KR A + EK +L L
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 59/139 (42%), Gaps = 12/139 (8%)
Query: 78 LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
LG+ WM G+PP ++L + E + + G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ L A ++E +AL L+ E+ G SFW PYI L
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226
Query: 194 VESPLLWSETELAYLTGSP 212
V L+W E +LA L SP
Sbjct: 227 VNPTLVWPEGDLALLEASP 245
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
+D + LV + + +G + GP PNS+LL+ +GF DNP++ + + A + P +
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288
Query: 335 DKRMVAQRNG 344
+K + NG
Sbjct: 289 EKNKIMLDNG 298
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
V L V PY++G+ I GP+ N LL+ YGF DNP+D +V+
Sbjct: 292 VGLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336
>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
Length = 452
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 70/174 (40%), Gaps = 31/174 (17%)
Query: 194 VESPLLWSETELAYLTGSPTK-----------AEILERAEGIKREYNELDTVWFMAGSLF 242
V+ PL W + + L G +++ + G ++ + ++ L
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHGARFYSQVYKHLFGANNQFVNAEAFFWAISILM 202
Query: 243 QKVSLARR--FALVPLGPPLLAY---SSKCKAMLAAVDDAVQ---------LVVDRPYKA 288
+ + + FAL+P S C+ L + D+ VQ + R Y+
Sbjct: 203 SRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQDFDMQKGFTIHTTRSYEP 261
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL---NTEDPQYQDKR 337
GE + + G N +LL NYGF +NPYD L + AAL N DP + KR
Sbjct: 262 GEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQPNEADPAFAQKR 315
>gi|19114522|ref|NP_593610.1| lysine methyltransferase Set8 (predicted) [Schizosaccharomyces
pombe 972h-]
gi|74626633|sp|O14135.1|SET8_SCHPO RecName: Full=SET domain-containing protein 8
gi|2462684|emb|CAB16739.1| lysine methyltransferase Set8 (predicted) [Schizosaccharomyces
pombe]
Length = 429
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 78/189 (41%), Gaps = 27/189 (14%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWL---PYIRELDRQRGRGQLAVESPLLW 200
+IA L N +S L+L Y WL P + D+ L++ +P W
Sbjct: 69 SIATLEEWNDMSFRTQAMLFLCY--------LWLGIQPRTNKWDKFLTVLPLSINTPAQW 120
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDT-------------VWFMAGSLFQKVSL 247
E E+ L G+ + + + +++E+ L+ W A +LF L
Sbjct: 121 PEKEVYSLQGTSIFNPVCVKRKILQQEWLSLNQRYSDSWPSKITLPKWVHADALFHSRCL 180
Query: 248 ARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS-KL 304
F L P L +SSK A + +DA+QL +D+ E + + G + S +
Sbjct: 181 ESPFKDPVLAPVIDLCNHSSKSNAKWSFSEDAMQLYLDKDIDENEEVTINYGSEKGSAEF 240
Query: 305 LINYGFVDE 313
L +YGF+ E
Sbjct: 241 LFSYGFLPE 249
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
F LVPL + +SS C D A +L + RP K GE IV G + L YG
Sbjct: 154 FTLVPL-LDMANHSSTCANATVKYDHAHFELKLTRPVKRGEEIVFEYGGHDQATLWAEYG 212
Query: 310 FVDEDNPYDRLVVEA 324
F++ NP++R+ + A
Sbjct: 213 FIESSNPHERIDLTA 227
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 3/85 (3%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L + ++KC A L +D++++ KAGE + G PN+++L YG+V
Sbjct: 229 SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQVYNIYGNHPNAEILRRYGYV 288
Query: 312 D-EDNPYD--RLVVEAALNTEDPQY 333
+ E + YD L +E + T QY
Sbjct: 289 EWEGSKYDFGELPLEVIIETLHEQY 313
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 93/206 (45%), Gaps = 40/206 (19%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
+S LAL+L+ ++ G KS W ++ L DR+ G ++ PL WS+ + LT P
Sbjct: 79 MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132
Query: 213 TKAEILERA-EGIKREYNELDTV---------------WFMAGS--LFQKVSL--ARRFA 252
+L++ + + +Y++ T W S L+ ++L ++ A
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKTFVAKYDSDPRDAYLWAWLCVNSRCLYFDLTLTTGKKDA 192
Query: 253 L-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 300
+ L P L+ +S + C+ +++ + L R Y A E I + GP+
Sbjct: 193 QEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYTADEEIFLCYGPRS 251
Query: 301 NSKLLINYGFVDEDNPYDRLVVEAAL 326
NS LL YGF +NP+D + + AL
Sbjct: 252 NSVLLCEYGFTVPENPWDDVDISDAL 277
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/154 (22%), Positives = 71/154 (46%), Gaps = 15/154 (9%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
D D+ W NG KV+L++ + + PI + A ED++AG+ S+P +L+
Sbjct: 25 DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
RV + + ++ + + + LYL+ E+ S W P+++ L +
Sbjct: 78 TRVSDHSPVVHMIENTTIGRITAICLYLISERADS-SSHWKPWLQSLPPRFFHA------ 130
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
L +SE ++ + S K + + +++EY +
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQ 163
>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 445
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 118/284 (41%), Gaps = 45/284 (15%)
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPY 179
G+ ++P ++ T+E + + +L + + LS LA Y+++ +
Sbjct: 37 GERILTIPRDVLWTVEHAYADPLLGPVLRSARPPLSVDDTLATYILFVRS---------- 86
Query: 180 IRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
RE R LA S + ++E EL G+ A + I+ +Y L
Sbjct: 87 -RESGYDGLRSHLAAVPKSYSSSIFFTEDELEVCAGTSLYAITKQLGRCIEDDYRALVYK 145
Query: 235 WFMAGSLFQKVSLARRFAL-----VPLGPP---LLAYSSKCKAMLAA--VDDAVQLVVDR 284
W +L S A FAL V L P +L +SS+ + A + + ++ +
Sbjct: 146 W----ALCTVWSRAMDFALPDGKSVRLLAPFADMLNHSSEVRQCHAYDPLSGNLSILAGK 201
Query: 285 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 344
Y+AG+ + + G PN++LL YGFV NP D + + P ++ KR + G
Sbjct: 202 GYEAGDQVFIHYGSVPNNRLLRLYGFVIPSNPNDSYDLVLETHPLAPFFEQKRKLWALAG 261
Query: 345 KLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV 383
S + ++D LP YLR+ SD S++ +V
Sbjct: 262 LDSTSTISL-------TLTDPLPNNVLRYLRI-QRSDESDLAAV 297
>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
Length = 770
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
A ED+Q +A +PN ++T+ER +E + +++ + L +++M E
Sbjct: 67 AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126
Query: 168 KKQGKKSFWLPYIRELD 184
++G +SFW PY +D
Sbjct: 127 FQKGDQSFWYPYFNAVD 143
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 69/151 (45%), Gaps = 13/151 (8%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ +G K+ E PS+ + I A +D+ + + S+P L++T + L +
Sbjct: 74 AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
I ++ N S+ L + LM+E+ +G+ SF+ PY+ L R ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNEL 231
+E L++L E+ R + Y L
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRL 213
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 15/140 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P SLV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N + +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEI 217
+ W++ EL L+GS +I
Sbjct: 126 MFWTDEELRELSGSAVLNKI 145
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 61/134 (45%), Gaps = 15/134 (11%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P +LV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N ++ +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGS 211
+ W++ EL L+GS
Sbjct: 126 MFWTDEELRELSGS 139
>gi|397614688|gb|EJK62954.1| hypothetical protein THAOC_16417 [Thalassiosira oceanica]
Length = 383
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 57/244 (23%), Positives = 96/244 (39%), Gaps = 41/244 (16%)
Query: 97 LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE 156
L+ + + + ++R Y AS+ + D VP + +T E
Sbjct: 61 LQLRSTQDGQYR---YATASQPIAPNDTILRVPLASCLTAEN------------------ 99
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-GSPTKA 215
L LA L YE+ G +S + PYI+ L QL V P W + +A +T G ++
Sbjct: 100 LEALAERLKYERDAGARSKFEPYIKMLPSFE---QLKV-LPRFWDDNLVATVTDGGQLES 155
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQKVSLARRFALVPL--------GPPLLAYSSKC 267
I+ R+ N++ W +A + + ++L P A K
Sbjct: 156 RII-------RDKNDILDPWALACVDSRANFMQDCYSLTPFLDMINHDASLQTKARVEKN 208
Query: 268 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 327
+ + D + L YK GE + G N L +YGF+ EDNP ++ ++ L
Sbjct: 209 RGFASTEGDVLCLQSSTYYKEGEECFISYGDLSNLDALCDYGFLTEDNPCNKESIQVRLI 268
Query: 328 TEDP 331
+P
Sbjct: 269 GSEP 272
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R AL+P+ CK +A D +V DR YK GE + + N +L+ YG
Sbjct: 172 RLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILLEYG 229
Query: 310 FVDEDNPYDRLVVEAAL 326
F+ +++ D + ++ A+
Sbjct: 230 FIPDESLDDDVYIDDAV 246
>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
Length = 728
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
+WM K G+ V + P H + A+ D++ GD VP + ++T + V G
Sbjct: 51 AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104
Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+ +A L L E L L +M E +G++S + PY+ L R A
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159
Query: 195 ESPLLWSETELAYLTGS 211
SPL WS EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176
>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Galdieria sulphuraria]
Length = 546
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 69/164 (42%), Gaps = 27/164 (16%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
E +L++W+ NG+P +K KP HN + A L+ G+ ++P
Sbjct: 71 EKTEELENWLFDNGVPS----IKGKPVLSPHNCRT-----FRAKIPLKLGEEVLAIPERF 121
Query: 133 VVTL---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T E++LG + LS+ +A L+ E + + SFW P+I L
Sbjct: 122 WLTKQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDEL 174
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
L+WS E YL S T +IL E + EL+T
Sbjct: 175 HHF-----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT 213
>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
Length = 478
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 75/177 (42%), Gaps = 10/177 (5%)
Query: 288 AGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTEDPQYQDKRMVAQ 341
AG I+ + GP P+S+LL YG+V E + YD +V AL E +D +
Sbjct: 275 AGSEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEELKLSEDTIADIE 334
Query: 342 RNGKLSVQVFHVHAGREKEAIS----DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 397
R + ++ F V E S P LR + ++ + +L P
Sbjct: 335 RKLESELEEFFVIERDAGEPSSYGTLTQPPVLREISTELEEQTKAFLKALKKRDPKRKRS 394
Query: 398 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 454
E L + RL YP + +DE++L+ L + R+A ++ EK++L L
Sbjct: 395 ETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRLGEKRLLQEAL 451
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 78/193 (40%), Gaps = 45/193 (23%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 90 LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147
Query: 222 EGIKREYNELDTV----------------------W---------FMAG---------SL 241
+ +KRE+ E + W F A +L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207
Query: 242 FQKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 298
+ VS+ L+PL G +A + + A A QL V R ++ G+ I P
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSP 267
Query: 299 QPNSKLLINYGFV 311
+ N++LL+ YGF+
Sbjct: 268 KTNAELLLGYGFM 280
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 83/377 (22%), Positives = 140/377 (37%), Gaps = 108/377 (28%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLF-----------QKVSLARRFALVPLGPPLLA 262
AE + + E +++ + ++F + + LA R +G ++A
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHR-----MGSTIMA 198
Query: 263 YS-------------------SKCKAMLAAVDDAVQLVVDRPYKA--------------- 288
Y+ + + ML V A L + + A
Sbjct: 199 YAFDLENDDENENEEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGESLEATAIRA 258
Query: 289 ----GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
G+ ++ + GP P S+LL YG+V E + YD + V L E V
Sbjct: 259 DIRAGDQVLNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSC 309
Query: 344 GKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 402
LS + + V + + E I D Y + S ++ + VSP + +
Sbjct: 310 LSLSAEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSPEL----V 362
Query: 403 DQLADYFKA-----------------------------RLAGYPATLSEDEAMLTDYNLH 433
+QL ++ KA RLA YP ++ DE +L + +L
Sbjct: 363 EQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLP 422
Query: 434 PKKRVATQLVRMEKKML 450
++R+A + EKK+L
Sbjct: 423 ARRRMAVVVRLGEKKLL 439
>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 392
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
K+E+ DL W NG + E +++ + A+ +++A +
Sbjct: 74 KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121
Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
VP L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++PL + E E+ YL + ++ + + R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 73/159 (45%), Gaps = 18/159 (11%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 6 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 59
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 60 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 114
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 115 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 149
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 113/282 (40%), Gaps = 58/282 (20%)
Query: 81 LKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT---L 136
L SW+ H +G +I+ + ++ + A+ DL AG S P++L +T
Sbjct: 10 LGSWLRHHDGFIHEHLIVVQDELGDKS------IIATTDLPAGTCIASCPHTLAITPTSA 63
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL--PYIRELDRQRGRGQLAV 194
LG+ +LS+ + LYL+ K L Y+ L + A+
Sbjct: 64 RAALGHHA-------TELSDHQAMVLYLVLHKHPSPAVCCLHQAYVDTLPPRS-----AM 111
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQ--------- 243
+PL ++ E+ L G+ + +R + E+ TV AG LF+
Sbjct: 112 RTPLWFNPAEVQLLQGTNLAGAVTDRQRDWQLEWM---TVLRRAGQSGLFKASFEETWPS 168
Query: 244 -----KVSLARRF----------ALVP-LGPPLLAYSS----KCKAMLAAVDDAVQLVVD 283
+ +R F A P L P + A++ K ++ LV D
Sbjct: 169 ALWAATILSSRAFPSHLIDGNEQASTPVLFPGVDAFNHQQARKVTWQTSSASGRFNLVQD 228
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 325
P AG+ + GP+ N + L+ YGF+ +NP D +V++ A
Sbjct: 229 EPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 341
G I + GP N +LL YGFV E N +DRL EAA+ E + +R +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 401
L+ + H GR A ++ LR+ ++++ E ++ + GP + E V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473
Query: 402 LDQLADYFKARLAGYPATLSEDEAMLTD 429
+AD + L + +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 97/239 (40%), Gaps = 37/239 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D + + S+P L++T + ++ L L + + EK+Q K
Sbjct: 35 ATTDFRENETIISIPVGLIITAGFIAEMPDYCDVFKRYCLKPFEALVYFFLVEKEQNSK- 93
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
W PY+ L + A P L E + Y K +++ E +K Y +
Sbjct: 94 -WTPYLEVLPKSFSTP--ASLHPSLKPE-DFPYCL---RKQWYVQKNE-LKIMYEKFVTI 145
Query: 232 ---DTVW---FMAGSLFQKVSLARR--------------FALVPLGPPLLAYS--SKCKA 269
+T+W A + + R A+VPL +L +S S+C A
Sbjct: 146 LADNTIWDHFLWAWHIVNTRCIYRNNKLHPLIDNTEDDSLAIVPL-IDMLNHSNDSQCCA 204
Query: 270 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 328
+ + + +++V RP + GE I + G N L I YGF +DN D+ VE +L +
Sbjct: 205 IWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKDNICDK--VEISLGS 261
>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 518
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 91/211 (43%), Gaps = 25/211 (11%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 334
D+ + +P K G+ I G + N LL+ YGF N YD + +N Q
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309
Query: 335 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 375
+ +++V Q + K+++ + +K I+ D++ YLRL +
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369
Query: 376 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 433
+++ V+ SL PVSP E VL L+ + T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425
Query: 434 PKKRVATQLVRMEKKMLNACLQVTADMIMLL 464
K R A +K++L + + ++LL
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILL 456
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 263 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 318
+S A + ++ V+LV R KAG+ I + G N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303
>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
Length = 512
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A++D+ AG VP L++++E + + L ++ L+ LA++L+ E +
Sbjct: 37 VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96
Query: 171 GKKSFWLPYIREL 183
G SFW PY+R L
Sbjct: 97 GAASFWQPYLRSL 109
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 114/326 (34%), Gaps = 82/326 (25%)
Query: 118 DLQAGDAAFSVPNSLVVTL-ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
D+ + + F++P SLV+T + + EL L + ++YE +G+ S W
Sbjct: 55 DIASDEELFAIPRSLVLTTATSSIPRSVLKELEDKGATGAWPPLIVTIIYEYLRGESSPW 114
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN------- 229
PY + L + + W++ ELA L S +I R + E+
Sbjct: 115 HPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGRRQ--AEEEWQNTIIPTM 165
Query: 230 --------------ELDTVWFMAGSLFQKVSLARRF------------------------ 251
+L + MAGSL +
Sbjct: 166 ADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDNDKDGADSADDEFEED 225
Query: 252 -------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 304
+VP L A + K A L D + + +P AGE I GP P S L
Sbjct: 226 DEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAGEQIFNDYGPLPRSDL 285
Query: 305 LINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 363
L YG+V D YD + L E VA ++ K QV+ RE+E
Sbjct: 286 LRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKDQVW-----REREQQL 331
Query: 364 DMLPYLRLGYV-----SDTSEMQSVI 384
D L L GY DT +Q V+
Sbjct: 332 DELGVLDDGYAITRPEYDTQGLQDVL 357
>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 692
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 73/192 (38%), Gaps = 38/192 (19%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTV-------------WFM---AGSLFQKVSLARRFALVPL-GPPLLA 262
+K EY+ ++ W + A +L + R L P L
Sbjct: 163 YRNTVKEEYDSAISILRDEGCLAVESYSWDIFCWAYTLIASRAFTSRVLDAYLSNHPTLK 222
Query: 263 YSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCGPQPNSKLLI 306
+ + ML VD + + L V P GE I GP N +L+
Sbjct: 223 QDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEIHNNYGPLNNQQLMT 282
Query: 307 NYGFVDEDNPYD 318
YGF DNP D
Sbjct: 283 TYGFCIVDNPCD 294
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 140 EWDGSKYD 147
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 95/423 (22%), Positives = 156/423 (36%), Gaps = 91/423 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +D+ + FS+P S ++ +E + + I T L L L ++YE G
Sbjct: 40 VVAKQDIAEHELLFSIPRSSILGVENSILSTEIPPA-TFAHLGPWLSLILIMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
S W PY L ++ + W+E ELA L S +I EG + E
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKI--GKEGANEVFIEQL 149
Query: 231 LDTVWFMAGSLFQKVSLARRFA-----------LVPLGPPLLAYS--------------- 264
L + A +F A+ A + +G ++AY+
Sbjct: 150 LPVIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEPAISDKEVDEE 209
Query: 265 --------------------------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 298
+C A L D +++ +P +AG+ I GP
Sbjct: 210 GFAEEEEDAALPKGMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEIFNDYGP 269
Query: 299 QPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTEDPQYQDKRMVAQRNGKL 346
P S LL YG++ D YD L + + E +Y D++ + +
Sbjct: 270 LPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEERIEYLDEQEIVDTGYDI 329
Query: 347 SVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 405
+ + F + +E++S L L + E + + S G + P + M L
Sbjct: 330 AASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKGRL-PKAEKMTGKAAKFL 382
Query: 406 ADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQLVRM-EKKMLNACLQVTA 458
+AR+A YP TL +D E ++RVA + VR+ EKK+L + A
Sbjct: 383 YKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARAVRIGEKKLLVQTEERLA 442
Query: 459 DMI 461
D I
Sbjct: 443 DKI 445
>gi|242009061|ref|XP_002425311.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212509085|gb|EEB12573.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 399
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 74/181 (40%), Gaps = 22/181 (12%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-----------LWSETELAYL 208
L+++LMYE S + YI+ L + +S + + +T+L +L
Sbjct: 95 LSIFLMYENHLKNDSKYFNYIQTLPQSYSNVYFCTDSEIQLLPDLIKKLVVTQKTDLEFL 154
Query: 209 ---TGSPTKAEILERAE-GIKREYNELDTVW--FMAGS---LFQKVSLARR-FALVPLGP 258
+ EI + IK+ YN + +W F + ++ S+ ++ AL P
Sbjct: 155 FEKLQNNLNDEICSHCDKSIKKLYNRYEFIWAWFTVNTRSVYYEDKSMRKKSLALAPFLD 214
Query: 259 PLLAYSSKCKAMLAAVDDAVQLVVD-RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
S M D+ + ++ ++ + I + GP N KLLI YGF+ N Y
Sbjct: 215 MFNHSSDANTKMYIDFDNELYILKTLNSFRKHQQIFIKYGPHSNLKLLIEYGFIIPCNHY 274
Query: 318 D 318
D
Sbjct: 275 D 275
>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 421
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 69/312 (22%), Positives = 118/312 (37%), Gaps = 76/312 (24%)
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
+P ++ A S+ G A+ +P LVVT ++ G + + T + + L +YL Y
Sbjct: 28 KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKD--TYKNANHSSLLKIYLCYS 81
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ Q +SF PY+ L + A++SP +WS + A L G+ + E + E
Sbjct: 82 RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134
Query: 228 ---------------------------------------YNELDTV-W-----FMAGSLF 242
+NE+DT W ++ SL
Sbjct: 135 WWNAINLLPEDVPKPEQHFINLKFYYENKFYTDDDYYSYFNEVDTSNWTSFPNYLWASLV 194
Query: 243 QKVSLARRFALVPLGPP----------LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 292
K + + P P LL ++ K K + D D +GE +
Sbjct: 195 LKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKTKVQWSGTDGGFLFQSDDA-SSGEEL 253
Query: 293 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 352
G + N +LL+ YGF E+NP D AAL + P D ++ ++ + + H
Sbjct: 254 FNNYGQKGNEELLLAYGFAIENNPADS----AALKIKIP---DSKLQVVKDLGIKLPSIH 306
Query: 353 VHAGREKEAISD 364
+ + +SD
Sbjct: 307 DYTNSVIDQVSD 318
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 15/107 (14%)
Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
V A+E++ GD FS+P + ++T + T + EL ++ + + L +L
Sbjct: 45 VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
E+++G S W PYIREL +PL WS E L G+
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT 144
>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
Length = 504
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 102/242 (42%), Gaps = 51/242 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ED++AG+ SVP + ++T NE++ +L T+ LS LAL+L+ + K
Sbjct: 1 MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY---- 228
+S+W PY L + S L L++L S K E +++ + I +Y
Sbjct: 54 QSWWKPYTDLLPMHFNTMPVNYPSEL------LSHLPNS-LKQETMQQKDNIHTDYVTCL 106
Query: 229 --------------NELDTVWFMAGSLFQKVSL------ARRFALVPLGPPLLAYSSKCK 268
E W + +++ AL P+ L ++++ K
Sbjct: 107 KFCKSKQLPQDITAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPM-LDFLNHTTEAK 165
Query: 269 AMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
++ + R YK GE + + GP N +L YGFV +N Y+ ++
Sbjct: 166 -----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYGFVLNENIYNFVL 220
Query: 322 VE 323
++
Sbjct: 221 LD 222
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
A + RPY AGE + + G N LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+VPL L A + +C A L D +++ +P +AGE I GP P S LL YG+V
Sbjct: 223 GMVPLADMLNADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYV 282
Query: 312 DEDNPYDRLVVE 323
DN VVE
Sbjct: 283 T-DNYAQYDVVE 293
Score = 39.7 bits (91), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 13/136 (9%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ K+G I E + + R V AS+++ + F +P + ++++E +L
Sbjct: 13 AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEI 217
E ELA L S +I
Sbjct: 121 EDELAELQASAVVGKI 136
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 87/217 (40%), Gaps = 41/217 (18%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
A +LM + + FW PYIR L G+ + +PL + E E L +L G + A
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161
Query: 219 ERAEGI-----KREYNELDTVWF------------MAGSLFQKVSLARRF--ALVP---- 255
ER I +R Y L + F A ++ + + +++P
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEGVEGYTWDLYLWASTIISSRAFTAKVLASVIPELKN 221
Query: 256 --------LGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 305
L P + A + K K A D++ LVV AGE + GP+ N +L+
Sbjct: 222 AEVDRVSVLLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGPRNNEQLM 281
Query: 306 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 342
+NYGF DNP + VV + P Q K Q
Sbjct: 282 MNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQH 318
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 93/222 (41%), Gaps = 21/222 (9%)
Query: 223 GIKREYNELDTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQ 279
GI+ L TVW VS L+ +L +SS +C A D +
Sbjct: 166 GIEHYKWALCTVWSRGMDF--TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LS 222
Query: 280 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 339
++ + Y G+ + ++ GP PN++LL YGFV +NP+D + + P Y+ K +
Sbjct: 223 ILASKDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERL 282
Query: 340 AQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDT--SEMQSVISSLGPICP 392
+ G + + A +D LP YLR+ + ++ M I++ G
Sbjct: 283 WKLAGLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIAT-GADEK 334
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 434
+S E +L L D A L G+ L A L +++P
Sbjct: 335 ISDDSETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 94/249 (37%), Gaps = 52/249 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
AA+ + AGD A ++P + T+ L G A + L E AL+L+ E+
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAYLTGSPTKAEILERAEG 223
G+ S W I L G G+ + +P+ W E A L G+P A+ + +E
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305
Query: 224 IKREYNEL-------------------------DTVWFMAGSLFQKVSLARRFALVP--- 255
+ R++ L W G Q +P
Sbjct: 306 LARQHAALFPALSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASPGEPAATCLPPVA 365
Query: 256 ------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
L P ++ YS D ++L V R AGE + V G + N++LL+ YG
Sbjct: 366 MLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSYGAKSNAELLLFYG 418
Query: 310 FVDEDNPYD 318
F NPYD
Sbjct: 419 FALPGNPYD 427
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)
Query: 260 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 318
LL + + K +D V V K GE + G + N LL++YGFV + NPYD
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288
Query: 319 --------RLVVEAALNTE 329
+ +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R AL+P+ + C+ + + +V DR YK GE + + N L+ YG
Sbjct: 179 RLALIPVADLFNHADAGCRVYYSP--EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYG 236
Query: 310 FVDEDNPYDRLVVEAAL 326
FV ++NP D + ++ +
Sbjct: 237 FVPDENPSDDVYIDDVI 253
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 92/405 (22%), Positives = 156/405 (38%), Gaps = 87/405 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + N + + +L+ E LA+ L+YEK
Sbjct: 47 VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G KS W +I L R + L + EL L GS + E + +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159
Query: 229 NELDTVWFMAGSLFQKV-----------------------SLARRFALVPLGPPLLAYSS 265
L + LF+ V ++ RF VP+
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWSRFGDVPVA------KQ 211
Query: 266 KCKAMLAAVD------------------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 307
KAM+ D +LV + + AG + + GP N KLL
Sbjct: 212 SFKAMVPVFDMLNHDPEAEMSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHKLLAL 271
Query: 308 YGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 364
YGFV NP+D VE L ++ +Q+K + NG HA E ++D
Sbjct: 272 YGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHATNPFELVAD 322
Query: 365 -----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
+L R+ + E + + + +S E+ L +L + L +P
Sbjct: 323 ESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYTLEKMLESFP 382
Query: 418 ATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 456
++ ED+ +L TD NL+ +R+A + R +K +L+ + +
Sbjct: 383 TSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
Length = 209
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 90/450 (20%), Positives = 177/450 (39%), Gaps = 84/450 (18%)
Query: 71 VSKKEEDLGDLKSWM-HKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+S EDL + W+ +KN L P I+ P + A+ +++ + +
Sbjct: 202 ISTTPEDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRS------MVANTNIKKDEILVEI 255
Query: 129 PNSLVVTLERVLGN--ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
P +++T + ++ N I + + K+S A+ ++Y + S+W Y+ L +Q
Sbjct: 256 PKGIMMTPKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ 314
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN----------------- 229
+ + ++ E+ L SP R G+ R Y+
Sbjct: 315 -------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDS 367
Query: 230 -------ELDTV--------------WFMAGSLFQKVSLARR-FALVPLGPPLLAYS--- 264
+LD + W + + SL+ +VPL A +
Sbjct: 368 TKTKTKSQLDAMKEFKDDRYTLDQFKWALGCVWSRAFSLSEEDGGMVPLADMFNADTVIS 427
Query: 265 -SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP---NSKLLINYGFVDED-NPYDR 319
SK ++A ++ + +AGE I G + ++L++YGF+ ED + D
Sbjct: 428 RSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSADS 487
Query: 320 LVVEAA-LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--D 376
+V A + +P Y KR + Q NG + + F + + + ++ + R+ ++ +
Sbjct: 488 TIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNKLAKELFLFARIKSINKKE 543
Query: 377 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML-------TD 429
+ + + S ++P E+A L L++ L Y T+ +D +L T+
Sbjct: 544 SDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQDNQILKEIEKDKTN 603
Query: 430 YNLHPKKRVAT----QLVRMEKKMLNACLQ 455
N H T +L MEK +LN+ L+
Sbjct: 604 TN-HSSVTFNTINAIKLRLMEKNILNSFLK 632
>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
Length = 195
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 274 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 330
V +++ + RP KAGE + G P S L+ YGF+ DNPYD L ++ +++ ED
Sbjct: 14 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 43/178 (24%), Positives = 80/178 (44%), Gaps = 17/178 (9%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
+R V EE W+H+NG C+ + + E R + A++ L+ +
Sbjct: 19 TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72
Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+P L++T++ ++ + +A + ++L+ LA++LM EK + +KSFW PYI L
Sbjct: 73 VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
+ P ++E + L S ++ E +E+ EL + M LF
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNSLRGKAKAKKYE-CHKEFMELAPFFKMLADLF 181
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 51/243 (20%), Positives = 98/243 (40%), Gaps = 43/243 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G++
Sbjct: 64 ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGER 123
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR------- 226
S W PY+ + P+ W E E+ +L +P + + LE+ ++
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQKTEVQELHTESLA 175
Query: 227 -----------------EYNELDTVWFMAG--SLFQKVSLARRFA----LVPLGP--PLL 261
Y+ L W +++ K + R + L P LL
Sbjct: 176 FFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALAPYLDLL 235
Query: 262 AYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
+S + + D ++ + + + + GP N +LL+ YGFV +NP+
Sbjct: 236 NHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANNPHRS 295
Query: 320 LVV 322
+ V
Sbjct: 296 VYV 298
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 222 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 281
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 282 EWDGSKYD 289
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 73/160 (45%), Gaps = 18/160 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 77/202 (38%), Gaps = 58/202 (28%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L L +++E +G S W PY+ L + E+P+ WS ELA L SP A +
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155
Query: 219 ERAEG-------IKREYNELDTVWFMAGSLF----QKVSLARR----------------- 250
RAEG I E + ++F AG + V LA R
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215
Query: 251 ---------------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 289
+VP+ +L ++ A + ++A+ + R AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274
Query: 290 ESIVVWCGPQPNSKLLINYGFV 311
E I+ + GP PN +L YG+
Sbjct: 275 EEILNYYGPLPNGQLCRRYGYT 296
>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3-like [Glycine max]
Length = 449
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 326
D +++V + K + +++ G N L++YGFV NPYD L + AL
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291
Query: 327 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 379
N P + +++Q N V G ++ +L LR+ ++
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351
Query: 380 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
MQ S++ SL P+ E AV L L +P + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 73/160 (45%), Gaps = 18/160 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 66/307 (21%), Positives = 119/307 (38%), Gaps = 76/307 (24%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY--------VAASEDLQAGD 123
S + E++ + KSW+ + G + H +H+ VAA D+ +
Sbjct: 3 SIEPENVANFKSWIAQQG--------------GQIHAGVHFEPVEFGFNVAARSDIPSDA 48
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELAC--LALYLMYEKKQGKKSFWL 177
S+P SL +T + I +LL T + +L C + L+ + E
Sbjct: 49 TVVSIPFSLAITPN--VARHAIKQLLNTEPQNWSERQLECTYIVLHSIVEPIDPSILRHR 106
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTV-- 234
PY+ L + +PL ++E EL+ GS L+R + E+ + +TV
Sbjct: 107 PYLDTLPSPE-----QLRTPLHFTEAELSSFRGSNLFGATLDRKHEWETEWQQCKNTVSA 161
Query: 235 ----WFMAGSLFQKVSLA-----RRFALVPLG--PPLLAYSSKCKAMLAAVD-------- 275
W + + + ++ A R F L P L+ + +L +D
Sbjct: 162 AIAGWGQSFTWEKYLTAATYLSSRAFPSTILSDTPSLVTTETSYPVLLPGIDALNHARGH 221
Query: 276 -------------------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 316
++ LV+ P G ++ GP+PNS+L++ YGF +NP
Sbjct: 222 PVSWVVSAPSQTSSSQRSESSISLVIHTPTPRGSELLNNYGPKPNSELILGYGFSLPNNP 281
Query: 317 YDRLVVE 323
D +V++
Sbjct: 282 DDTIVLK 288
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 88/220 (40%), Gaps = 40/220 (18%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
+LM + + ++ FW PYI+ L G + +PLL+ E + LA+L + A
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192
Query: 219 ERAEGIKREYNELDT------------------VW---------FMA---GSLFQKVSLA 248
R + K Y + + +W F A S+ K+ A
Sbjct: 193 RRLQIWKVNYEKAYSMMQDLGVENARLYTWDLYLWASTIISSRAFTAKVLASVIPKLQTA 252
Query: 249 RRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 305
+ + PL+ ++ K A D++ LVV +AG+ + GP+ N +L+
Sbjct: 253 EEGDRISVLLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNNYGPRNNEQLM 312
Query: 306 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 345
+NYGF DNP + VV + P Q K Q K
Sbjct: 313 MNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|374619603|ref|ZP_09692137.1| acetyl-CoA acetyltransferase [gamma proteobacterium HIMB55]
gi|374302830|gb|EHQ57014.1| acetyl-CoA acetyltransferase [gamma proteobacterium HIMB55]
Length = 385
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 116/284 (40%), Gaps = 49/284 (17%)
Query: 193 AVESPLLWSETELAYLTGSPTKAEIL----ERAEGIKREYNE-----LDTVWFMAGSLFQ 243
A+ SP LA + G+ IL ER + EY++ LDT+ +AG + +
Sbjct: 12 AIRSPTGKRRGSLATVHGADLGGFILKALVERHDIPDDEYDDVVFGCLDTIGPLAGDIAR 71
Query: 244 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 303
LA + V P +C + AV A Q V+ IVV G Q S
Sbjct: 72 TCWLAAGLSDVV---PGTTVDRQCGSSQQAVHFAAQAVMSGTMD----IVVAGGVQTMSS 124
Query: 304 LLIN--------YGFVDEDNPY--DRLVVEAALNTEDPQYQDKRMVAQR--NGKLSVQVF 351
+ I+ YGF D+P+ V E Q++ +M+A + + ++ F
Sbjct: 125 IPISAAMYAGQSYGF---DSPFTGSEGWVARYGTQEVSQFKSAQMIADKWEISREEMEAF 181
Query: 352 HVHAGREKEAISD-------MLPYLRLGYVSDTSEMQSVISSLGPICPVSP--CMERAVL 402
+ + R A +D ++P+ LG+ D + + + ++ + P++P + AV
Sbjct: 182 SLESHRRARAATDAGYFEREIIPFEGLGF--DETIRNTSMEAMAGLEPLAPGGTITAAVS 239
Query: 403 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 446
Q D G A L E L YNL P+ R+A VR +
Sbjct: 240 SQTCD-------GSSAVLIVSEEALKRYNLTPRARIAHMSVRAD 276
>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 510
Score = 42.7 bits (99), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DL+ G+ VP S ++T + L + + + + LS L + L+YE +G
Sbjct: 57 LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116
Query: 172 KKSFWLPYIRELDR 185
+ SFW PY+ L R
Sbjct: 117 QSSFWYPYLMHLPR 130
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/79 (22%), Positives = 43/79 (54%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
R YK E + ++ G + N++ +++ GFV ++N +D L ++ L+ D ++ KR + ++
Sbjct: 96 REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155
Query: 344 GKLSVQVFHVHAGREKEAI 362
+ VF + + + +
Sbjct: 156 KIPTSDVFELKKAPDGDGV 174
>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
Length = 553
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 274 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 318
D+ V +++ KAGE I G NS LL YGF EDNP+D
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHD 363
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 9/68 (13%)
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALN 327
++ +R Y AGE ++ G PN KLL++YGF+ D+D D +++ N
Sbjct: 216 GYEVTANRDYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSN 275
Query: 328 TEDPQYQD 335
T Q QD
Sbjct: 276 TTRDQLQD 283
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 73/160 (45%), Gaps = 18/160 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
Length = 447
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 276 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 333
+A L R Y+ GE +++ G N +LL +YGF+ E+N D++ + E +L + +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272
Query: 334 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 393
+ ++GKLS ++ LRL + + +SV+ + +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317
Query: 394 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 452
S E V+ +++ + L P +++ED + LH ++ +R+E+K A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 37/67 (55%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ ++ + Y+A + + ++ GP PNS+LL YGFV DNP D + + + P Y+ K+
Sbjct: 216 LSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHPLAPFYEQKQ 275
Query: 338 MVAQRNG 344
+ G
Sbjct: 276 KLWASAG 282
>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
rotundata]
Length = 277
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 4/91 (4%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---DPQYQDKRMVA 340
R +K G+ I + GP+PNS ++ GFV D+ +D L E DP ++R +
Sbjct: 80 RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 371
++ F V++GRE S +L Y+R+
Sbjct: 140 KKLHLQPWSEFVVNSGREPIPGS-VLAYMRV 169
>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 57/273 (20%), Positives = 104/273 (38%), Gaps = 47/273 (17%)
Query: 194 VESPLLWSETELAYLTGSPTK-----------AEILERAEGIKREYNELDTVWFMAGSLF 242
V+ PL W + + L G +++ + G ++ + ++ L
Sbjct: 143 VDLPLYWDDRKFKELQGCEEARRAVQHGARFYSQVYQHLFGTNNQFVSAEAFFWAISILM 202
Query: 243 QKVSLARR--FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGP 298
+ + + FAL+P ++ + D V + Y+ GE + + G
Sbjct: 203 SRATSGQNQPFALIPFFD-WFNHADNGDECVQEFDPQKGFTVHTTKAYEPGEQLYINYGS 261
Query: 299 QPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRMVAQR-NGKLSVQV-- 350
N +LL NYGF +NPYD + + + N DP + KR + Q G S +
Sbjct: 262 HSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRGLLQSATGSHSTDIPA 321
Query: 351 -----FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI-----------SSLGPICPVS 394
F+ H G+ L L ++ E+ +I S++ P+S
Sbjct: 322 LRSLRFN-HDGQLAPNAEHWLEIL----LATPEELSEIITQAASQSGAADSTISLALPMS 376
Query: 395 PCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
++ V ++ ARL + +TL ED+A L
Sbjct: 377 --LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407
>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
Length = 504
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 85/193 (44%), Gaps = 20/193 (10%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+VPL L A + + A L D A+ + +P K G+ I G P S LL YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSDLLRRYGYV 289
Query: 312 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 361
DN VVE L N E +Y +++ + ++ + + +++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILEDGYCILRPSAEDS 346
Query: 362 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 417
++D+LP L + E+Q + S P P+ E + L D K++L+ Y
Sbjct: 347 LTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI---LLDSVKSKLSQYG 403
Query: 418 ATLSEDEAMLTDY 430
T+ +D+A+L +
Sbjct: 404 TTVEQDKAILQQF 416
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 42.4 bits (98), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 46/100 (46%), Gaps = 6/100 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL AG +P S + + N TI+ LL ++ + L L +YE +
Sbjct: 40 VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W PY++ + +G ++V P WSE L G+
Sbjct: 96 EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRGT 134
>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
nidulans FGSC A4]
Length = 484
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 48/101 (47%), Gaps = 13/101 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
V A D+ + F++P LV++ N + +LL+ + +L L L +++E Q
Sbjct: 50 VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
G KS W PY + L + ++ + WS EL L GS
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGS 139
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 42.4 bits (98), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 84/200 (42%), Gaps = 8/200 (4%)
Query: 266 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 325
+C A D + ++ + Y+ G+ I ++ G PN++LL YGFV DNP D +
Sbjct: 1088 QCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQ 1146
Query: 326 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 385
+ P Y+ K + G S + A + ++L YLR + D +++ +
Sbjct: 1147 TSPMAPLYEQKERLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTL 1203
Query: 386 SL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQ 441
L G V+ E VL L D + L G+ L + EA L Y A Q
Sbjct: 1204 QLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQ 1263
Query: 442 LVRMEKKMLNACLQVTADMI 461
+ E+ +L + DM+
Sbjct: 1264 VSAGEQGILTRAKKTAEDML 1283
>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
Length = 712
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
RP +AG+ + + GP PN KLL YGFV NP+D +V L + + +++ A
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501
Query: 344 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 384
L ++ H+ ++ +L LRL V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537
>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 661
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 13/114 (11%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517
Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL---FQKVSLARRFALVPLG 257
S + L+RA I E L VW+ G + + LA R V LG
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHIGIGTGDLGLAHRAFKVALG 571
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 42.4 bits (98), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++AE L +L L + +M+E+ G+ S W Y L RG+ + P+ W+
Sbjct: 38 SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNE 230
+L +L G+ + E AE ++ ++NE
Sbjct: 93 QLEHLRGTDLLRHVTEDAESMRLDFNE 119
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 85/189 (44%), Gaps = 36/189 (19%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+ E+ + SFW PYI L P+ + ++ L +P ++ +
Sbjct: 3 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 54
Query: 220 RA----EGIKREYNELDTVWFM----------AGSLFQKVSLA--RRFAL---VPLGPPL 260
R E K +L TV + + SL +S A R F L VP+ PL
Sbjct: 55 RCRFLLEFEKEVQQKLHTVPLVDHPFYGQDVNSSSLGWAMSAASSRAFRLHGEVPMLLPL 114
Query: 261 L-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+ +++ + + + ++D +V+++ ++ K E+I + G PN L++YGFV
Sbjct: 115 IDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFV 174
Query: 312 DEDNPYDRL 320
NPYD++
Sbjct: 175 ITQNPYDQV 183
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 37/173 (21%), Positives = 82/173 (47%), Gaps = 15/173 (8%)
Query: 292 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 351
I++ G + +++ L+ GFV NP++ + + ++ D + + + S +
Sbjct: 58 ILMDYGKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLIL 117
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVL 402
H+ + ++SD + + R+ +V D+ ++ + +S P+CP P ++A+
Sbjct: 118 HITG--DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA 174
Query: 403 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 455
L F+ ++ Y +SEDE NL P +R +L E ++L +C++
Sbjct: 175 -FLIMRFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224
>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 1 [Galdieria
sulphuraria]
Length = 487
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 118/321 (36%), Gaps = 58/321 (18%)
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI--------------LE 219
S W PYI L G + WS +ELA L P E+ E
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEVKINQYYREALYTRVFE 219
Query: 220 RAEGIKREY--NELDTVWFMAGSLFQKVSLA------RRFALVPLGPPL-LAYSSKCKAM 270
R + NE + V+F A + Q + + +AL+P+ L +S+ +
Sbjct: 220 SLSSPVRVWLQNEKENVFFWALDMVQSRAFGIPDVGNKTYALLPMMDMLNHRVNSQTHFL 279
Query: 271 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 330
++ + ++ G I + GP N LL YGF+ +NP D V+
Sbjct: 280 YDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYFQVKDIFQWLH 339
Query: 331 PQYQDKRMVAQ------------------RNGKLSVQVFHVHAGREKEAISDMLPYLRLG 372
Y+ + AQ NGK + ++H H E + I LR+
Sbjct: 340 LMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGK-TFHLYHDHYDDEIDII------LRVF 392
Query: 373 YVSDTSEMQSVISSLGPICPVSPCMERA--VLDQLADYFKARLAGYPATLSEDEAMLTDY 430
S T Q + + + +E V + K L ++ EDE +L +
Sbjct: 393 MASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSVEEDEQLLKNK 452
Query: 431 N-LHPKKRVATQLVRMEKKML 450
+ L K ++A Q R+EKK +
Sbjct: 453 DQLSTKLQLAIQF-RLEKKYI 472
>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
gallopavo]
Length = 241
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 9/103 (8%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + + LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYI 180
L N + E + K +S L L +L+ EK G+KS W PY+
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYL 129
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 80/196 (40%), Gaps = 38/196 (19%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+++ E+ L + +MYEK +G++S W PY+ + PL W E L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232
Query: 212 ----------------PTKAEIL--ERAEGIKREYNELDTVWFMAG-SLFQKVSLA---- 248
PT+ +L E E +E+ EL+ AG L++ + A
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292
Query: 249 -------RRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQ 299
+ A+VP+ L + + L A + ++ R GE +V G
Sbjct: 293 SFILGDDKYQAMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNYGEL 352
Query: 300 PNSKLLINYGFVDEDN 315
N++LL YGFV+ N
Sbjct: 353 SNAELLRGYGFVEARN 368
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 187 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 246
Query: 347 SVQVF 351
S F
Sbjct: 247 SFDFF 251
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 7/188 (3%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ ++ + Y+ G+ I ++ G PN++LL YGFV DNP D + + P Y+ K
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298
Query: 338 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSLGPICPVSP 395
+ G S + A + ++L YLR + +D ++M + + G V+
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRLDAADVADMTLQLLN-GTDGKVND 355
Query: 396 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 453
E VL L D + L G+ L + EA L Y A Q+ E+ +L
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415
Query: 454 LQVTADMI 461
+ DM+
Sbjct: 416 KKTAEDML 423
>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
Length = 496
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 19/121 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
A +DL G F++P +L ++ L + G E L KL + A L L +M+E
Sbjct: 42 ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QGK+S W Y+ L A ++P+ W+E +L+ L G+ ++ + E +R+Y
Sbjct: 98 AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148
Query: 229 N 229
+
Sbjct: 149 D 149
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 229 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 288
Query: 347 SVQVF 351
S F
Sbjct: 289 SFDFF 293
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----------------LS 155
+ A+ DLQ G+A VP L++T + +A L + L
Sbjct: 32 IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
LA +L+ E +G +SFW PY+++L R
Sbjct: 92 PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)
Query: 275 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 326
D+A Q +VV RPY+ GE +++ G N +LL YGFV E N +D ++ AL
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371
Query: 327 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 353
P NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)
Query: 297 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 352
GP N +LL YGFV E+N +DRL ++ +R +VA L+ + +
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429
Query: 353 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 412
H GR A ++ LR+ ++++ E ++ + GP + E V+ +AD +
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484
Query: 413 LAGYPATLSEDEAMLTD 429
L + +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 2/87 (2%)
Query: 243 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 302
++++ + AL P+ L C+ + + DR YK GE + + G N
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251
Query: 303 KLLINYGFVDEDNPYDRLVVEAALNTE 329
L++ YGF E+N +D + ++ + E
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEE 278
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 229 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 288
Query: 347 SVQVF 351
S F
Sbjct: 289 SFDFF 293
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%)
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 346
K GE I + N KLL+ YGFV+E+NP D L++ +D QY + + ++
Sbjct: 229 KPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIK 288
Query: 347 SVQVF 351
S F
Sbjct: 289 SFDFF 293
>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
Length = 573
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%)
Query: 265 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 318
SK + L DD V +V R GE I + GP PN+ LL GF DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 264 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
S +C A+ + + ++ + Y+AG+ + + G PNS+LL YGFV DNP D +
Sbjct: 202 SKQCHVYDASTGN-LSILAGKDYEAGDQVYIHYGSIPNSRLLRLYGFVIPDNPNDSYDLV 260
Query: 324 AALNTEDPQYQDKR 337
A + P ++ K+
Sbjct: 261 LATHPMAPFFEQKQ 274
>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
Length = 510
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 34/66 (51%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
++VPL L A + KC A L ++++ R +GE + G NS+LL YG+V
Sbjct: 238 SMVPLADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYV 297
Query: 312 DEDNPY 317
+ D Y
Sbjct: 298 EWDGSY 303
>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
Length = 305
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 78/200 (39%), Gaps = 37/200 (18%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D+Q G+ F++P S V++ + + +L L A L + ++YE +G S W
Sbjct: 3 DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY L ++ + WS+ ELA L S +I K NE+ T
Sbjct: 61 PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKIG------KDSANEMFTNTI- 106
Query: 238 AGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC- 296
+ L RR A V P A + +LA ++ G +I+ +
Sbjct: 107 -------IPLVRRHASVFFPDPNTAQGASDGDLLALA-----------HRMGSTIMAYAF 148
Query: 297 --GPQPNSKLLINYGFVDED 314
P P SK + G+ +D
Sbjct: 149 DIEPDPASKQVDEEGYASDD 168
>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
Length = 1243
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 66 ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PYI+ L + + + +LW EL + S E +E E K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171
>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
Length = 177
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 260 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
L S CK + AV +++++ R KAG + G PN LL YGFV E+NP+D
Sbjct: 54 FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111
Query: 320 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 357
+ VE E P+ K + +R +S F + R
Sbjct: 112 IPVE----LEVPESPAKVALMERYNLVSHISFELRGFR 145
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 41.6 bits (96), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 84/411 (20%), Positives = 155/411 (37%), Gaps = 75/411 (18%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ F++P ++ T+E + + L + + LS LA+Y+++ +
Sbjct: 52 FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + +++ EL GS A I+ +Y L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160
Query: 232 --------------------DTVWFMAGSLFQKVSL----ARRFALVPLGPPLLAYSSKC 267
D W + + + + L+ +L +SS+
Sbjct: 161 VVRLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFADMLNHSSEV 220
Query: 268 KAMLAA--VDDAVQLVVDRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
+ A + + ++ + Y+AG+ + ++ G PN++LL YGFV NP D
Sbjct: 221 RQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLYGFVMPGNPNDSY 280
Query: 321 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV----SD 376
+ + P ++ KR + G S + ++D LP LGY+ SD
Sbjct: 281 DLVLETHPMAPFFEQKRKLWDLAGFDSTSTISI-------TLTDPLPKNVLGYLRIQRSD 333
Query: 377 TSEMQSVISSLGPICP----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 432
S++ S+ I P +S E VL L + F L + L E L + +
Sbjct: 334 ESDLASIARQ--RIDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE-GV 390
Query: 433 HPKKR---VATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-----PCPAP 475
+P + A + E+++L + DM+ + + + P PAP
Sbjct: 391 YPSRGNAWAAAHVSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPAPAP 441
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 41.6 bits (96), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 64/136 (47%), Gaps = 15/136 (11%)
Query: 84 WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
W+ K+G L V+ P E+ + A++ ++ G + ++P SL +T L
Sbjct: 18 WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71
Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ IA+ + E +AL +++E+ QG+ S P+I L ++ G+L E PL
Sbjct: 72 SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126
Query: 200 WSETELAYLTGSPTKA 215
W E +L S T+
Sbjct: 127 WGEADLTLADASSTRG 142
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 97/228 (42%), Gaps = 47/228 (20%)
Query: 267 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
C + D+ + + V + KA + + G N+ LL YGF++ DNP D L VE +L
Sbjct: 310 CGDIAEEDDEHLYIKVVKAVKANQEVYNTYGDHSNATLLSKYGFIEMDNPCDNLPVEKSL 369
Query: 327 -----------NTEDPQYQDKRM--------VAQRNGKLSVQVFHVHAGREKEA------ 361
N D KR+ + RN S+++ +GR +A
Sbjct: 370 VDTNLISLCKENGFDSNELSKRISFYASLFDIDSRNTH-SIEI----SGRLDDALVCSVG 424
Query: 362 -----ISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCMERAVLDQLADYFKARLA 414
+S+ +L++ +++ L I + +++A++ L + +L+
Sbjct: 425 IALAPLSEFEGWLKMS----EHKLEKYFEKLEAEDIVKQNAQVKKAIVQILNN----KLS 476
Query: 415 GYPATLSEDEAMLTDY--NLHPKKRVATQLVRMEKKMLNACLQVTADM 460
YP TL +D+ L + N +K ++T L EKK++ ++ D+
Sbjct: 477 NYPTTLEQDQNKLKELKENEENRKIISTSLNICEKKLIYKSIKYYEDL 524
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 11/99 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ +++DL+ + +P +++++ +I+ +LT + A+ L+YE G+
Sbjct: 82 IISNKDLKVNNIVAKIPKDIILSIHT----SSISNILTKYTMERNIATAIALIYEASIGE 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W YI L L V+ P+LW + L G+
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGT 169
>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
Length = 524
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 98/228 (42%), Gaps = 32/228 (14%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
A+VP+ L A A L ++ ++++ RP K GE I G PN++LL YG
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335
Query: 311 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 351
VD + NP D L+V A L+T+D + + + + F
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGDDVFDLYF 395
Query: 352 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 411
+ E ++ +RL + D E + + P P M+ L L + +
Sbjct: 396 DL------EIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445
Query: 412 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 458
RL YP ++ +DE +L T +L+ + + +L EKK+L+ L TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491
>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
Length = 494
Score = 41.6 bits (96), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L A +SKC A L +++++ R + E + G PNS++L YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279
Query: 312 DED-NPYD 318
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
Length = 546
Score = 41.6 bits (96), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 13/114 (11%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402
Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL---FQKVSLARRFALVPLG 257
S + L+RA I E L VW+ G + + LA R V LG
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHIGIGTGDLGLAHRAFKVALG 456
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 66/148 (44%), Gaps = 15/148 (10%)
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 341
G I + GP N +LL YGFV E N +DRL E + E + +R +VA
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418
Query: 342 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 401
L+ + + H GR A+ +L ++++ E + + GP + E V
Sbjct: 419 YGLHLAGRCWICHDGRPPPALVALLRV----HLAEAEEFDT-MERKGPFASLGAGTEARV 473
Query: 402 LDQLADYFKARLAGYPATLSEDEAMLTD 429
+ +AD + L + +L EDE +L +
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLEN 501
>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 544
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 13/114 (11%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400
Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL---FQKVSLARRFALVPLG 257
S + L+RA I E L VW+ G + + LA R V LG
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHIGIGTGDLGLAHRAFKVALG 454
>gi|357137816|ref|XP_003570495.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 485
Score = 41.6 bits (96), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 102/274 (37%), Gaps = 45/274 (16%)
Query: 80 DLKSWMHKNG---LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
DL W+ + G P +V H E + A D+ G +P + + L
Sbjct: 55 DLLRWVQREGGFVHPALRV-----SDHPEYGLGVSATGADGDIPPGTVLIDLPGRIPLRL 109
Query: 137 ERVLGN-ETIAELLTTNKLSELACLALYL-MYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
R G + + L EL + L L + +++ SFW PYI L
Sbjct: 110 RRPAGAADAVLMQLADQVPEELWAMRLGLRLLQERTKSDSFWWPYIANLPE-------TF 162
Query: 195 ESPLLWSETELAYLTGSPTKAEILERA----EGIKREYNELDTVWFM----------AGS 240
P+ + ++ L +P +I +R E K ++L TV + S
Sbjct: 163 TVPIFFPGEDIKNLQYAPLLHQINKRCRFLLEFEKEVQHKLGTVPLADHPFCGQDVNSSS 222
Query: 241 LFQKVSLA--RRFALVPLGPPLLAYSSKCKAMLA------------AVDDAVQLVVDRPY 286
L +S A R F L P LL C + + D +V++V D
Sbjct: 223 LGWAMSAASSRAFRLHGEIPMLLPLVDMCNHSFSPNARIVQDGDVDSPDMSVKVVADTQI 282
Query: 287 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
++ + G PN L++YGFV NPYD++
Sbjct: 283 DQNATVTLNYGCYPNDFYLLDYGFVVTSNPYDQV 316
>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 478
Score = 41.6 bits (96), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 60/131 (45%), Gaps = 15/131 (11%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ +G P KV K + + V A D+ G+ F++P V++ + N
Sbjct: 22 SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ +LL+ + +L L L ++YE G +S W Y + L R+ ++ + W
Sbjct: 76 SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128
Query: 201 SETELAYLTGS 211
S +EL L GS
Sbjct: 129 SPSELQELQGS 139
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 39/64 (60%), Gaps = 8/64 (12%)
Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+T+ +++ A +AL+L++E Q +KS W P++ L R VE+PLLW+ ELA
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241
Query: 208 LTGS 211
L GS
Sbjct: 242 LEGS 245
>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
[Brachypodium distachyon]
Length = 495
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
D+ +++++ R AGE + G N+ LL YGF + DNPYD + ++ L T+
Sbjct: 251 DEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFTELDNPYDIVNIDLTLVTK 305
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 266 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 325
+C A D + + + Y+ G+ + ++ GP PN++LL YGFV DNP D +
Sbjct: 214 QCHAYDVKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLT 272
Query: 326 LNTEDPQYQDKR 337
+ P Y+ K+
Sbjct: 273 THPMAPFYEQKQ 284
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
+ YK GE +++ G PN +LL+ YGFVD+ N D E L
Sbjct: 350 KSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLL 392
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 244 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 303
K++ A +A+ P S C+ A ++ DR Y+AGE + V GP N
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247
Query: 304 LLINYGFVDEDNPYD 318
LL+ YGF+ + N D
Sbjct: 248 LLVEYGFLLDANSND 262
>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
Length = 486
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 275 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 27/49 (55%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
+Q P K G + + GP N++LL+ YG+ ++DNPY +E L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488
>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
lyrata]
Length = 471
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)
Query: 272 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 321
A + V++V + K + +++ G N L++YGFV E NPYD +
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311
Query: 322 VEAALNTEDPQYQ-----DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 376
A P++ ++++Q N + V G + +L +R+ +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371
Query: 377 TSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
E++ + SL I P+ E AV + L+ +P + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426
>gi|189193345|ref|XP_001933011.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187978575|gb|EDU45201.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 642
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 84/202 (41%), Gaps = 52/202 (25%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP-------TK 214
L L+ ++ +G+ S W YI L R + +PL + E ++A+L G+ K
Sbjct: 99 LLLIEQRDKGQASPWHAYIACLPNSRD-----MTTPLWFDEGDMAFLAGTSLVPAAKERK 153
Query: 215 AEILERAEGIKREYNEL--------DT---VWFMAGSLFQKVSLARRFALVPLGPPLLAY 263
AE+ ++ EG EL DT +W A ++F +R F + +L
Sbjct: 154 AELQQQWEGAIAVMEELSIPLAKGIDTESLLW--AATIFT----SRAF----ISTHILPE 203
Query: 264 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWC----------------GPQPNSKL 304
+ VD +V V+ ++ G+S + C P+ N +L
Sbjct: 204 RETVPILFPVVDILNHSVSAKVEWDFQPGQSFALKCLDGDSFQPEQELFNNYAPKQNDEL 263
Query: 305 LINYGFVDEDNPYDRLVVEAAL 326
L+ YGF E+NP ++ ++ A
Sbjct: 264 LLGYGFCLENNPIEQFALKLAF 285
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 72/168 (42%), Gaps = 14/168 (8%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V G+R + + +L W+H G+ I + S + V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427
Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
G +P L++ + L + I +L+ ++ + L LY+++EK G S W
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
P+ L SPLL+ TEL L G+ E E +G++
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLR 528
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A+ +++ GD VP L ++ E ++I + ++ L ++ L ++L+ E +
Sbjct: 98 LIANTEIKEGDEFIKVPLKLFMSQETAF--KSIGDKVSREPLFKMLPNMLLVIHLIQETQ 155
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++SFW PYIR L + + ++ L ++ E L GSP E + R+Y
Sbjct: 156 KQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSPVLEESINTYRNTLRQY 207
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R +P CK +A+ +VQ DR Y GE + V GP N LL YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235
Query: 310 FVDEDNPYDRLVVEAAL 326
F+ + N +D + ++ +
Sbjct: 236 FILDTNRWDEVYLDEVI 252
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 50/102 (49%), Gaps = 15/102 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ + F++P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
G +S W Y + L R+ ++ + W+ +EL L GS
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGS 139
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 77/189 (40%), Gaps = 9/189 (4%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+VPL L A + + A L ++A+ + +P GE I G P + LL YG+V
Sbjct: 233 GMVPLADLLNADADRNNARLFQDENALVMKAIKPIAKGEEIFNDYGEIPRADLLRRYGYV 292
Query: 312 -DEDNPYDRLVVE-------AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 363
D PYD + V A L+ DP+ Q L +E + ++
Sbjct: 293 TDNYAPYDVVEVSLDVICKAAGLSDSDPEKQPPLEFLDELELLDDGYVIPRPSQEDDQLT 352
Query: 364 DMLP-YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 422
D+LP L + + T + + P P A LA + + A Y T+++
Sbjct: 353 DILPDELIILLRTLTLSPEQLAQQRSKNKPPKPAFAEAEATILAKAIQLKQAQYATTIAQ 412
Query: 423 DEAMLTDYN 431
D+ +L+ N
Sbjct: 413 DQEILSQLN 421
>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
Length = 263
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 33/63 (52%)
Query: 267 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
C + D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL
Sbjct: 15 CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74
Query: 327 NTE 329
T+
Sbjct: 75 VTK 77
>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
Length = 699
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)
Query: 275 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
D+A Q +VV R AG+ +++ G N +LL +YGFV +DNP+D ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 92/246 (37%), Gaps = 47/246 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
V L GD ++P SL++T VL + I + L +LS L ++L+ E+
Sbjct: 8 VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G SFW PY+ L + L W+ E+ L TK + + +N
Sbjct: 67 KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118
Query: 230 ELDTVWF-------------MAGSLFQKVSLARRFALVPLGPP---LLAYSSKCKAMLAA 273
L + LF+ + V + P +L+ + K+ LA
Sbjct: 119 RLCNGFLPLLVRQMPQFNGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKSALAP 178
Query: 274 VDDAVQLVVD-----------RPYKAG--------ESIVVWCGPQPNSKLLINYGFVDED 314
D + VD + YK + + + GP N KLL+ YGF
Sbjct: 179 FLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLEYGFTLPC 238
Query: 315 NPYDRL 320
NP++ +
Sbjct: 239 NPHNNI 244
>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
CCMP526]
Length = 265
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)
Query: 286 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 344
YK GE + G + N++LL+ YGF DN ++ + + P +Q G
Sbjct: 18 YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77
Query: 345 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 394
+ GR + D+L L + T SE+ + +L C ++
Sbjct: 78 SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129
Query: 395 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 427
PC ER L + A LA +P ++ EDE L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172
>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 485
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/136 (24%), Positives = 59/136 (43%), Gaps = 15/136 (11%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEI 217
+ EL L+GS +I
Sbjct: 130 DEELRELSGSAVLNKI 145
>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
V ASEDL + +P SL+++ ++ + I E+ TN+ +E L Y
Sbjct: 46 VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
+ EKK+G++SF+ PYI+ + Q +A +E PL+ E +L G +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162
Query: 214 KAE-ILERAEGI 224
KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174
>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 485
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/136 (24%), Positives = 59/136 (43%), Gaps = 15/136 (11%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ + P KV K K + + A +D+ + F++P +LV+ + N
Sbjct: 23 WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76
Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ +LL N+ CL + ++YE QG S W Y + L ++ + W+
Sbjct: 77 RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129
Query: 202 ETELAYLTGSPTKAEI 217
+ EL L+GS +I
Sbjct: 130 DEELRELSGSAVLNKI 145
>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
Length = 789
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 49/235 (20%), Positives = 96/235 (40%), Gaps = 32/235 (13%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T + + + K++E + + W+ NG+ V + P + + +AA +D+
Sbjct: 28 TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84
Query: 122 GDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLMYEKKQGKKS 174
A VP L+++ E + N IA E+ ++ +E + ++ +E +G+ S
Sbjct: 85 QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PY + ++ + P+LWS+ E+ + +I + K EY +
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEY---ENE 188
Query: 235 WFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 289
W + F K + P ++ + + A D A Q VV R + G
Sbjct: 189 WKLVYEAFSKDETYDEYY------PGISNPENKEMIRAFYDRAYQSVVTRCFGWG 237
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
Length = 1916
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 276 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 321
DA + R Y AG+ + G + N++L+ NYGF++ NP+D V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339
>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
Length = 497
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIREL 183
K S W PY+ L
Sbjct: 115 KTSRWHPYLMHL 126
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 54/261 (20%), Positives = 99/261 (37%), Gaps = 48/261 (18%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + L + + LS LA Y+++ +
Sbjct: 34 FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + ++E EL TG+ A + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142
Query: 232 --------------------------DTVWFMAGSLFQKVSLARRFALVPLGPPLLAYSS 265
TVW A + R L P +L +SS
Sbjct: 143 VVRLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRL-LAPFAD-MLNHSS 200
Query: 266 KCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ A + + ++ + YKAG+ + ++ G PN++LL YGF+ NP D +
Sbjct: 201 DVRQCHAYDPLSGNLSILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPNDNYELV 260
Query: 324 AALNTEDPQYQDKRMVAQRNG 344
+ P ++ K + + G
Sbjct: 261 LETHPMAPFFEQKHKLWESAG 281
>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
Length = 685
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
++ P+ L A A L +++ +P AGE I PNS LL YG
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427
Query: 311 VDEDNPYD------RLVVEAALN 327
VDE N D +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 35/60 (58%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ ++ + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 215 LSVLAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
Length = 475
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIREL 183
K S W PY+ L
Sbjct: 115 KTSRWHPYLMHL 126
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L + + C A L + +++ +P +AGE + G PNS++L YG+V
Sbjct: 221 SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEHPNSEILRRYGYV 280
Query: 312 D 312
+
Sbjct: 281 E 281
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 9/150 (6%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 343
R Y+ G+ + ++ G NS+LL YGFV DNP D + ++ P Y+ K QR
Sbjct: 223 RDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMAPLYEQK----QRL 278
Query: 344 GKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCMER 399
KL+ ++ + + +L YLR+ + D S++ ++ + +S E
Sbjct: 279 WKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIATESYTKISDENES 337
Query: 400 AVLDQLADYFKARLAGYPATLSEDEAMLTD 429
+L L+ +A L G+ +L + E L +
Sbjct: 338 QILLFLSQSIEALLEGFEISLEKLETQLAE 367
>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 356
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R A++P+ S CK A ++ +V DR Y G+ + G N LL YG
Sbjct: 176 RLAMLPVADLFNHASVGCKVSYCA--ESYDIVADREYGTGDEVCTCYGEHSNDFLLAEYG 233
Query: 310 FVDEDNPYDRLVVEAALNTE 329
F+ ++N DR + +++E
Sbjct: 234 FLLQNNTNDRFDPDDLISSE 253
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)
Query: 279 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 332
L D Y+AGE I + G N+KLL+ YGF NP D VE + T DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333
Query: 333 YQDKRMVAQR-----NGKLSVQVFHV 353
+ R+ ++ + +L Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 51/243 (20%), Positives = 96/243 (39%), Gaps = 43/243 (17%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 64 ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGDC 123
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR------- 226
S W PY+ + P+ W E E+ +L +P + + LE+ ++
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQKTEVQELHTESLA 175
Query: 227 -----------------EYNELDTVWFMAG--SLFQKVSLARRFA----LVPLGP--PLL 261
Y+ L W +++ K + R + L P LL
Sbjct: 176 FFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALAPYLDLL 235
Query: 262 AYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 319
+S + + D ++ + + + + GP N +LL+ YGFV +NP+
Sbjct: 236 NHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANNPHRS 295
Query: 320 LVV 322
+ V
Sbjct: 296 VYV 298
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 229 ALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 288
Query: 312 DEDNPYDRLVVEAALNTED 330
D++N D + + L+ D
Sbjct: 289 DDNNLKDYVNIRVGLSLTD 307
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 34/61 (55%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+++PL L + + +C A L D +++ +P K GE + G PN+++L YG+V
Sbjct: 215 SMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNAEILRRYGYV 274
Query: 312 D 312
+
Sbjct: 275 E 275
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 40.4 bits (93), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ + + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 227 LSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 286
>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
Length = 494
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 37/63 (58%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
++VPL L A + + A L +++++V +P K G+ + + G PN+++L YG+V
Sbjct: 226 SMVPLADTLNADTKQFNAHLVYDKESLKMVSVKPIKMGQQVYNFYGEHPNAEILRRYGYV 285
Query: 312 DED 314
+ D
Sbjct: 286 EWD 288
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 23/95 (24%), Positives = 44/95 (46%), Gaps = 3/95 (3%)
Query: 248 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKL 304
AR + L P + +C + + D A Q +KAGE ++ G + N+
Sbjct: 339 ARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECTAQEAFKAGEQFFIYYGDRSNADR 398
Query: 305 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 339
L+++GF+D N D + + L+ DP + + ++
Sbjct: 399 LVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433
>gi|116200882|ref|XP_001226253.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
gi|88175700|gb|EAQ83168.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
Length = 400
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 53/234 (22%), Positives = 87/234 (37%), Gaps = 33/234 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A L+ G VP LV +L V E I+ L + +S A LA L + G
Sbjct: 55 MVAHRRLKRGQKILRVPTQLVHSLHTV--PERISGRLPPD-MSIHALLAANLTVDGMAGL 111
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W + L G P +W + EL L P + + ++ + R++N++
Sbjct: 112 -STWKDSLPTL------GDFNTGLPFMWHK-ELQELLPKPARELLKKQQDSFHRDWNKVA 163
Query: 233 TV------------WFM--------AGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLA 272
WF+ A +K R A+VP+ + C+
Sbjct: 164 KAFPDLRQDDYLHSWFVINTRTFYYATPRTEKYPPVDRLAIVPIADFFNHADTGCEVTFD 223
Query: 273 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
D + DR Y + + + G N LL YGF+ N +D + V+ +
Sbjct: 224 K--DGFIVSADRDYHGDQEVYISYGAHTNDFLLAEYGFLPAANRWDEVCVDEVI 275
>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 165
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
V A E++ A ++PN+L+++ V +E L L + ++ LALYL
Sbjct: 51 VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
M EK +G+ SFW PY+ Q+A ES L W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143
>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
Length = 321
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)
Query: 57 VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
V++ D A + ++ +ED W NG + K+ +K + V
Sbjct: 50 VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
E ++ G+ S P +L + + + + + + L +K + L++++E K GK
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162
Query: 174 SFWLPYIRELDRQ 186
SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 37/79 (46%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P K + AAV ++ +AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 312 DEDNPYDRLVVEAALNTED 330
D +N D + + L+ D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397
>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 705
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
+AA+ED+ A +PN ++++L ++ E + +++ N +E +A+Y
Sbjct: 21 IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+++EK +G+KSF+ PY + R +E L E+ Y
Sbjct: 80 VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 34/60 (56%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ + + Y+AG+ + + GP PNS+LL YGFV NP D + + + + P ++ K+
Sbjct: 215 LSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 318
V+L+ R +GE I + G N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378
Query: 312 DEDNPYDRLVVEAALNTED 330
D+ N D + + L+ D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 7/74 (9%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
A LA+ LM + G + W Y L AV+S ++WS+ EL L GS +
Sbjct: 47 ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99
Query: 218 LERAEGIKREYNEL 231
+ R + KREY+ L
Sbjct: 100 VFRRDLCKREYDAL 113
>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
Length = 424
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 42/197 (21%), Positives = 76/197 (38%), Gaps = 33/197 (16%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 210
LS ++L+L E +GK+S+W P+I+ L + SP LW + EL
Sbjct: 114 LSSFQIMSLFLELESSRGKESWWDPFIQMLPTIND----FLTSPFLWQIQGKYELIEKLP 169
Query: 211 SPTKAEILERAEGIKREYNELDTV-------------------WFMAGS--LF----QKV 245
T+ L+ + ++ + T+ W S L+ QK
Sbjct: 170 KSTQKHSLKMFNRFESDFKAVKTLLETHNASKDIINHDKFVLYWMCINSRCLYMEIPQKK 229
Query: 246 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 305
+ + F + P + +S+ + L ++ YK + + + GP N LL
Sbjct: 230 TTSDNFTMAPY-VDFINHSTNDQCKLKIDRTGFHVITTSNYKENDELYLSYGPHSNEFLL 288
Query: 306 INYGFVDEDNPYDRLVV 322
YGF +N ++ L +
Sbjct: 289 CEYGFHLSNNEWNDLDI 305
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
+VPL L A + + A L +D++ + +P AG+ I G P S LL YG+V
Sbjct: 234 GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGDEIFNDYGELPRSDLLRRYGYV 293
Query: 312 -DEDNPYDRLVVEAALN 327
D PYD V+EA+L+
Sbjct: 294 TDNYAPYD--VIEASLD 308
>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
Length = 1017
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/88 (25%), Positives = 38/88 (43%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P S + + V AGE ++ G + N++ L+N GFV
Sbjct: 802 ALIPFWDMANHGSGRITTFFDSTAGEVSCNAQAACSAGEQFFIYYGDRTNTEFLVNNGFV 861
Query: 312 DEDNPYDRLVVEAALNTEDPQYQDKRMV 339
D DN D + + L+ DP + + ++
Sbjct: 862 DPDNRNDYVNIRLGLSPTDPLAEKRAII 889
>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
Length = 163
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 6/91 (6%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ + A E ++ G VP L+++ + +E + L+E L L+L+ E+
Sbjct: 77 YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
G +SFW PY+ L Q PLLW
Sbjct: 137 GSRSFWAPYLDTLP------QDMSFHPLLWG 161
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 2/87 (2%)
Query: 260 LLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 317
+L +SS+ K + A + ++ + Y+AG+ + + GP PN++LL YGFV NP
Sbjct: 195 MLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVPGNPN 254
Query: 318 DRLVVEAALNTEDPQYQDKRMVAQRNG 344
D + A + P ++ K+ + G
Sbjct: 255 DSYDLVLATHPMAPFFKQKQKLWASAG 281
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 30/47 (63%)
Query: 277 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
++ L+ G+ + GP+PNS+L+++YGF +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255
>gi|50554725|ref|XP_504771.1| YALI0E34397p [Yarrowia lipolytica]
gi|49650640|emb|CAG80378.1| YALI0E34397p [Yarrowia lipolytica CLIB122]
Length = 492
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 43/221 (19%), Positives = 86/221 (38%), Gaps = 26/221 (11%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
++P L++ E+ + A + + + + EK +G SF+ PYI +L ++
Sbjct: 44 NIPTKLIINHEKAVKEFGAASSTFSTVADKQSLTKYFFAVEKNKGSDSFFHPYISQLPKK 103
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV---------WFM 237
V +PL ++ + L G+ + ++ E ++E+ +L +
Sbjct: 104 -------VTTPLYFTPEQQESLVGTNLEFYKNDKTELWEKEFKKLQQFVKTSVTLEDYLY 156
Query: 238 AGSLFQKVSLARRF---------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 288
A ++F S R L+P+ L+ + DA
Sbjct: 157 ASTIFTSRSFPERLMDPKNEDLSMLIPV-LDLINHKPLTAVEWNVTGDAFAFKACSDVVK 215
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
G + G + N +LL YGF +DN +D + ++A + E
Sbjct: 216 GSQVFNNYGSKGNEELLGAYGFALKDNEFDTMPLKAVIKGE 256
>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 225
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R L P+ L +++ +A + + DR Y GE +++ G N LL+ YG
Sbjct: 182 RMVLQPVAD-LFNHAADGGCEVAFTPASFAITADRAYAEGEEVLICYGRHSNDFLLVEYG 240
Query: 310 FVDEDNPYDRLVVEAAL 326
FV E N +D + ++ A+
Sbjct: 241 FVLEQNRWDEVGLDEAV 257
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 65/291 (22%), Positives = 113/291 (38%), Gaps = 72/291 (24%)
Query: 128 VPNSLVVTLERVLG----NETIAELL-TTNKLSELACLALYLMYEKKQ------GKKSF- 175
+P LV++ E V ++ +LL S + LYL+ Q G ++F
Sbjct: 57 IPRDLVLSAEAVEEYAKVDQNFKQLLDVAGHQSTRGDIMLYLLTHLVQSKATSPGTRAFA 116
Query: 176 ---WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 231
W YIR L R + P +W+ E L G+ +A + + + EY++L
Sbjct: 117 STPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSSEYDKLC 169
Query: 232 ---------DTV---------WFMAGSLFQK--VSLARR-FALVPLGPPLLAYSSKCKAM 270
T+ W +A + ++ + L R A+VP G + +S A
Sbjct: 170 EEASALSFWSTLLSESATLEDWVLADAWYRSRCLELPRAGHAMVP-GLDMANHSQSHSAY 228
Query: 271 LAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
D +++ RP AG I + G +P +++L +YGF+D+D+ L +
Sbjct: 229 YDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFIDKDSTVKELTLHLEA 288
Query: 327 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 377
+DP L FH++ G P +RL ++D
Sbjct: 289 LPDDP--------------LGRAKFHIYKGP---------PTVRLSIINDN 316
>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
Length = 854
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
V A E +Q G+ + N V+ L L + + + N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571
Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++ S + P++ +L P+L S+ +L +L GS E+ + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626
Query: 228 Y 228
+
Sbjct: 627 F 627
>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 476
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 104/239 (43%), Gaps = 20/239 (8%)
Query: 250 RFALVPLGPPLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 308
+ A+VPL +L + + C A L + + P AG + G NS+LL Y
Sbjct: 235 KIAMVPLAD-MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGHELYNTYGDLSNSELLRKY 293
Query: 309 GFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----REKEA 361
GF+D+ N ++ + + + E + ++ M A + FH+ A +E EA
Sbjct: 294 GFIDDVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQELEA 353
Query: 362 ISDML-----PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 416
+L L + D E++S +++L V+ C R V + L + + R Y
Sbjct: 354 SIALLFQSPKQVRALRALDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRAEEY 407
Query: 417 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 475
T EDE L + +L ++++A ++ E+ +L+ + + + P + PAP
Sbjct: 408 TTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEPAP 466
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
RP G + + GP PNSKLL+ YGF DNP D +
Sbjct: 273 RPCARGRQLFLSYGPYPNSKLLLFYGFALPDNPVDEV 309
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 29/46 (63%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ LV+ P G ++ GP+PN++L++ YGF +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295
>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
Length = 253
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V A++ + AG A ++P L++T++ L + E L L E L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555
Query: 171 GKKSFWLPYI 180
G+ S W P++
Sbjct: 556 GQASPWYPFL 565
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 36/79 (45%)
Query: 252 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 311
AL+P K + AAV ++ AGE ++ G + N+ LL++ GFV
Sbjct: 323 ALIPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV 382
Query: 312 DEDNPYDRLVVEAALNTED 330
D +N D + + L+ D
Sbjct: 383 DVNNLKDYVNIRVGLSPTD 401
>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
Length = 441
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 8/117 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL 281
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 74/175 (42%), Gaps = 16/175 (9%)
Query: 253 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 312
L+PL +L + K + + L+ G + GP+ N +LL+ YGFV
Sbjct: 237 LIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVFNNYGPKGNEELLMGYGFVI 295
Query: 313 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRL 371
+NP D L ++ ++ P+ Q ++ QR K + +VFH+ + + +P L
Sbjct: 296 PENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFHLTKSADSGQKTSTVPALES 352
Query: 372 GY-----------VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 415
+ V++ +E+ + + P+S E AV L K +LA
Sbjct: 353 DWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAVALGLKAAIKQKLAA 407
>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
Length = 417
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 86/391 (21%), Positives = 160/391 (40%), Gaps = 63/391 (16%)
Query: 76 EDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+DL K WM G+ P ++ E + + A+ ++ GD VP ++
Sbjct: 9 DDLVTFKQWMDDEGIYLNPSLDIVKLEDYGRS--------IIANTLIKEGDVLIRVPRNV 60
Query: 133 VVT---LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
+++ +E + E I ++ +N+ + A+YLMY K S+W Y L +Q
Sbjct: 61 MMSRTGIELHIPKE-IRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQ- 117
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQKVSL 247
+ + + + E+ L S + R GI+R YN + F+K
Sbjct: 118 ------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYNVIFKKLSSLNDEFKKKEY 171
Query: 248 -------------ARRFAL-------VPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRP 285
+R F+L VPL A K K+ + A QL+ +
Sbjct: 172 TFELFKWALSCIWSRAFSLSSDDGGMVPLADMFNAIE-KAKSKVRPDSRADQLIYYASKD 230
Query: 286 YKAGESIVVWCGPQP---NSKLLINYGFVDEDNPYDRLVVEAALN--TEDPQYQDKRMVA 340
+ GE + G N+++L++YGF D+P + ++ L+ ++D Y D ++
Sbjct: 231 IERGEQVFTPYGVYKTIGNAQMLMDYGFA-FDDPSEGDTIQLTLDNFSDDELYIDTKIDL 289
Query: 341 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ----SVISSLGPICPVSPC 396
+ V+ F++ + + ++L Y R+ + + +E+Q + PVS
Sbjct: 290 LEQLDI-VREFNL---KRNQLPQELLIYARVKNLKE-NELQLAKEHYRNDDNRNKPVSRR 344
Query: 397 MERAVLDQLADYFKARLAGYPATLSEDEAML 427
E+ L L++Y L Y TLS+D +L
Sbjct: 345 NEKTALRYLSNYLSRYLDSYETTLSDDLELL 375
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 236 FMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIV 293
F+ LF+K RR+ + P+ S K ++ +A L D+ +G+ +
Sbjct: 323 FVLPKLFKK----RRYVICPMIDMANHQSVKFAGQVSFEYFANAYSLATDQAIPSGDEVY 378
Query: 294 VWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
+ GP+ N +LL YGFV+ +NP D V+
Sbjct: 379 ISYGPRSNDQLLQYYGFVERNNPNDVYVM 407
>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 513
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 58/122 (47%), Gaps = 17/122 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A +D+ G FSVP SL ++ L +++G E + L NK L L +M+E+
Sbjct: 34 ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ S W Y L R A ++P+ W+ EL L G+ +I + E +R+Y
Sbjct: 91 RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141
Query: 230 EL 231
E+
Sbjct: 142 EI 143
Score = 38.5 bits (88), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 84/222 (37%), Gaps = 19/222 (8%)
Query: 251 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 310
A+VP+ L A A L + +V +P + GE I G PNS LL YG
Sbjct: 240 IAMVPMADMLNAQYGSENAKLFYEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGH 299
Query: 311 VD----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 360
VD E NP D + + A L R+ A+ + K + + G +
Sbjct: 300 VDLVPLPGVPWKEGNPADVVEIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVF 358
Query: 361 AIS------DMLPYLRLGYVSDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARL 413
+ D++ + E + S P P + + + + RL
Sbjct: 359 VLGTDLELPDVMISFLKLLLLSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRL 418
Query: 414 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACL 454
A YP TL DEA+L+ P +VR+ EK +L C+
Sbjct: 419 AKYPTTLEHDEALLSGQTSLPYNVRNAIIVRIGEKHILVGCM 460
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 289 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
GE +V+ G + N +LL +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233
>gi|407394128|gb|EKF26797.1| hypothetical protein MOQ_009497 [Trypanosoma cruzi marinkellei]
Length = 547
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 51/114 (44%), Gaps = 13/114 (11%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 352 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 403
Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL---FQKVSLARRFALVPLG 257
S + L+RA E L VW+ G + + LA R V LG
Sbjct: 404 YSSQYDIALSCLQRAIATSTEDETLADVWYNIGHIGIGTGDLGLAHRAFKVALG 457
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 39.3 bits (90), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 107/284 (37%), Gaps = 72/284 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + IA L L + +MYE+ +GK
Sbjct: 10 VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE- 230
S W Y++ L Q P LWSE E+ L G+ + E +K ++ E
Sbjct: 66 GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEEN 118
Query: 231 ------LDTVWFMAGSLFQKVSLARR---------------FALVPLGPPLLAYSSKCKA 269
D + F A + LA + + +VPL ++ K A
Sbjct: 119 IAPLTKEDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHGYGMVPLAD---LFNHKTDA 175
Query: 270 -----MLAA-------------VDDA---------------VQLVVDRPYKAGESIVVWC 296
ML A +DD +++V+ + AG I
Sbjct: 176 EDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAAGSEIFNTY 235
Query: 297 GPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 338
G N+ LL YGF + +NP+D L ++ L ++Q KR+
Sbjct: 236 GQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKKRV 279
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 39.3 bits (90), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
A++L+ + G++SFW PYI+ L + A+ PLLW E++L +L G+ + + +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168
Query: 220 RAEGIKREYNELDTV 234
+ + +KR ++T+
Sbjct: 169 KVDHVKRWTEAMETL 183
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 39.3 bits (90), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 110/281 (39%), Gaps = 55/281 (19%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LK++ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 30 HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T +S L L +L+ EK G KS W PY+ L + +
Sbjct: 90 IRS-YLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140
Query: 198 LLWSETELAYLTGSPTKAEILER----------------------AEGIKREYNELDTVW 235
L+ E E+ L P + + E+ E +K ++ +W
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTEDVKHVFHYHAFLW 200
Query: 236 ----------FMAGSLFQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--QLV 281
+M + QK L+ + L P LL +S + A ++ ++
Sbjct: 201 AWCTINTRTVYMKHA--QKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYEIR 258
Query: 282 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
K E + + GP N +LL+ YGFV +NP+ + +
Sbjct: 259 TTSHCKKFEELFICYGPHDNHRLLLEYGFVASNNPHSAVYI 299
>gi|403376910|gb|EJY88443.1| hypothetical protein OXYTRI_16493 [Oxytricha trifallax]
Length = 787
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 48/235 (20%), Positives = 95/235 (40%), Gaps = 32/235 (13%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T + + +++E + + W+ NG+ V + P + + +AA +D+
Sbjct: 28 TFIHHEKTNKLRQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84
Query: 122 GDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLMYEKKQGKKS 174
A VP L+++ E + N IA E+ ++ +E + ++ +E +G+ S
Sbjct: 85 QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PY + ++ + P+LWS+ E+ + +I + K EY +
Sbjct: 144 FWYPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEY---ENE 188
Query: 235 WFMAGSLFQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 289
W + F K + P ++ + + A D A Q VV R + G
Sbjct: 189 WKLVYEAFSKDETYDEYY------PGISNLENKEMIRAFYDRAYQSVVTRCFGWG 237
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 320
RP +AG + + GP N+KLL+ YGF DNP D +
Sbjct: 390 RPCEAGNQLFLSYGPYSNAKLLLFYGFAVRDNPADEV 426
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 58/148 (39%), Gaps = 25/148 (16%)
Query: 78 LGDLKSWMHKNGLPPCKVILK--EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
L SW +G ++ E P H A +D+ G FS+P L ++
Sbjct: 10 LAPFASWFESHGGVVASHLMDVAELPGHGRG------AFALQDIHEGTTLFSLPRELTLS 63
Query: 136 LERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
L T+ LL ++ E L L +M+E+ +G +S W Y+ L
Sbjct: 64 LR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEESRGVESKWDVYLSSLPS---- 115
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEI 217
++P+ WS +L L G+ +I
Sbjct: 116 ---TFDTPMFWSAEDLEELKGTAVPDKI 140
>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
Length = 495
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|403366800|gb|EJY83208.1| hypothetical protein OXYTRI_19172 [Oxytricha trifallax]
Length = 869
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 75/170 (44%), Gaps = 18/170 (10%)
Query: 271 LAAVDDAVQLVVDRPYKAG---ESIVVWC-GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 326
++ V D ++ R Y G S V C G N ++L YGF N Y+ + ++ L
Sbjct: 454 VSQVPDDFNFII-RTYNDGFPKGSQVFLCYGRMSNREMLKRYGFCLTYNKYNYIFIKLRL 512
Query: 327 NTEDPQYQDKRMVAQR-------NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 379
+DP + ++ V ++ K+ + H +K + +L ++++ Y + +
Sbjct: 513 EQQDPDFIYRKYVLRKFFSIEPETDKMDISSRHFRIYFQK-LNTKVLKFIKILYFNVQED 571
Query: 380 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 429
S I + S +E +L D ++ L +P T+ ED+ +L++
Sbjct: 572 DISCI-----VETRSLSLEYLAFQRLRDVYETFLKSFPTTIGEDKKILSE 616
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 1/74 (1%)
Query: 250 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 309
R AL+P+ + C+ + + +V DR YK GE + + N L+ YG
Sbjct: 179 RLALIPVADLFNHAAGGCRVYYSP-EGCYHVVADRAYKKGEELFISYSSHSNDYNLLEYG 237
Query: 310 FVDEDNPYDRLVVE 323
F+ ++N D + ++
Sbjct: 238 FIPDENSLDDVYID 251
>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 241
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ +P+ + +L L GSP L+ I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226
>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
Length = 495
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 31/55 (56%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
D+ ++++V R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309
>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
Length = 216
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ ++ + +VP S+ +T E+V N T S +L+L+ +K GK
Sbjct: 22 VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KSF+ PYI L ++ + + L + E ++ Y+ + + + ER ++ ++++L
Sbjct: 77 KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 30/49 (61%)
Query: 275 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
D AV L++ P G ++ GP+PN++L++ YGF NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323
>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
Length = 429
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 30/59 (50%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 342
R YK GE I + GP PN L + YGF E N D + ++ + + + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322
>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
Length = 339
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 273 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 99 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 110/281 (39%), Gaps = 55/281 (19%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LKE+ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 29 HKLEFIELRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T T +S L L +L+ E G KS W PY+ L + +
Sbjct: 89 IKS-YLGDYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TC 139
Query: 198 LLWSETELAYLTGSPTKAEILER----------------------AEGIKREYNELDTVW 235
L+ E ++ L P K + E+ E +K ++ +W
Sbjct: 140 LVCLEPQVVRLLPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTEDVKHIFHYHAFLW 199
Query: 236 ----------FMAGSLFQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDR 284
+M + QK L+ + L P L L S + AA ++ + R
Sbjct: 200 AWCTINTRTVYMKHA--QKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYEIR 257
Query: 285 PY---KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 322
K E + + GP N +LL+ YGFV +NP+ + V
Sbjct: 258 TTSSCKKYEELFICYGPHDNHRLLLEYGFVAINNPHSAVYV 298
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 280 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 322
+ DR YKAGE + V G N LL+ YGF+ + N P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250
>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
Length = 490
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 45/100 (45%), Gaps = 8/100 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL AGD VP S ++ R G IA LL + L +A L + +YE+ G
Sbjct: 36 VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGS 211
+S W + L E P WS E A L+G+
Sbjct: 92 QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGT 128
>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 297
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++AG+ VP LV+T+E N + L + +++ + LA +L+ ++
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170
Query: 172 KKSFWLPYIREL 183
SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182
>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 429
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)
Query: 284 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 342
+ Y+ GE +++ G N +L+ YGFVD DN D E ++ Y KR +
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305
Query: 343 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 391
+LS + H A E ISD +L LR V T E + +
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362
Query: 392 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 451
ER D + ++RLA YP T+ EDE L N +R A Q +K++L
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420
Query: 452 ACLQV 456
++
Sbjct: 421 DASRI 425
>gi|428183324|gb|EKX52182.1| hypothetical protein GUITHDRAFT_150712, partial [Guillardia theta
CCMP2712]
Length = 205
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 15/116 (12%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
++D+++ S+P+ L L R + +KL +AL ++YEK + ++SF
Sbjct: 88 TQDVKSNSVVCSIPSKLF--LSRSTTRAAFGSM--ADKLDVRTAMALQILYEKSKKEESF 143
Query: 176 WLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
W +++ L DR+ + +P LW E + L G+ + E E K Y++
Sbjct: 144 WCEWLKVLPDREN------LGTPCLWPEDDQNLLKGT----SVFEEVEASKSLYSK 189
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 63/155 (40%), Gaps = 12/155 (7%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ ++ + Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K
Sbjct: 215 LSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKH 274
Query: 338 MVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICP 392
+ G S + ++D LP YLR+ + + +
Sbjct: 275 KLWVSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEK 327
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
VS E +L L + F L G+ L + E L
Sbjct: 328 VSDSNEVEILQFLVESFGHLLGGFGTPLEKLEEQL 362
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 49/101 (48%), Gaps = 13/101 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
+ A ++ + F++P +LV++ + N + +LL + L CL L ++YE Q
Sbjct: 50 IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
G S W Y + L + ++ + W++ EL L+GS
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGS 139
>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
Length = 289
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 273 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 47 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 63/155 (40%), Gaps = 12/155 (7%)
Query: 278 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 337
+ ++ + Y+ G+ + + G PNS+LL YGFV NP D + + + + P Y+ K
Sbjct: 224 LSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKH 283
Query: 338 MVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICP 392
+ G S + ++D LP YLR+ + + +
Sbjct: 284 KLWVSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEK 336
Query: 393 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 427
VS E +L L + F L G+ L + E L
Sbjct: 337 VSDSNEVEILQFLVESFGHLLGGFGTPLEKLEEQL 371
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 38.5 bits (88), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 59/255 (23%), Positives = 100/255 (39%), Gaps = 43/255 (16%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + LS L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER----------------- 220
PY+ L + A P+ E E+ L P KA+ E+
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179
Query: 221 -----AEGIKREYNELDTVWFMAG----SLFQKVSLARRFALVP----LGP--PLLAYS- 264
+E ++ ++ +W +++ K R F+ P L P LL +S
Sbjct: 180 LRPLFSEAVESIFSYSAFLWAWCTVNTRAVYMKPRRRRCFSAEPDTYALAPYLDLLNHSP 239
Query: 265 -SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 323
+ +A ++ + E + + GP N +LL+ YGFV NP+ + V
Sbjct: 240 DVQVRAGFNEETRCYEIRTVSSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPHACVYVS 299
Query: 324 AALNTEDPQYQDKRM 338
+ + DK+M
Sbjct: 300 KDILVKYLPSTDKQM 314
>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 147
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIREL 183
+G S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124
>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
Length = 386
Score = 38.5 bits (88), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 37/65 (56%), Gaps = 3/65 (4%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSF 175
+++ GD S+P S ++T VL N + + + K LS L L ++L+ E+ +G+ S
Sbjct: 65 NVKPGDMLISLPESCLLTTSTVL-NSYLGSFIKSWKPHLSPLLALCVFLVCERHRGEASD 123
Query: 176 WLPYI 180
W PYI
Sbjct: 124 WFPYI 128
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 38.5 bits (88), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++A+ L +L L + +M E+ G +S W Y L RG L P+ W+E
Sbjct: 85 SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNE 230
L L G+ + E AE ++ +Y+E
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDE 166
>gi|311031127|ref|ZP_07709217.1| D-alanyl-alanine synthetase A [Bacillus sp. m3-13]
Length = 356
Score = 38.5 bits (88), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 75/174 (43%), Gaps = 19/174 (10%)
Query: 35 IRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCK 94
++ N GSS+ + + KNR + + + + +V + E +++ + N P C
Sbjct: 176 VKPANLGSSVGISKAKNREELETAFREAFDYDRKIIVEQGLEGAREIEIGVLGNDEPECS 235
Query: 95 VILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNK 153
VI + P K Y A ED GD A +P + + E + E+ +T K
Sbjct: 236 VIGEIAP----KTEFYDYKAKYED---GDTAMIIPAEISEDI-----YEQVKEMAITAFK 283
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+ A L + K+GK YI E++ G ++ PLLW T+++Y
Sbjct: 284 AIDGAGLVRADFFLTKEGKL-----YINEVNTMPGFTPFSM-FPLLWQHTDVSY 331
>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 510
Score = 38.1 bits (87), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 92 PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
P +L+ P R Y+ A E+++ G S+P S V+++E + LL
Sbjct: 88 PMSTVLQ--PEGFNFGRGTAYITA-ENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEV 143
Query: 152 N-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L G
Sbjct: 144 NPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMNELEG 198
Query: 211 SPTKAEILERAEGIKREYNEL 231
S + L RA+ ++ Y+ L
Sbjct: 199 SQLQRFTLGRAQAVEAFYDAL 219
>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 10/143 (6%)
Query: 90 LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL 149
L P +L+ P R Y+ +E+++ G S+P S V+++E + LL
Sbjct: 16 LAPMSTVLQ--PEGFNFGRGTAYIT-TENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLL 71
Query: 150 TTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L
Sbjct: 72 EVNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMKGL 126
Query: 209 TGSPTKAEILERAEGIKREYNEL 231
GS + L RA+ + Y+ L
Sbjct: 127 EGSQLQRFTLGRAQAVDAFYDAL 149
>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
gi|194701488|gb|ACF84828.1| unknown [Zea mays]
gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
Length = 495
Score = 38.1 bits (87), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 273 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 329
A D+ +++++ R GE + G N+ LL YGF + DN YD + ++ AL T+
Sbjct: 253 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 309
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,234,549,345
Number of Sequences: 23463169
Number of extensions: 289157682
Number of successful extensions: 803412
Number of sequences better than 100.0: 740
Number of HSP's better than 100.0 without gapping: 333
Number of HSP's successfully gapped in prelim test: 407
Number of HSP's that attempted gapping in prelim test: 802105
Number of HSP's gapped (non-prelim): 1238
length of query: 481
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 335
effective length of database: 8,933,572,693
effective search space: 2992746852155
effective search space used: 2992746852155
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)